- Data collection
  2.1 Build the web crawler
  2.2 Chrome configuration
  2.3 Selenium configuration
  2.4 Identifying data by XPath
  2.5 Dealing with exceptions

      while True:
          try:
              browser = ....
              pending_number = ....
              if pending_number == 0:
                  break
          except Exception:
              # Shut the browser down and start the attempt again
              browser.close()
              browser.quit()
              continue
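The retry loop in 2.5 can be sketched in a form that runs without a real browser. This is only an illustration under stated assumptions: `FakeBrowser`, `crawl`, `flaky_fetch` and `max_retries` are hypothetical names that do not come from the original crawler, and the stand-in class replaces a Selenium WebDriver. The sketch also guards against the case where the browser fails to start, in which `browser` would not yet be defined when the handler runs.

```python
class FakeBrowser:
    """Hypothetical stand-in for a Selenium WebDriver so the sketch runs offline."""

    def __init__(self):
        self.closed = False

    def quit(self):
        self.closed = True


def crawl(pages, open_browser, fetch_page, max_retries=3):
    """Fetch every page, restarting the browser after each failure."""
    pending = list(pages)  # pages still to be fetched
    results = []
    failures = 0
    while pending:
        browser = None
        try:
            browser = open_browser()
            while pending:
                results.append(fetch_page(browser, pending[0]))
                pending.pop(0)  # only drop a page once it succeeded
        except Exception:
            failures += 1
            if failures > max_retries:
                raise  # give up instead of looping forever
        finally:
            # Release the browser whether or not the attempt succeeded
            if browser is not None:
                browser.quit()
    return results


# Simulated flaky page: the second fetch call raises once.
calls = {"n": 0}


def flaky_fetch(browser, page):
    calls["n"] += 1
    if calls["n"] == 2:
        raise RuntimeError("simulated timeout")
    return page.upper()


result = crawl(["a", "b", "c"], FakeBrowser, flaky_fetch)
print(result)
```

With a real Selenium driver, `quit()` ends the whole session, so a separate `close()` call before it is not strictly needed. Capping the number of retries is a design choice added here so that a permanent failure does not loop indefinitely.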
- Data cleaning
  df = df.drop(["Unnamed: 0", "letter"], axis=1)
- Data processing
  i. df.to_csv(path, encoding="utf-8-sig", index=False)
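The cleaning and export steps can be tried together on a small table. This is a minimal sketch, assuming pandas is installed; the sample rows, the `ru`/`en` column names and the temporary file path are hypothetical and only stand in for the crawled data.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample rows; the real data comes from the crawler.
df = pd.DataFrame(
    {
        "Unnamed: 0": [0, 1],  # leftover index column from an earlier CSV export
        "letter": ["А", "Б"],
        "ru": ["абзац", "база данных"],
        "en": ["paragraph", "database"],
    }
)

# drop() takes the labels as a single list, not as separate positional arguments.
df = df.drop(["Unnamed: 0", "letter"], axis=1)

path = os.path.join(tempfile.mkdtemp(), "terms.csv")

# utf-8-sig writes a byte order mark at the start of the file;
# index=False keeps the row index out of the output.
df.to_csv(path, encoding="utf-8-sig", index=False)

with open(path, "rb") as fh:
    raw = fh.read()

has_bom = raw.startswith(b"\xef\xbb\xbf")
round_trip = pd.read_csv(path, encoding="utf-8-sig")
print(has_bom, list(round_trip.columns))
```

An `Unnamed: 0` column typically appears when a CSV was written with its index and then read back, so writing with `index=False` avoids having to drop it again later. The byte order mark helps spreadsheet tools that otherwise guess the wrong encoding for Cyrillic or Chinese text.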
- Data conversion
  i. Use MultiTerm Convert to convert the CSV file with the Spreadsheet option
  ii. Create a termbase in MultiTerm
  iii. Import the termbase definition file created by MultiTerm Convert
  iv. Import the XML file created by MultiTerm Convert
Russian-English and Russian-Chinese termbases for Trados have been created and used for translation.