You can add a new dictionary. The dictionary shall be in text, csv or tsv format.
It must contain the following columns with headings:
['language', 'entry', 'main', 'second', 'label', 'dictionary', 'homonym']
If there are any lines with "Nan" in the columns: 'language', 'entry', 'main' or 'second', they should be removed from the file.
Besides, you need to apply drop_duplicates or any other tool to remove the complete double lines, which have the same information in every column.
It should contain list of lines with tabulation separated fields as follows:
- Language - name of the language according to Glottolog
- Entry - the lexeme
- Main - first meaning
- Second - second meaning
- Label - 1) grammatical labels of the 1st meanings 2) labels for the 2nd meaning
- not mandatory
- Dictionary - bibliographical data about the source
- Homonym - homonymity ("1" if the word is considered by the dictionary as a homonym, "-" elsewhere)
language | entry | main | second | label | dictionary | homonym |
Abaza | мцыра | пустой порожний | свободный незанятый | 1) прил. 2) перен. | Адзинов Абазинско-русский словарь | - |