Commit 41443d2

update to rev34; support BCP47 language variants
1 parent 5a97d4e commit 41443d2

11 files changed

+438
-229
lines changed

AUTHORS

+1 -1
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
1-
XDXF format - Sergei Snegov, Leonid Soshinskiy [https://github.com/soshial]
1+
XDXF format specification - Sergei Snegov, Leonid Soshinskiy [https://github.com/soshial]
2 2
makedict (Deprecated) - Evgeniy Dushistov <[email protected]>, kubtek [https://github.com/kubtek]

CHANGELOG

-3
This file was deleted.

CHANGELOG.md

+19
@@ -0,0 +1,19 @@
1+
### Changelog (rev. 34) 2022-01-20
2+
* since rev. 34 the format is only semantic and cannot store any presentational or visual data
3+
* the language code limitation is removed: all languages that exist in BCP47 standard are supported (use http://schneegans.de/lv/?tags=hy-Latn-IT-arevela for validation)
4+
* multilingual dictionaries are now supported: a dictionary may declare several source and several target languages. The `<k>` and `<def>` tags may also carry an `xml:lang` attribute
5+
* description supports line breaks
6+
* a transcription (`<tr>`) may now appear directly inside the `<def>` tag
7+
8+
### Changelog (rev. 33)
9+
* `<deftext>` introduced in order to fix multiple errors in DTD scheme
10+
* `<rref>` tag: added `lctn` and `type` attributes, links are not stored inside the tag anymore
11+
* `<kref>` tag: `idref` attribute introduced
12+
* `<c>` tag: added necessary hash sign # in attribute
13+
* `<categ>` tag now is a list of `<kref>` tags
14+
* `<etm>` may now have `<mrkd>` inside to mark etymological ancestors/cognates
15+
* `<dtrn>` may now contain `<kref>` tag(s) inside
16+
* `<u>` tag for underlined text introduced
17+
* `<br/>` tag introduced for newlines inside articles
18+
* `<ex>` now might have `<iref>` tag inside
19+
* `<ex>`, `<tr>`, `<co>` tags now may have user-set attribute values
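
The rev. 34 changelog entries on BCP 47 tags and multilingual dictionaries can be illustrated with a minimal Python sketch. It builds a `<languages>` block using the `<from>`/`<to>` elements and the `xml:lang` attribute that this commit adds to the DTD. The helper name `build_languages` and the `en`/`ru` tags are assumptions chosen for illustration; `hy-Latn-IT-arevela` is the tag from the validator link in the changelog. The sketch does not validate the tags themselves.

```python
import xml.etree.ElementTree as ET

# Clark-notation name of the predefined xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_languages(sources, targets):
    """Return a <languages> element with one <from>/<to> child per tag.

    Hypothetical helper: it only arranges the elements, it does not
    check that the tags are valid BCP 47.
    """
    if not sources or not targets:
        raise ValueError("the DTD requires at least one <from> and one <to>")
    languages = ET.Element("languages")
    for tag in sources:
        ET.SubElement(languages, "from", {XML_LANG: tag})
    for tag in targets:
        ET.SubElement(languages, "to", {XML_LANG: tag})
    return languages


if __name__ == "__main__":
    block = build_languages(["en"], ["ru", "hy-Latn-IT-arevela"])
    print(ET.tostring(block, encoding="unicode"))
```

ElementTree serializes the attribute with the reserved `xml:` prefix, so no namespace declaration is added to the output.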

README.md

+4 -3
@@ -11,11 +11,12 @@ Moreover, the format has many tags that are specific to dictionaries: etymologie
11 11

12 12
### Any drawbacks?
13 13
1. **XML parsing speed**. Some object to XML for storing dictionaries because large XML files are costly to hold and parse in RAM. The XDXF schema and the structure of any dictionary allow all word articles to be stored on disk and located with hash tables. Some dictionary software applies this approach quite efficiently (for example, see [GoldenDict](http://goldendict.org/)).
14-
2. **Editor software**. Although, there is no software that allows editing dictionaries at the moment, XDXF is a more or less human-readible XML, that is quite easy to edit manually in a text editor even without prior knowledge of the format specifications.
14+
2. **Editing software**. Although no software currently allows editing dictionaries, XDXF is reasonably human-readable XML that is quite easy to edit manually in a text editor, even without prior knowledge of the format specification.
15 15

16 16
### Which dictionary software supports XDXF?
17-
* [Goldendict](https://github.com/goldendict/goldendict) (Win, Linux, MacOS, Android)
18-
* (please send me other examples, that I dont know of)
17+
* [Goldendict](https://github.com/goldendict/goldendict) (Win, Linux, MacOS)
18+
* [Alpus](https://alpusapp.com/) (Win, Linux, MacOS, Android, iOS)
19+
* (please send me other examples that are not listed here)
19 20

20 21
## What was `makedict`?
21 22
In the beginning of the project a converter was written to facilitate conversions to and from XDXF (`dictd/dsl/sdict/stardict/xdxf → dictd/stardict/xdxf`).
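
The "XML parsing speed" point in the README hunk above mentions keeping articles on disk and finding them through hash tables. A minimal Python sketch of that idea follows, assuming articles are never nested and that a regular expression is good enough to find `<ar>` boundaries. The names `build_index` and `lookup` are hypothetical and are not taken from GoldenDict or any other reader.

```python
import io
import re
import xml.etree.ElementTree as ET

# Matches one whole <ar>...</ar> article; the DTD does not nest articles.
ARTICLE_RE = re.compile(rb"<ar\b.*?</ar>", re.DOTALL)


def build_index(data: bytes) -> dict:
    """Map every headword to the (offset, length) of its article in data."""
    index = {}
    for match in ARTICLE_RE.finditer(data):
        article = ET.fromstring(match.group(0))
        for k in article.findall("k"):
            # itertext() flattens inline markup such as <sub> inside <k>.
            headword = "".join(k.itertext()).strip()
            # Assumption: a later article with the same headword wins.
            index[headword] = (match.start(), match.end() - match.start())
    return index


def lookup(fileobj, index: dict, headword: str):
    """Read and parse only the article for headword, or return None."""
    span = index.get(headword)
    if span is None:
        return None
    offset, length = span
    fileobj.seek(offset)
    return ET.fromstring(fileobj.read(length))


if __name__ == "__main__":
    raw = (b"<lexicon><ar><k>disc</k><k>disk</k>"
           b"<def><deftext>A flat, circular plate.</deftext></def></ar></lexicon>")
    idx = build_index(raw)
    found = lookup(io.BytesIO(raw), idx, "disk")
    print(found.find("def/deftext").text)
```

The index is built in one pass. After that, a lookup seeks to one article and parses only those bytes, so the whole dictionary never has to be held as a parsed tree.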

format_standard/images/clickable_categories.png

100755 → 100644
File mode changed.

format_standard/images/dtrn_tag_tooltip.png

100755 → 100644
File mode changed.

format_standard/xdxf_description.md

+43 -171
Large diffs are not rendered by default.
+119
@@ -0,0 +1,119 @@
1+
<!ELEMENT xdxf (meta_info,lexicon)>
2+
<!ATTLIST xdxf format (visual|logical) "logical">
3+
<!ATTLIST xdxf revision CDATA #REQUIRED>
4+
5+
<!ELEMENT meta_info (title,full_title,description,publisher?,authors?,file_ver,creation_date,last_edited_date,dict_edition?,publishing_date?,dict_src_url?,abbreviations?)>
6+
<!ELEMENT title (#PCDATA)>
7+
<!ELEMENT full_title (#PCDATA)>
8+
<!ELEMENT description (#PCDATA)>
9+
<!ELEMENT publisher (#PCDATA)>
10+
<!ELEMENT authors (author+)>
11+
<!ELEMENT author (#PCDATA)>
12+
<!ATTLIST author role CDATA #IMPLIED>
13+
<!ELEMENT file_ver (#PCDATA)>
14+
<!ELEMENT creation_date (#PCDATA)>
15+
<!ELEMENT last_edited_date (#PCDATA)>
16+
<!ELEMENT dict_edition (#PCDATA)>
17+
<!ELEMENT publishing_date (#PCDATA)>
18+
<!ELEMENT dict_src_url (#PCDATA)>
19+
<!ELEMENT abbreviations (abbr_def+)>
20+
<!ELEMENT abbr_def (abbr_k+,abbr_v)>
21+
<!ATTLIST abbr_def type (stl|grm|aux|knl|oth) #IMPLIED>
22+
<!ELEMENT abbr_k (#PCDATA)>
23+
<!ELEMENT abbr_v (#PCDATA)>
24+
25+
<!ELEMENT lexicon (ar+)>
26+
<!ELEMENT ar (k+,def)>
27+
<!ATTLIST ar f (v|l) "l">
28+
<!ELEMENT k (#PCDATA|opt|sup|sub)*>
29+
<!ATTLIST k id ID #IMPLIED>
30+
<!ELEMENT opt (#PCDATA|sup|sub)*>
31+
<!ENTITY % style "c|sup|sub|i|b|u">
32+
<!ENTITY % ref "kref|rref|iref">
33+
<!ELEMENT def (gr?,co*,(def+|deftext),ex*,sr?,etm?,categ*)>
34+
<!ELEMENT deftext (#PCDATA|tr|dtrn|abbr|co|di|%ref;|%style;|br)*>
35+
<!ATTLIST def id ID #IMPLIED>
36+
<!ATTLIST def cmt CDATA #IMPLIED>
37+
<!ATTLIST def freq CDATA #IMPLIED>
38+
<!ELEMENT sr (kref+)>
39+
<!ELEMENT etm (#PCDATA|tr|abbr|co|di|mrkd|%ref;|%style;|br)*>
40+
<!ELEMENT categ (kref+)>
41+
<!ELEMENT gr (#PCDATA|tr|abbr|co|di|%ref;|%style;)*>
42+
<!ELEMENT tr (#PCDATA)>
43+
<!ATTLIST tr format (IPA|X-SAMPA|erkIPA|CDATA) "IPA">
44+
<!ELEMENT dtrn (#PCDATA|kref)*>
45+
<!ELEMENT kref (#PCDATA|%style;)*>
46+
<!ATTLIST kref idref IDREF #IMPLIED>
47+
<!ATTLIST kref type (syn|ant|hpr|hpn|par|spv|mer|hol|ent|rel|etm) #IMPLIED>
48+
<!ATTLIST kref kcmt CDATA #IMPLIED>
49+
<!ELEMENT rref (#PCDATA)>
50+
<!ATTLIST rref start CDATA "0">
51+
<!ATTLIST rref size CDATA #IMPLIED>
52+
<!ATTLIST rref lctn CDATA #IMPLIED>
53+
<!ATTLIST rref type CDATA #IMPLIED>
54+
<!ELEMENT iref (#PCDATA|%style;)*>
55+
<!ATTLIST iref href CDATA #REQUIRED>
56+
<!ELEMENT abbr (#PCDATA)>
57+
<!ELEMENT ex (ex_orig+,ex_tran*,iref*)>
58+
<!ATTLIST ex type (exm|phr|prv|oth|PCDATA) "exm">
59+
<!ATTLIST ex source CDATA #IMPLIED>
60+
<!ATTLIST ex author CDATA #IMPLIED>
61+
<!ELEMENT ex_orig (#PCDATA|mrkd|co|%ref;|%style;|br)*>
62+
<!ELEMENT ex_tran (#PCDATA|mrkd|co|%ref;|%style;|br)*>
63+
<!ELEMENT mrkd (#PCDATA|kref|%style;)*>
64+
<!ELEMENT co (#PCDATA|co|tr|abbr|di|%ref;|%style;|br)*>
65+
<!ATTLIST co type CDATA #IMPLIED>
66+
<!ELEMENT i (#PCDATA|%style;)*>
67+
<!ELEMENT b (#PCDATA|%style;)*>
68+
<!ELEMENT u (#PCDATA|%style;)*>
69+
<!ELEMENT c (#PCDATA|%style;)*>
70+
<!ATTLIST c c CDATA #IMPLIED>
71+
<!ELEMENT sup (#PCDATA)>
72+
<!ELEMENT sub (#PCDATA)>
73+
<!ELEMENT di (#PCDATA)>
74+
<!ELEMENT br EMPTY>
75+
76+
77+
<!ATTLIST xdxf lang_from (
78+
AAR|ABK|ACE|ACH|ADA|ADY|AFA|AFH|AFR|AIN|AKA|AKK|ALB|ALE|ALG|ALT|AMH|ANG|APA|ARA|ARC|ARG|ARM|ARN|
79+
ARP|ART|ARW|ASM|AST|ATH|AUS|AVA|AVE|AWA|AYM|AZE|BAD|BAI|BAK|BAL|BAM|BAN|BAQ|BAS|BAT|BEJ|BEL|BEM|
80+
BEN|BER|BHO|BIH|BIK|BIN|BIS|BLA|BNT|BOS|BRA|BRE|BTK|BUA|BUG|BUL|BUR|BYN|CAD|CAI|CAR|CAT|CAU|CEB|
81+
CEL|CHA|CHB|CHE|CHG|CHI|CHK|CHM|CHN|CHO|CHP|CHR|CHU|CHV|CHY|CMC|COP|COR|COS|CPE|CPF|CPP|CRE|CRH|
82+
CRP|CSB|CUS|CZE|DAK|DAN|DAR|DAY|DEL|DEN|DGR|DIN|DIV|DOI|DRA|DSB|DUA|DUM|DUT|DYU|DZO|EFI|EGY|EKA|
83+
ELX|ENG|ENM|EPO|EST|EWE|EWO|FAN|FAO|FAT|FIJ|FIL|FIN|FIU|FON|FRE|FRM|FRO|FRY|FUL|FUR|GAA|GAY|GBA|
84+
GEM|GEO|GER|GEZ|GIL|GLA|GLE|GLG|GLV|GMH|GOH|GON|GOR|GOT|GRB|GRC|GRE|GRN|GUJ|GWI|HAI|HAT|HAU|HAW|
85+
HEB|HER|HIL|HIM|HIN|HIT|HMN|HMO|HSB|HUN|HUP|IBA|IBO|ICE|IDO|III|IJO|IKU|ILE|ILO|INA|INC|IND|INE|
86+
INH|IPK|IRA|IRO|ITA|JAV|JBO|JPN|JPR|JRB|KAA|KAB|KAC|KAL|KAM|KAN|KAR|KAS|KAU|KAW|KAZ|KBD|KHA|KHI|
87+
KHM|KHO|KIK|KIN|KIR|KMB|KOK|KOM|KON|KOR|KOS|KPE|KRC|KRO|KRU|KUA|KUM|KUR|KUT|LAD|LAH|LAM|LAO|LAT|
88+
LAV|LEZ|LIM|LIN|LIT|LOL|LOZ|LTZ|LUA|LUB|LUG|LUI|LUN|LUO|LUS|MAC|MAD|MAG|MAH|MAI|MAK|MAL|MAN|MAO|
89+
MAP|MAR|MAS|MAY|MDF|MDR|MEN|MGA|MIC|MIN|MIS|MKH|MLG|MLT|MNC|MNI|MNO|MOH|MOL|MON|MOS|MUL|MUN|MUS|
90+
MWL|MWR|MYN|MYV|NAH|NAI|NAP|NAU|NAV|NBL|NDE|NDO|NDS|NEP|NEW|NIA|NIC|NIU|NNO|NOB|NOG|NON|NOR|NSO|
91+
NUB|NWC|NYA|NYM|NYN|NYO|NZI|OCI|OJI|ORI|ORM|OSA|OSS|OTA|OTO|PAA|PAG|PAL|PAM|PAN|PAP|PAU|PEO|PER|
92+
PHI|PHN|PLI|POL|PON|POR|PRA|PRO|PUS|QAA-QUE|RAJ|RAP|RAR|ROA|ROH|ROM|RUM|RUN|RUP|RUS|SAD|SAG|SAH|
93+
SAI|SAL|SAM|SAN|SAS|SAT|SCC|SCN|SCO|SCR|SEL|SEM|SGA|SGN|SHN|SID|SIN|SIO|SIT|SLA|SLO|SLV|SMA|SME|
94+
SMI|SMJ|SMN|SMO|SMS|SNA|SND|SNK|SOG|SOM|SON|SOT|SPA|SRD|SRR|SSA|SSW|SUK|SUN|SUS|SUX|SWA|SWE|SYR|
95+
TAH|TAI|TAM|TAT|TEL|TEM|TER|TET|TGK|TGL|THA|TIB|TIG|TIR|TIV|TKL|TLH|TLI|TMH|TOG|TON|TPI|TSI|TSN|
96+
TSO|TUK|TUM|TUP|TUR|TUT|TVL|TWI|TYV|UDM|UGA|UIG|UKR|UMB|UND|URD|UZB|VAI|VEN|VIE|VOL|VOT|WAK|WAL|
97+
WAR|WAS|WEL|WEN|WLN|WOL|XAL|XHO|YAO|YAP|YID|YOR|YPK|ZAP|ZEN|ZHA|ZND|ZUL|ZUN) #REQUIRED>
98+
99+
<!ATTLIST xdxf lang_to (
100+
AAR|ABK|ACE|ACH|ADA|ADY|AFA|AFH|AFR|AIN|AKA|AKK|ALB|ALE|ALG|ALT|AMH|ANG|APA|ARA|ARC|ARG|ARM|ARN|
101+
ARP|ART|ARW|ASM|AST|ATH|AUS|AVA|AVE|AWA|AYM|AZE|BAD|BAI|BAK|BAL|BAM|BAN|BAQ|BAS|BAT|BEJ|BEL|BEM|
102+
BEN|BER|BHO|BIH|BIK|BIN|BIS|BLA|BNT|BOS|BRA|BRE|BTK|BUA|BUG|BUL|BUR|BYN|CAD|CAI|CAR|CAT|CAU|CEB|
103+
CEL|CHA|CHB|CHE|CHG|CHI|CHK|CHM|CHN|CHO|CHP|CHR|CHU|CHV|CHY|CMC|COP|COR|COS|CPE|CPF|CPP|CRE|CRH|
104+
CRP|CSB|CUS|CZE|DAK|DAN|DAR|DAY|DEL|DEN|DGR|DIN|DIV|DOI|DRA|DSB|DUA|DUM|DUT|DYU|DZO|EFI|EGY|EKA|
105+
ELX|ENG|ENM|EPO|EST|EWE|EWO|FAN|FAO|FAT|FIJ|FIL|FIN|FIU|FON|FRE|FRM|FRO|FRY|FUL|FUR|GAA|GAY|GBA|
106+
GEM|GEO|GER|GEZ|GIL|GLA|GLE|GLG|GLV|GMH|GOH|GON|GOR|GOT|GRB|GRC|GRE|GRN|GUJ|GWI|HAI|HAT|HAU|HAW|
107+
HEB|HER|HIL|HIM|HIN|HIT|HMN|HMO|HSB|HUN|HUP|IBA|IBO|ICE|IDO|III|IJO|IKU|ILE|ILO|INA|INC|IND|INE|
108+
INH|IPK|IRA|IRO|ITA|JAV|JBO|JPN|JPR|JRB|KAA|KAB|KAC|KAL|KAM|KAN|KAR|KAS|KAU|KAW|KAZ|KBD|KHA|KHI|
109+
KHM|KHO|KIK|KIN|KIR|KMB|KOK|KOM|KON|KOR|KOS|KPE|KRC|KRO|KRU|KUA|KUM|KUR|KUT|LAD|LAH|LAM|LAO|LAT|
110+
LAV|LEZ|LIM|LIN|LIT|LOL|LOZ|LTZ|LUA|LUB|LUG|LUI|LUN|LUO|LUS|MAC|MAD|MAG|MAH|MAI|MAK|MAL|MAN|MAO|
111+
MAP|MAR|MAS|MAY|MDF|MDR|MEN|MGA|MIC|MIN|MIS|MKH|MLG|MLT|MNC|MNI|MNO|MOH|MOL|MON|MOS|MUL|MUN|MUS|
112+
MWL|MWR|MYN|MYV|NAH|NAI|NAP|NAU|NAV|NBL|NDE|NDO|NDS|NEP|NEW|NIA|NIC|NIU|NNO|NOB|NOG|NON|NOR|NSO|
113+
NUB|NWC|NYA|NYM|NYN|NYO|NZI|OCI|OJI|ORI|ORM|OSA|OSS|OTA|OTO|PAA|PAG|PAL|PAM|PAN|PAP|PAU|PEO|PER|
114+
PHI|PHN|PLI|POL|PON|POR|PRA|PRO|PUS|QAA-QUE|RAJ|RAP|RAR|ROA|ROH|ROM|RUM|RUN|RUP|RUS|SAD|SAG|SAH|
115+
SAI|SAL|SAM|SAN|SAS|SAT|SCC|SCN|SCO|SCR|SEL|SEM|SGA|SGN|SHN|SID|SIN|SIO|SIT|SLA|SLO|SLV|SMA|SME|
116+
SMI|SMJ|SMN|SMO|SMS|SNA|SND|SNK|SOG|SOM|SON|SOT|SPA|SRD|SRR|SSA|SSW|SUK|SUN|SUS|SUX|SWA|SWE|SYR|
117+
TAH|TAI|TAM|TAT|TEL|TEM|TER|TET|TGK|TGL|THA|TIB|TIG|TIR|TIV|TKL|TLH|TLI|TMH|TOG|TON|TPI|TSI|TSN|
118+
TSO|TUK|TUM|TUP|TUR|TUT|TVL|TWI|TYV|UDM|UGA|UIG|UKR|UMB|UND|URD|UZB|VAI|VEN|VIE|VOL|VOT|WAK|WAL|
119+
WAR|WAS|WEL|WEN|WLN|WOL|XAL|XHO|YAO|YAP|YID|YOR|YPK|ZAP|ZEN|ZHA|ZND|ZUL|ZUN) #REQUIRED>

format_standard/xdxf_strict.dtd

+13 -51
@@ -1,11 +1,16 @@
1 1
<!ELEMENT xdxf (meta_info,lexicon)>
2-
<!ATTLIST xdxf format (visual|logical) "logical">
3 2
<!ATTLIST xdxf revision CDATA #REQUIRED>
4 3

5-
<!ELEMENT meta_info (title,full_title,description,publisher?,authors?,file_ver,creation_date,last_edited_date,dict_edition?,publishing_date?,dict_src_url?,abbreviations?)>
4+
<!-- Meta-information about the dictionary, its abbreviations, creators, publisher and file versions -->
5+
<!ELEMENT meta_info (languages,title,full_title?,description,publisher?,authors?,file_ver,creation_date,last_edited_date,dict_edition?,publishing_date?,dict_src_url?,abbreviations?)>
6+
<!ELEMENT languages (from+,to+)>
7+
<!ELEMENT from EMPTY>
8+
<!ATTLIST from xml:lang NMTOKEN #REQUIRED>
9+
<!ELEMENT to EMPTY>
10+
<!ATTLIST to xml:lang NMTOKEN #REQUIRED>
6 11
<!ELEMENT title (#PCDATA)>
7 12
<!ELEMENT full_title (#PCDATA)>
8-
<!ELEMENT description (#PCDATA)>
13+
<!ELEMENT description (#PCDATA|br)*>
9 14
<!ELEMENT publisher (#PCDATA)>
10 15
<!ELEMENT authors (author+)>
11 16
<!ELEMENT author (#PCDATA)>
@@ -22,19 +27,21 @@
22 27
<!ELEMENT abbr_k (#PCDATA)>
23 28
<!ELEMENT abbr_v (#PCDATA)>
24 29

30+
<!-- The body of the dictionary: it contains word articles -->
25 31
<!ELEMENT lexicon (ar+)>
26 32
<!ELEMENT ar (k+,def)>
27-
<!ATTLIST ar f (v|l) "l">
28 33
<!ELEMENT k (#PCDATA|opt|sup|sub)*>
29 34
<!ATTLIST k id ID #IMPLIED>
35+
<!ATTLIST k xml:lang NMTOKEN #IMPLIED>
30 36
<!ELEMENT opt (#PCDATA|sup|sub)*>
31 37
<!ENTITY % style "c|sup|sub|i|b|u">
32 38
<!ENTITY % ref "kref|rref|iref">
33-
<!ELEMENT def (gr?,co*,(def+|deftext),ex*,sr?,etm?,categ*)>
34-
<!ELEMENT deftext (#PCDATA|tr|dtrn|abbr|co|di|%ref;|%style;|br)*>
39+
<!ELEMENT def (tr?,gr?,co*,(def+|deftext),ex*,sr?,etm?,categ*)>
35 40
<!ATTLIST def id ID #IMPLIED>
36 41
<!ATTLIST def cmt CDATA #IMPLIED>
37 42
<!ATTLIST def freq CDATA #IMPLIED>
43+
<!ATTLIST def xml:lang NMTOKEN #IMPLIED>
44+
<!ELEMENT deftext (#PCDATA|tr|dtrn|abbr|co|di|%ref;|%style;|br)*>
38 45
<!ELEMENT sr (kref+)>
39 46
<!ELEMENT etm (#PCDATA|tr|abbr|co|di|mrkd|%ref;|%style;|br)*>
40 47
<!ELEMENT categ (kref+)>
@@ -72,48 +79,3 @@
72 79
<!ELEMENT sub (#PCDATA)>
73 80
<!ELEMENT di (#PCDATA)>
74 81
<!ELEMENT br EMPTY>
75-
76-
77-
<!ATTLIST xdxf lang_from (
78-
AAR|ABK|ACE|ACH|ADA|ADY|AFA|AFH|AFR|AIN|AKA|AKK|ALB|ALE|ALG|ALT|AMH|ANG|APA|ARA|ARC|ARG|ARM|ARN|
79-
ARP|ART|ARW|ASM|AST|ATH|AUS|AVA|AVE|AWA|AYM|AZE|BAD|BAI|BAK|BAL|BAM|BAN|BAQ|BAS|BAT|BEJ|BEL|BEM|
80-
BEN|BER|BHO|BIH|BIK|BIN|BIS|BLA|BNT|BOS|BRA|BRE|BTK|BUA|BUG|BUL|BUR|BYN|CAD|CAI|CAR|CAT|CAU|CEB|
81-
CEL|CHA|CHB|CHE|CHG|CHI|CHK|CHM|CHN|CHO|CHP|CHR|CHU|CHV|CHY|CMC|COP|COR|COS|CPE|CPF|CPP|CRE|CRH|
82-
CRP|CSB|CUS|CZE|DAK|DAN|DAR|DAY|DEL|DEN|DGR|DIN|DIV|DOI|DRA|DSB|DUA|DUM|DUT|DYU|DZO|EFI|EGY|EKA|
83-
ELX|ENG|ENM|EPO|EST|EWE|EWO|FAN|FAO|FAT|FIJ|FIL|FIN|FIU|FON|FRE|FRM|FRO|FRY|FUL|FUR|GAA|GAY|GBA|
84-
GEM|GEO|GER|GEZ|GIL|GLA|GLE|GLG|GLV|GMH|GOH|GON|GOR|GOT|GRB|GRC|GRE|GRN|GUJ|GWI|HAI|HAT|HAU|HAW|
85-
HEB|HER|HIL|HIM|HIN|HIT|HMN|HMO|HSB|HUN|HUP|IBA|IBO|ICE|IDO|III|IJO|IKU|ILE|ILO|INA|INC|IND|INE|
86-
INH|IPK|IRA|IRO|ITA|JAV|JBO|JPN|JPR|JRB|KAA|KAB|KAC|KAL|KAM|KAN|KAR|KAS|KAU|KAW|KAZ|KBD|KHA|KHI|
87-
KHM|KHO|KIK|KIN|KIR|KMB|KOK|KOM|KON|KOR|KOS|KPE|KRC|KRO|KRU|KUA|KUM|KUR|KUT|LAD|LAH|LAM|LAO|LAT|
88-
LAV|LEZ|LIM|LIN|LIT|LOL|LOZ|LTZ|LUA|LUB|LUG|LUI|LUN|LUO|LUS|MAC|MAD|MAG|MAH|MAI|MAK|MAL|MAN|MAO|
89-
MAP|MAR|MAS|MAY|MDF|MDR|MEN|MGA|MIC|MIN|MIS|MKH|MLG|MLT|MNC|MNI|MNO|MOH|MOL|MON|MOS|MUL|MUN|MUS|
90-
MWL|MWR|MYN|MYV|NAH|NAI|NAP|NAU|NAV|NBL|NDE|NDO|NDS|NEP|NEW|NIA|NIC|NIU|NNO|NOB|NOG|NON|NOR|NSO|
91-
NUB|NWC|NYA|NYM|NYN|NYO|NZI|OCI|OJI|ORI|ORM|OSA|OSS|OTA|OTO|PAA|PAG|PAL|PAM|PAN|PAP|PAU|PEO|PER|
92-
PHI|PHN|PLI|POL|PON|POR|PRA|PRO|PUS|QAA-QUE|RAJ|RAP|RAR|ROA|ROH|ROM|RUM|RUN|RUP|RUS|SAD|SAG|SAH|
93-
SAI|SAL|SAM|SAN|SAS|SAT|SCC|SCN|SCO|SCR|SEL|SEM|SGA|SGN|SHN|SID|SIN|SIO|SIT|SLA|SLO|SLV|SMA|SME|
94-
SMI|SMJ|SMN|SMO|SMS|SNA|SND|SNK|SOG|SOM|SON|SOT|SPA|SRD|SRR|SSA|SSW|SUK|SUN|SUS|SUX|SWA|SWE|SYR|
95-
TAH|TAI|TAM|TAT|TEL|TEM|TER|TET|TGK|TGL|THA|TIB|TIG|TIR|TIV|TKL|TLH|TLI|TMH|TOG|TON|TPI|TSI|TSN|
96-
TSO|TUK|TUM|TUP|TUR|TUT|TVL|TWI|TYV|UDM|UGA|UIG|UKR|UMB|UND|URD|UZB|VAI|VEN|VIE|VOL|VOT|WAK|WAL|
97-
WAR|WAS|WEL|WEN|WLN|WOL|XAL|XHO|YAO|YAP|YID|YOR|YPK|ZAP|ZEN|ZHA|ZND|ZUL|ZUN) #REQUIRED>
98-
99-
<!ATTLIST xdxf lang_to (
100-
AAR|ABK|ACE|ACH|ADA|ADY|AFA|AFH|AFR|AIN|AKA|AKK|ALB|ALE|ALG|ALT|AMH|ANG|APA|ARA|ARC|ARG|ARM|ARN|
101-
ARP|ART|ARW|ASM|AST|ATH|AUS|AVA|AVE|AWA|AYM|AZE|BAD|BAI|BAK|BAL|BAM|BAN|BAQ|BAS|BAT|BEJ|BEL|BEM|
102-
BEN|BER|BHO|BIH|BIK|BIN|BIS|BLA|BNT|BOS|BRA|BRE|BTK|BUA|BUG|BUL|BUR|BYN|CAD|CAI|CAR|CAT|CAU|CEB|
103-
CEL|CHA|CHB|CHE|CHG|CHI|CHK|CHM|CHN|CHO|CHP|CHR|CHU|CHV|CHY|CMC|COP|COR|COS|CPE|CPF|CPP|CRE|CRH|
104-
CRP|CSB|CUS|CZE|DAK|DAN|DAR|DAY|DEL|DEN|DGR|DIN|DIV|DOI|DRA|DSB|DUA|DUM|DUT|DYU|DZO|EFI|EGY|EKA|
105-
ELX|ENG|ENM|EPO|EST|EWE|EWO|FAN|FAO|FAT|FIJ|FIL|FIN|FIU|FON|FRE|FRM|FRO|FRY|FUL|FUR|GAA|GAY|GBA|
106-
GEM|GEO|GER|GEZ|GIL|GLA|GLE|GLG|GLV|GMH|GOH|GON|GOR|GOT|GRB|GRC|GRE|GRN|GUJ|GWI|HAI|HAT|HAU|HAW|
107-
HEB|HER|HIL|HIM|HIN|HIT|HMN|HMO|HSB|HUN|HUP|IBA|IBO|ICE|IDO|III|IJO|IKU|ILE|ILO|INA|INC|IND|INE|
108-
INH|IPK|IRA|IRO|ITA|JAV|JBO|JPN|JPR|JRB|KAA|KAB|KAC|KAL|KAM|KAN|KAR|KAS|KAU|KAW|KAZ|KBD|KHA|KHI|
109-
KHM|KHO|KIK|KIN|KIR|KMB|KOK|KOM|KON|KOR|KOS|KPE|KRC|KRO|KRU|KUA|KUM|KUR|KUT|LAD|LAH|LAM|LAO|LAT|
110-
LAV|LEZ|LIM|LIN|LIT|LOL|LOZ|LTZ|LUA|LUB|LUG|LUI|LUN|LUO|LUS|MAC|MAD|MAG|MAH|MAI|MAK|MAL|MAN|MAO|
111-
MAP|MAR|MAS|MAY|MDF|MDR|MEN|MGA|MIC|MIN|MIS|MKH|MLG|MLT|MNC|MNI|MNO|MOH|MOL|MON|MOS|MUL|MUN|MUS|
112-
MWL|MWR|MYN|MYV|NAH|NAI|NAP|NAU|NAV|NBL|NDE|NDO|NDS|NEP|NEW|NIA|NIC|NIU|NNO|NOB|NOG|NON|NOR|NSO|
113-
NUB|NWC|NYA|NYM|NYN|NYO|NZI|OCI|OJI|ORI|ORM|OSA|OSS|OTA|OTO|PAA|PAG|PAL|PAM|PAN|PAP|PAU|PEO|PER|
114-
PHI|PHN|PLI|POL|PON|POR|PRA|PRO|PUS|QAA-QUE|RAJ|RAP|RAR|ROA|ROH|ROM|RUM|RUN|RUP|RUS|SAD|SAG|SAH|
115-
SAI|SAL|SAM|SAN|SAS|SAT|SCC|SCN|SCO|SCR|SEL|SEM|SGA|SGN|SHN|SID|SIN|SIO|SIT|SLA|SLO|SLV|SMA|SME|
116-
SMI|SMJ|SMN|SMO|SMS|SNA|SND|SNK|SOG|SOM|SON|SOT|SPA|SRD|SRR|SSA|SSW|SUK|SUN|SUS|SUX|SWA|SWE|SYR|
117-
TAH|TAI|TAM|TAT|TEL|TEM|TER|TET|TGK|TGL|THA|TIB|TIG|TIR|TIV|TKL|TLH|TLI|TMH|TOG|TON|TPI|TSI|TSN|
118-
TSO|TUK|TUM|TUP|TUR|TUT|TVL|TWI|TYV|UDM|UGA|UIG|UKR|UMB|UND|URD|UZB|VAI|VEN|VIE|VOL|VOT|WAK|WAL|
119-
WAR|WAS|WEL|WEN|WLN|WOL|XAL|XHO|YAO|YAP|YID|YOR|YPK|ZAP|ZEN|ZHA|ZND|ZUL|ZUN) #REQUIRED>
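
The header changes in this DTD diff are these: `lang_from`, `lang_to` and `format` leave the root element, `<languages>` becomes the first child of `<meta_info>`, and `<ar>` loses its `f` attribute. A hedged Python sketch of migrating an older file to that layout follows. The function name `migrate_header`, the four-entry code table and the revision string passed by the caller are assumptions. A real converter would need the full mapping from the old three-letter codes to BCP 47 tags and would also have to handle any changes that are not visible in this diff.

```python
import xml.etree.ElementTree as ET

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Illustrative subset only: three-letter codes from the removed enumeration
# mapped to BCP 47 tags. A real converter needs the complete table.
LEGACY_TO_BCP47 = {"ENG": "en", "RUS": "ru", "GER": "de", "FRE": "fr"}


def migrate_header(root, revision):
    """Rewrite a rev. 33 <xdxf> tree in place to the new header layout.

    `revision` is supplied by the caller; the exact string the new
    revision expects is not shown in this diff.
    """
    # Look both codes up first so an unmapped code leaves the tree untouched.
    src = LEGACY_TO_BCP47[root.get("lang_from")]
    dst = LEGACY_TO_BCP47[root.get("lang_to")]
    for name in ("lang_from", "lang_to", "format"):
        root.attrib.pop(name, None)
    root.set("revision", revision)

    languages = ET.Element("languages")
    ET.SubElement(languages, "from", {XML_LANG: src})
    ET.SubElement(languages, "to", {XML_LANG: dst})
    # <languages> comes first in the new <meta_info> content model.
    root.find("meta_info").insert(0, languages)

    for article in root.iter("ar"):
        article.attrib.pop("f", None)  # the f attribute was dropped from <ar>
    return root
```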

sample-dicts/rev33.xml

+63
@@ -0,0 +1,63 @@
1+
<?xml version="1.0" encoding="UTF-8" ?>
2+
<!DOCTYPE xdxf SYSTEM "xdxf_old_schema_rev33.dtd">
3+
<xdxf lang_from="ENG" lang_to="ENG" format="logical" revision="033">
4+
<meta_info>
5+
<title>Webster's Dictionary</title>
6+
<full_title>Webster's Unabridged Dictionary</full_title>
7+
<description>Webster's Unabridged Dictionary published 1913 by the Webster Institute</description>
8+
<file_ver>001</file_ver>
9+
<creation_date>07-04-2013</creation_date>
10+
<last_edited_date>13-10-2017</last_edited_date>
11+
<abbreviations>
12+
<abbr_def><abbr_k>n.</abbr_k><abbr_v>noun</abbr_v></abbr_def>
13+
<abbr_def><abbr_k>v.</abbr_k><abbr_v>verb</abbr_v></abbr_def>
14+
<abbr_def><abbr_k>Av.</abbr_k><abbr_k>Ave.</abbr_k><abbr_v>Avenue</abbr_v> </abbr_def>
15+
</abbreviations>
16+
</meta_info>
17+
<lexicon>
18+
<ar>
19+
<k>home</k>
20+
<def>
21+
<gr><tr>'həum</tr><abbr>n.</abbr> <rref start="16384" size="512" lctn="sounds_of_words.ogg"/></gr>
22+
<co>XDXF <iref href="http://xdxf.sourceforge.net"><b>Home</b> page</iref></co>
23+
<def><deftext>One's own dwelling place; the house in which one lives.</deftext></def>
24+
<def><deftext>One's native land; the place or country in which one dwells.</deftext></def>
25+
<def>
26+
<deftext>The abiding place of the affections.</deftext>
27+
<ex><ex_orig>For without hearts there is no home.</ex_orig></ex>
28+
</def>
29+
<def>
30+
<deftext>
31+
<dtrn>дом</dtrn>, at home - дома, у себя;
32+
</deftext>
33+
<ex>
34+
<ex_orig>make yourself at <mrkd>home</mrkd></ex_orig>
35+
<ex_tran>будьте как <mrkd>дома</mrkd></ex_tran>
36+
</ex>
37+
<categ><kref idref="fb982hk">Society</kref></categ>
38+
</def>
39+
<sr><kref type="rel">home-made</kref></sr>
40+
</def>
41+
</ar>
42+
<ar f="v">
43+
<k id="fb982hk">Society</k>
44+
<def>
45+
<deftext>Plural form of word <kref>index</kref>.</deftext>
46+
</def>
47+
</ar>
48+
<ar>
49+
<k>disc</k>
50+
<k>disk</k>
51+
<def>
52+
<gr><abbr>n.</abbr></gr>
53+
<deftext>A flat, circular plate; as, a disk of metal or paper.</deftext>
54+
</def>
55+
</ar>
56+
<ar>
57+
<k>CO<sub>2</sub></k>
58+
<def>
59+
<deftext>Carbon dioxide (CO<sub>2</sub>) - a heavy odorless gas formed during respiration.</deftext>
60+
</def>
61+
</ar>
62+
</lexicon>
63+
</xdxf>
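
The sample above declares its abbreviations once in `<meta_info>` and then uses them through `<abbr>` inside `<gr>`. As a rough illustration of how a reader might resolve them, here is a short Python sketch. The function names `abbreviation_table` and `grammar_labels` are hypothetical, and the sketch only looks at the top-level `<gr>` of an article.

```python
import xml.etree.ElementTree as ET


def abbreviation_table(root):
    """Map every <abbr_k> spelling to the <abbr_v> text of its <abbr_def>."""
    table = {}
    for abbr_def in root.iterfind("meta_info/abbreviations/abbr_def"):
        expansion = abbr_def.findtext("abbr_v", default="")
        # One definition may list several spellings, e.g. "Av." and "Ave.".
        for key in abbr_def.findall("abbr_k"):
            table[key.text] = expansion
    return table


def grammar_labels(article, table):
    """Return the <abbr> labels of the article's top-level <gr>, expanded.

    Abbreviations missing from the table are returned unchanged.
    """
    return [table.get(a.text, a.text) for a in article.iterfind("def/gr/abbr")]
```

For the `disc`/`disk` article in the sample, the table built from its `<abbreviations>` block would expand `n.` to `noun`.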
