Dictionary: Diseases
Priya
Disease
https://github.com/petermr/openVirus/tree/master/dictionaries/diseases/disease_new.xml
https://github.com/petermr/openVirus/blob/master/dictionaries/test/disease_synonym.xml
https://github.com/petermr/openVirus/tree/master/dictionaries/diseases/disease_dict.md
This dictionary contains the names of commonly occurring diseases and of diseases that co-occur during a viral epidemic.
Number of entries: 17223
The disease names were collected from Wikidata using the Wikidata Query Service.
This dictionary still needs to be updated with the disease names and codes from ICD-10, retrieved through Wikidata.
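As a rough illustration of the kind of query involved, the sketch below only builds a SPARQL string and sends nothing to the Wikidata Query Service. It assumes that `Q12136` (disease) and `P494` (ICD-10) are the relevant Wikidata identifiers and that diseases are modelled with `P31` (instance of). The helper name `build_disease_query` is hypothetical, and the query actually used for this dictionary may differ.

```python
from textwrap import dedent


def build_disease_query(limit=None):
    """Return a SPARQL query string for diseases that carry an ICD-10 code.

    Assumptions (not taken from the dictionary's own query):
    wd:Q12136 = disease, wdt:P31 = instance of, wdt:P494 = ICD-10.
    """
    query = dedent("""\
        SELECT ?disease ?diseaseLabel ?icd10 WHERE {
          ?disease wdt:P31 wd:Q12136 ;
                   wdt:P494 ?icd10 .
          SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
        }""")
    # Optionally cap the number of rows while testing the query.
    if limit is not None:
        query += f"\nLIMIT {int(limit)}"
    return query


if __name__ == "__main__":
    print(build_disease_query(limit=10))
```

The printed query can be pasted into the Wikidata Query Service and the result downloaded as CSV.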
I have updated the disease synonym dictionary at https://github.com/petermr/openVirus/blob/master/dictionaries/test/disease_synonym.xml. After looking through a few pages, I have some questions about the synonyms created in the dictionary:
- The `synonyms` include some common words, letters, and numbers such as `2`, `Male`, `face`, `and neck`, `X`, `X-linked`, and so on. If this dictionary is used in `ami search`, will it create `DataTables` that include these common words?
- I saw some terms containing special letters that either have a Wikidata ID or appear among the synonyms: `Uberkoten` (the correct special letters could not be reproduced here) has Wikidata ID `Q332590`, and `Chédiak` & `François` appear in `synonyms`. Is it okay to leave them, or should they be removed manually?
- Some entry names contain the Wikidata ID instead of a name. The entries for `Q886810` & `Q1607642` contain `no entry name` other than the ID itself. Should I remove them manually?
- Some synonyms contain bracketed words, like `<synonym>Dwarfism : [pitutary] or [hypophyseal (& Lorain - Levi)]</synonym>`. Will the whole string be matched together, the parts separately, or both? If only together, might it miss some words/terms?
- Some synonyms, such as `NOS` and `X-linked`, are repeated more than twice across different entries. Should they be removed?
- There are also `acronyms` among the `synonyms`, such as `PAN` (Polyarteritis nodosa), `DISH` (Diffuse Idiopathic Skeletal Hyperostosis), etc.
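
Two of the points above (entries named only by a Wikidata ID, and synonyms repeated across several entries) can be checked automatically. This is a minimal sketch and not part of `amidict`. It assumes the dictionary is XML with `<entry term="...">` elements that hold `<synonym>` children. The function name `audit_dictionary` and the `max_repeats` parameter are hypothetical.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# A bare Wikidata item ID such as Q886810.
WIKIDATA_ID = re.compile(r"Q\d+")


def audit_dictionary(xml_text, max_repeats=2):
    """Return (id_only_terms, repeated_synonyms) for a dictionary XML string.

    Assumed layout: <dictionary><entry term="..."><synonym>..</synonym></entry>...
    """
    root = ET.fromstring(xml_text)
    id_only = []
    entries_using = defaultdict(set)  # synonym text -> indices of entries using it
    for index, entry in enumerate(root.iter("entry")):
        term = (entry.get("term") or "").strip()
        # Entry whose name is just a Wikidata ID.
        if WIKIDATA_ID.fullmatch(term):
            id_only.append(term)
        for syn in entry.iter("synonym"):
            text = (syn.text or "").strip()
            if text:
                entries_using[text].add(index)
    # Synonyms that occur in more than `max_repeats` different entries.
    repeated = sorted(s for s, used in entries_using.items() if len(used) > max_repeats)
    return id_only, repeated
```

The two returned lists are only candidates for review. Whether to delete the flagged items is still the open question raised above.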
- For the disease_new dictionary, the SPARQL query result was downloaded as a `csv` file and converted into a `text document`. The conversion turned some symbols, such as `-` and `'`, into special characters that `amidict` did not recognize during dictionary creation, so they were edited manually.
- The synonym dictionary contains common words as `synonym` entries. These will create `false positives` in `DataTables`, so they must be removed.
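
Both manual clean-up steps above could be scripted. The sketch below makes several assumptions. The special characters are taken to be typographic dashes and quotes, which the page does not actually specify. The dictionary is taken to be XML with `<entry>` elements that hold `<synonym>` children. The blocklist contains only the common words quoted earlier on this page. The names `normalize_punctuation`, `strip_common_synonyms`, and `COMMON_WORDS` are hypothetical and not part of `amidict`.

```python
import xml.etree.ElementTree as ET

# Typographic dashes/quotes mapped back to plain ASCII (assumed culprits).
PUNCTUATION = str.maketrans({
    "\u2010": "-", "\u2011": "-", "\u2012": "-", "\u2013": "-",
    "\u2014": "-", "\u2212": "-",
    "\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"',
})

# Common words quoted on this page; extend after reviewing the dictionary.
COMMON_WORDS = {"2", "male", "face", "and neck", "x", "x-linked"}


def normalize_punctuation(text):
    """Replace typographic dashes and quotes with ASCII; keep letters like é."""
    return text.translate(PUNCTUATION)


def strip_common_synonyms(xml_text, common=COMMON_WORDS):
    """Drop <synonym> children whose text is a blocklisted common word.

    Returns the cleaned XML string and the number of synonyms removed.
    """
    root = ET.fromstring(xml_text)
    removed = 0
    for entry in root.iter("entry"):
        # Copy the child list so removal does not disturb iteration.
        for syn in list(entry.findall("synonym")):
            text = (syn.text or "").strip().lower()
            if text in common:
                entry.remove(syn)
                removed += 1
    return ET.tostring(root, encoding="unicode"), removed
```

Running `normalize_punctuation` on the text document before dictionary creation, and `strip_common_synonyms` on the generated XML afterwards, would replace the manual edits. The blocklist itself still needs human review.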