Lexicon for Named Entities

| Financer / Programme | Name | Year | Targeted task | Resource modality | Source | Domain | Content | Types | Manual/Auto | Direct (D) / indirect (I) use | Language | Updated regularly | Size (in words if no indication) | Format | License | Availability | Catalogue reference | Link/Doc/Ref |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| crowd | Wikipedia | 2001 | IE | W | | GEN | names, triggers, translations | many | crowd | I | multi | Y | | wiki | open | | | |
| crowd, DBpedia association | DBpedia | 2007 | EL | W | | GEN | names, triggers, translations | many | auto | D | multi | Y | | rdf | open | | | |
| | DBpedia Lexicalizations Dataset | | NERC, EL | W | DBpedia | GEN | names | many | auto | D | en | Y | not found | rdf | open | | | http://wiki.dbpedia.org/Datasets/NLP, http://dbpedia.org/lexicalizations, https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Downloads |
| | YAGO | 2008 | WSD, EL | W | WKPD, WN, Geonames | GEN | names | many | auto | D | en | Y | 10M entities | rdf | open | | | |
| | BabelNet | 2012 | WSD, EL | W | WKPD, WN, MT | GEN | several, names, translations | many | auto | D | multi | Y | 7.7M | rdf | CC-BY-SA | | | |
| | WordNet | 1985 | WSD | W | | GEN | names | many | manual | D | en | Y | | | | | | |
| | JRC-Names | 2011 | several, NERC, EL | W | NP | GEN | names, triggers, translations | P, O | auto | D | multi | Y | 1.7M | rdf | free for research | | | https://ec.europa.eu/jrc/en/language-technologies/jrc-names |
| | Geonames | | IE, EL | W | N/A | GEO | names, translations | L | mixed | D | multi | Y | 10M | rdf | CC-BY | | | |
| | Prolex | 2003 | NERC | W | | GEN | names | many | mixed | D | fr | Y | 96k | lmf | | | | |
| | Prolex | 2003 | NERC | W | | GEN | names | many | mixed | D | pl | Y | 39k | lmf | | | | |
| | Prolex | 2003 | NERC | W | | GEN | names | many | mixed | D | en | Y | 19k | lmf | | | | |
| CESAR project | Polish Named Entity Gazetteer | 2012 | NERC | W | | GEN | names, triggers | P, L, O | mixed | D | pl | N | 135k | txt, LMF | CC-BY-SA | free of charge | | http://metashare.elda.org/repository/browse/polish-named-entity-gazetteer/c404a89a6aff11e284b6000423bfd61c902eff198dae43649d4ce4a9b82ec092/ |
| CESAR project | Named entity lexical database | 2013 | speech synthesis | S | | GEN | names | P, L | manual | D | hu | N | 131k | txt | CC-BY | non-free | | http://speechlab.tmit.bme.hu/CESAR/name_hu_description_en.pdf |
| CESAR project | Bulgarian MWE dictionary | 2013 | NLP application | W | WKPD, dictionaries, electronic corpora | GEN | names | N/A | manual | I | bg | N | N/A, contains some NE | | CC-BY-NC | | | |
| CESAR project | PNET (Polish Named Entity Triggers) | 2012 | NERC | W | WKPD, PoliMorf | GEN | triggers | | mixed | D | pl | N | 28k | | Available - Restricted Use | free of charge | | http://www.nkjp.pl/settings/papers/sav-pisk-iis10.pdf, http://zil.ipipan.waw.pl/PNET |
| | MINELex (Multilingual, Interoperable Named Entity Lexicon) | 2011 | NERC | W | WKPD | GEN | names | | auto | D | en | N | 975k NEs | | | downloadable | | http://www.computing.dcu.ie/~atoral/resources.html |
| | MINELex (Multilingual, Interoperable Named Entity Lexicon) | 2011 | NERC | W | WKPD | GEN | names | | auto | D | es | N | 137k | | | downloadable | | http://www.computing.dcu.ie/~atoral/resources.html |
| | MINELex (Multilingual, Interoperable Named Entity Lexicon) | 2011 | NERC | W | WKPD | GEN | names | | auto | D | it | N | 125k | | | downloadable | | http://www.computing.dcu.ie/~atoral/resources.html |
| | Arabic NET | 2013 | TR | W | WKPD, news | GEN | names, translations | CONLL | auto | D | ar | N | 60k | txt | | downloadable | | http://www.qatar.cmu.edu/~behrang/NETLexicon/ |
| | Arabic NEs | 2011 | NERC | W | WKPD | GEN | names, ontology | many | manual | D | ar | N | 45k | LMF | | downloadable | | https://sourceforge.net/projects/arabicnes/, http://www.lrec-conf.org/proceedings/lrec2010/pdf/797_Paper.pdf |
| | BioLexicon | 2009 | Text Mining | W | terminology, gene names | BIO | names sql | | | | en | N | | sql | Academic - Non Commercial Use | | ELRA-T0373, 152-047-849-795-0 | |
| | GLiCom Spanish Wordform list – Regular word-forms | | several, NERC | W | | generic | several, names | L | | D | es | N | 8k | | | non free | ELRA-L0095-01 | |
| | Historical and NE lexica | 2011 | OCR, NERC | W | historical texts | generic | names | BASIC | mixed | D | en | N | 241k | sql, xml | not decided | downloadable (members) | | http://www.digitisation.eu/tools-resources/language-resources/historical-and-named-entities-lexica-of-german/ |
| | Historical and NE lexica | 2011 | OCR, NERC | W | historical texts | generic | names | BASIC | mixed | D | du | N | 475k | sql, xml | not decided | downloadable (members) | | http://www.digitisation.eu/tools-resources/language-resources/historical-and-named-entities-lexica-of-german/ |
| | Historical and NE lexica | 2011 | OCR, NERC | W | historical texts | generic | names | BASIC | mixed | D | de | N | ? | sql, xml | not decided | downloadable (members) | | http://www.digitisation.eu/tools-resources/language-resources/historical-and-named-entities-lexica-of-german/ |
| | Name-hu | 2013 | NERC | W, S | | generic | names | P, L | manual | Y | hu | N | 130k | | | non free | | http://metashare.elda.org/repository/browse/named-entity-lexical-database/7dccf07681b611e2892a000c29bfc0d4df16d6f6efe84e43969eeff4c3d2f3d6/ |
| | Common Thesaurus Audiovisual Archives | 2010 | several, NERC | BC | AudioVisual archives | GEN | names | P, L, O | not said | | du | | 160k | RDF | | | | |
| | Multilingual lexicon of toponyms | 2012 | NERC | W | WKPD | GEN | names | P | auto | | pl | N | 155k | | Academic - Non Commercial Use | | | http://metashare.elda.org/repository/browse/multilingual-lexicon-of-toponyms/ec2cbb6e6aff11e284b6000423bfd61cca437e34902c40e790b23005f5fc3c43/ |
| | Uniprot/Swissprot | long-standing | NERC | W | scientific literature | BIO | names | | manual | Y | en | Y | 542k | RDF, XML, FASTA, GFF | CC BY ND 3.0 | | | |
| | Uniprot/TrEMBL | long-standing | NERC | W | scientific literature | BIO | names | | auto | Y | en | Y | 54M | RDF, XML, FASTA, GFF | CC BY ND 3.0 | | | |
| | UniParc | long-standing | EL | W | | BIO | | | | | en | Y | | RDF, XML, FASTA, GFF | CC BY ND 3.0 | | | |
| | UniMES | long-standing | NERC | W | | BIO | | | | | en | Y | | RDF, XML, FASTA, GFF | CC BY ND 3.0 | | | |
| | UMLS Metathesaurus | long-standing | NERC, EL | W | many other terminologies | BIO, MEDICAL | names | | | Y | multi | Y | | RRF (Rich Release Format) | | | | https://www.nlm.nih.gov/pubs/factsheets/umls.html, https://www.nlm.nih.gov/research/umls/sourcereleasedocs/index.html# |

Acronyms