Licensing FAQ

What does a monolingual dataset from Oxford Dictionaries offer?

A typical monolingual dataset contains between 30,000 and 150,000 defined terms in one language. Most of these datasets also include information such as part-of-speech details, phonetics, examples, idioms, and derivatives. The exact contents vary depending on the dataset's source dictionary.

 

What does a bilingual dataset from Oxford Dictionaries offer?

A typical bilingual dataset contains between 30,000 and 150,000 terms translated from one language into another. Most of our bilingual datasets are two-way, meaning they provide translations in both directions, for example from English to Korean as well as from Korean to English. Bilingual dictionaries are available in several different language combinations.

 

What does a synonym dataset from Oxford Dictionaries offer?

Our synonym datasets group words and phrases that are similar in meaning to the headword. Synonym datasets usually offer between 10,000 and 80,000 synonyms and, unlike monolingual datasets, generally do not provide definitions of the headword. Some synonym datasets provide usage examples and notes on nuances of meaning between synonyms and on when to use one word in preference to another. We offer both stand-alone synonym datasets and synonyms linked to monolingual or bilingual content.


What does a wordlist from Oxford Dictionaries offer?

Wordlists can take different shapes depending on your needs. Some clients request wordlists drawn from web-crawled corpora that contain billions of words of current language. Oxford’s in-house language experts analyse these corpora to add frequency and part-of-speech information, using language engineering techniques to ensure the high quality of the resulting wordlist. Such wordlists include inflected forms, proper nouns, trademarks, acronyms and abbreviations, slang, and vulgar language. Other clients ask that we supply wordlists as datasets of linked inflections along with our monolingual and bilingual datasets, while others need customized wordlists with certain words (such as vulgarities) removed. Our technologists can engineer Oxford Dictionaries content to create a variety of general and domain-specific wordlists.
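
As a rough illustration of what a customized wordlist might involve, the Python sketch below filters a small, invented wordlist against a blocklist and orders the result by frequency. The WordEntry fields, the customize_wordlist function, and the sample data are all hypothetical; they do not reflect the actual structure of any Oxford Dictionaries wordlist.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordEntry:
    """One wordlist row. Field names are assumptions for illustration only."""
    form: str        # the word as it appears in text, including inflected forms
    pos: str         # part-of-speech label
    frequency: int   # invented corpus count


def customize_wordlist(entries, blocklist):
    """Drop blocked words (case-insensitively) and sort by descending frequency."""
    blocked = {word.lower() for word in blocklist}
    kept = [entry for entry in entries if entry.form.lower() not in blocked]
    return sorted(kept, key=lambda entry: entry.frequency, reverse=True)


# Invented sample data, not taken from any real dataset.
sample = [
    WordEntry("run", "verb", 950),
    WordEntry("running", "verb", 410),
    WordEntry("darn", "interjection", 12),
    WordEntry("NASA", "proper noun", 75),
]

filtered = customize_wordlist(sample, blocklist={"darn"})
print([entry.form for entry in filtered])  # ['run', 'running', 'NASA']
```

In practice the blocklist and the fields carried by each entry would depend on the wordlist agreed with the client.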

How is Oxford Dictionaries content used?

Widely! Clients approach us for language content for their hardware and software, and also use our data for research and development purposes.

Oxford Dictionaries content can be found in:

• e-readers
• assistive technology products
• language-learning software
• predictive text applications
• word games
• back-end purposes such as machine learning, search, and text analytics

How is Oxford Dictionaries data delivered?

Our lexical content combines OUP-curated content with content sourced from respected publishers and content owners worldwide and validated by our renowned dictionary team. To streamline your engineering process, all our datasets can be delivered via the Oxford Dictionaries API or as a complete dataset in XML that implements our standard data structure. Other data formats may be available on request.
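
For readers wondering what working with an XML delivery could look like, here is a minimal Python sketch that reads entries using the standard library. The element names (dictionary, entry, headword, pos, definition) and the sample content are assumptions made up for this example; the actual standard data structure is not described here and will differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML layout and invented content, for illustration only.
SAMPLE_XML = """
<dictionary>
  <entry id="e1">
    <headword>lamp</headword>
    <pos>noun</pos>
    <definition>a device that gives light</definition>
  </entry>
  <entry id="e2">
    <headword>walk</headword>
    <pos>verb</pos>
    <definition>to move along on foot</definition>
  </entry>
</dictionary>
"""


def load_entries(xml_text):
    """Parse the assumed layout into a dict keyed by headword."""
    root = ET.fromstring(xml_text.strip())
    entries = {}
    for entry in root.findall("entry"):
        headword = entry.findtext("headword", default="")
        entries[headword] = {
            "id": entry.get("id"),
            "pos": entry.findtext("pos", default=""),
            "definition": entry.findtext("definition", default=""),
        }
    return entries


entries = load_entries(SAMPLE_XML)
print(entries["walk"]["pos"])  # verb
```

A real integration would follow the schema supplied with the dataset rather than these placeholder element names.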

Do you have an API?

Developers who want to interact instantly with our dictionary data via an API can visit our dedicated API self-service portal. Our language data includes lists of words, definitions, and translations into a wide range of languages.
