Consulting for the development of linguistic resources
Methods and materials for structuring linguistic knowledge
- construction of generic or specialized textual corpora,
- annotation of corpora for training and validation of NLP systems,
- production of training corpora for statistical classification and extraction systems,
- development of multilingual lexica for cross-language applications,
- creation of semantic-conceptual networks in monolingual and multilingual varieties,
- development of grammars for syntactic analysis, disambiguation or chunking...
These are just a few examples of the activities for which our international multilingual team offers its experience and expertise. Our linguistic resources are always designed and developed to be easily integrable into our clients' applications.
Our competence enables us to fulfil specific needs, covering the whole linguistic spectrum: phonetics and prosody, morphology, syntax and semantics. We are currently working on the following languages: Italian, English in its different variants, French, Spanish, Catalan, Portuguese, German, Dutch, Swedish, Norwegian, Finnish, Danish, Polish, Russian, Belarusian, Estonian, Latvian, Lithuanian, Ukrainian, Greek, Turkish, Arabic, Hebrew, Armenian, Albanian, Croatian, Serbian, Czech, Slovak, Slovenian, Romanian, Bulgarian, Hungarian, Chinese, Japanese.