Most human diseases are influenced by our genes. Identifying the genes that cause diseases is key to pinpointing novel strategies for prevention and treatment.

A text mining engine using deep learning approaches, tailored to the peculiarities of the biomedical literature

Our expertise is backed by more than 10 years of experience developing DISGENET, a resource widely used in the biomedical community, with over 50,000 users per year and more than 2,500 bibliographic citations.

In DISGENET plus we have incorporated:

  • Deep learning models to detect mentions of diseases and genes
  • New module to manage acronyms & abbreviations
  • Deep learning models to detect and characterize negations
  • New module to detect mentions of animal models

Performance score (F-score) increased to 92%
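For readers unfamiliar with the metric, an F-score combines precision and recall into a single number. The sketch below shows the general formula only; the source does not state which F variant or evaluation set was used, so the function name, the `beta` parameter, and the counts are illustrative assumptions, not DISGENET plus figures.

```python
def f_score(true_positives: int, false_positives: int,
            false_negatives: int, beta: float = 1.0) -> float:
    """Return the F-beta score from raw counts (beta=1 gives the usual F1)."""
    if true_positives == 0:
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Made-up counts for illustration only: 90 correct mentions,
# 10 spurious mentions, 5 missed mentions.
example = f_score(90, 10, 5)
print(f"F1 = {example:.3f}")
```

With `beta=1`, the score is the harmonic mean of precision and recall, so it is high only when both are high.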

The DISGENET plus engine improves the detection and normalization of gene and disease mentions by combining deep learning models with an extensive set of custom heuristics. New text analysis modules have been integrated into DISGENET plus to consistently detect and characterize distinctive linguistic traits of biomedical texts, such as acronyms, abbreviations, and negations.
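To give a sense of what an acronym heuristic can look like, here is a minimal sketch that pairs a parenthesised short form with the preceding words whose initials spell it. This is a deliberately simple, generic rule written for illustration; it is not the DISGENET plus module, and the names `SHORT_FORM` and `find_acronyms` are hypothetical.

```python
import re

# A parenthesised short form of 2-10 uppercase letters or digits, e.g. "(AD)".
SHORT_FORM = re.compile(r"\(([A-Z][A-Z0-9]{1,9})\)")


def find_acronyms(text: str) -> dict[str, str]:
    """Map each short form to the preceding words whose initials spell it."""
    found = {}
    for match in SHORT_FORM.finditer(text):
        short = match.group(1)
        # Words that appear before the opening parenthesis.
        words = re.findall(r"[A-Za-z0-9]+", text[:match.start()])
        candidate = words[-len(short):]
        initials = "".join(word[0].upper() for word in candidate)
        # Accept only when there is one word per letter and the initials match.
        if len(candidate) == len(short) and initials == short:
            found[short] = " ".join(candidate)
    return found


sentence = "Variants were linked to amyotrophic lateral sclerosis (ALS) risk."
print(find_acronyms(sentence))
```

A rule this strict misses short forms that skip words or reuse inner letters, which is one reason a production system would complement such heuristics with learned models.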

