Useful Artificial Intelligence (AI) applications rely on high-quality data: “If your data is bad, your machine learning tools are useless”.

Thomas C. Redman, Ph.D., "the Data Doc" and President of Navesink Consulting Group

Datasets for training AI applications in precision medicine

More than 10 years of experience

We create corpora to develop machine learning algorithms

3 Easy Steps to Get Started

  • We can support the development of your AI application
  • We rely on expert-guided, semi-automatic processes
  • We manually curate corpora and datasets

Let's work together!

Selected Publications

Our foundation is research, the first step of innovation.

J. Pérez-Granado, J. Piñero, and L. I. Furlong. ResMarkerDB: A database of biomarkers of response to antibody therapy in breast and colorectal cancer. Database, 2019(1), 2019. doi:10.1093/database/baz060.
A. Gutiérrez-Sacristán, À. Bravo, M. Portero-Tresserra, O. Valverde, A. Armario, M. C. Blanco-Gandía, A. Farré, L. Fernández-Ibarrondo, F. Fonseca, J. Giraldo, A. Leis, A. Mané, M. A. Mayer, S. Montagud-Romero, R. Nadal, J. Ortiz, F. J. Pavon, E. Jesús Perez, M. Rodríguez-Arias, A. Serrano, M. Torrens, V. Warnault, F. Sanz, and L. I. Furlong. Text mining and expert curation to develop a database on psychiatric diseases and their genes. Database, 2017:43, 2017. doi:10.1093/database/bax043.
À. Bravo, J. Piñero, N. Queralt-Rosinach, M. Rautschka, and L. I. Furlong. Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research. BMC bioinformatics, 16(1):55, 2015. doi:10.1186/s12859-015-0472-9.

E. M. Van Mulligen, A. Fourrier-Reglat, D. Gurwitz, M. Molokhia, A. Nieto, G. Trifirò, J. A. Kors, and L. I. Furlong. The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships. Journal of biomedical informatics, 45(5):879–884, 2012. doi:10.1016/j.jbi.2012.04.004.

D. Rebholz-Schuhmann, A. J. Yepes, C. Li, S. Kafkas, I. Lewin, N. Kang, P. Corbett, D. Milward, E. Buyko, E. Beisswanger, and others. Assessment of NER solutions against the first and second CALBC Silver Standard Corpus. Journal of biomedical semantics, 2(5):S11, 2011. doi:10.1186/2041-1480-2-S5-S11.