We consider community-run evaluation challenges the best way to provide an independent and unbiased evaluation of text mining tools. We have participated in the BioCreative challenges (since 2006), the BioNLP shared task (2009) and CALBC (2010). In BioCreative II (2006) we obtained very good results in the tasks of detecting protein interactions, and the best results in detecting experimental methods. In BioCreative II.5 (2009) we obtained the best results in the PPI task (extraction of protein-protein interactions). In BioCreative III (2010) we were among the three best-ranked teams in the PPI-ACT task (a binary decision on whether a paper contains a curatable protein-protein interaction) and the PPI-IMT task (detection of experimental methods). In the latter task, we were the only group that managed to return results with full recall (while maintaining a near-best AUC score). Our ODIN tool received favourable comments from curators in the IAT (interactive) task. In the "triage task" of BioCreative 2012 we once again obtained the best overall results.
Our selected publications provide a good overview of the techniques used in the OntoGene system. We are based at the Institute of Computational Linguistics (Department of Computer Science) of the University of Zurich.
- We are ready to start our new NIH-funded project with the RegulonDB group!
- A project proposal submitted to the National Institutes of Health (US) in collaboration with the Center for Genomic Sciences, UNAM, Mexico, has been selected for funding.
- Our role will be to deliver text mining services and assisted curation tools.
- Our proposal for a novel BioCreative task (Extraction of causal network information in Biological Expression Language (BEL)), in collaboration with PMI R&D and Fraunhofer SCAI, has been approved. Stay tuned for more information about this exciting new text mining challenge!
- We are moving into PubMed-scale processing. We recently parsed all of PubMed in search of protein-protein interactions.
- Check out the limited preview of our PubMed-wide PPI search application!
- A paper describing the OntoGene web services has been published in BMC Bioinformatics.
- OntoGene Web Services for Biomedical Text Mining. Fabio Rinaldi, Simon Clematide, Hernani Marques, Tilia Ellendorff, Raul Rodriguez-Esteban, Martin Romacker. BMC Bioinformatics, 2014.
- 2014 Roche, RegulonDB, BioC, SMBM 2014
- OntoGene enters a new strategic partnership with the Data Science group at Hoffmann-La Roche.
- Dr. Fabio Rinaldi gives a set of invited talks at:
- Feb 11, Center for Genomic Sciences, Universidad Nacional Autónoma de México (UNAM), Campus Morelos, Cuernavaca, Mexico.
- Feb 14, Information Sciences Institute, Marina del Rey, CA, USA.
- Feb 18, Instituto Politécnico Nacional (IPN), Ciudad de Mexico, Distrito Federal, Mexico.
- Dr. Fabio Rinaldi invited to co-chair SMBM 2014, October 6-7, University of Aveiro, Portugal.
- Please check the proceedings.
- Dr. Fabio Rinaldi gives a highlight presentation at ECCB 2014.
- Collaboration with the RegulonDB group, UNAM, Mexico:
- presentation about assisted curation with ODIN at BioCuration 2014
- joint paper published in DATABASE: The Journal of Biological Databases and Curation
- joint project proposal accepted for funding by the National Institutes of Health (NIH), US
- Collaboration with the Wilbur group at the National Center for Biotechnology Information / National Library of Medicine (NLM/NCBI) on BioC.
- A joint paper on BioC implementations has been published.
- BioC is a new standard for text representation in biomedical text mining.
- 2013 BioCreative 2013, LBM 2013
- OntoGene's participation in BioCreative 2013 was a resounding success. We were involved in 3 tasks (in task 5 with two slightly different versions of ODIN), obtaining top-ranked results in the CTD tasks and very good comments on the BioC and IAT tasks (which did not have a quantitative evaluation). We had a total of 4 papers, 3 presentations and one demo (we are probably the most active participant in BioCreative)! More information and papers here. As part of our 2013 BioCreative participation we provide:
- PyBioC: a native Python implementation of the BioC core
- ODIN-CTD: a version of ODIN (our curation interface) customized for CTD
- OntoGene text mining web services via RESTful interfaces: http://kitt.cl.uzh.ch/kitt/ontogene/webservices/
- Dr. Fabio Rinaldi invited to co-chair LBM 2013, December 12-13, University of Tokyo, Japan.
- 2012 BioCreative 2012, SMBM 2012
- We organized the 5th International Symposium on Semantic Mining in Biomedicine (SMBM).
- We obtained the best overall results in the 'Triage' task of BioCreative 2012 (in particular due to very accurate entity recognition), as described in this paper.
- 2011 Text Mining for Pharmacogenomics
- Our collaboration with the PharmGKB group at Stanford University on assisted curation resulted in the following journal publication: doi:10.1093/database/bas021.
- The text mining technologies used for our pharmacogenomics experiments are described in this journal publication: doi:10.1016/j.jbi.2012.04.014.
- 2010 CALBC, BioCreative 2010
- In the BioCreative III shared task we were one of only two groups which participated in all of the tasks, obtaining satisfactory results in all of them. We were involved as co-authors in four journal papers in the special issue of BMC Bioinformatics on BioCreative III.
- In the CALBC competition (2009/2010), which targets large-scale entity detection across the biomedical literature, our system achieved the best results in the 'species' and 'diseases' categories.
- 2009 BioCreative 2009
- Our participation in the BioCreative II.5 competitive evaluation (2009) of biomedical text mining systems resulted in the best run for the detection of protein-protein interactions (according to the 'raw' AUC iP/R metric). Overall, our system was considered one of the three best.
- From Aug 2010 to Jul 2014 we were funded by the Swiss National Science Foundation (SNF) through the SASEBio project (Semi-Automated Semantic Enrichment of the Biomedical Literature). Additional funding was provided by Novartis and Roche.
- Additionally, in 2012-2013 a post-doc in our group (Gintarė Grigonytė) was funded by a Sciex fellowship (see BioTermEvo project).
- In 2008/2009 we were funded by the SNF project Detection of Biological Interactions from Biomedical Literature (grant 100014-118396/1).
OntoGene is a non-commercial research project. We have nothing to do with a commercial product of the same name. We are also not related to the so-called "Ontogene network".