We aim to build and evaluate an open-source natural language processing system for information extraction from the clinical free text of electronic medical records. We describe and evaluate our system, the clinical Text Analysis and Knowledge Extraction System (cTAKES), released open-source at http://www.ohnlp.org. cTAKES builds on existing open-source technologies: the Unstructured Information Management Architecture (UIMA) framework and the OpenNLP natural language processing toolkit. Its components, specifically trained for the clinical domain, create rich linguistic and semantic annotations. Performance of individual components: sentence boundary detector accuracy=0.949; tokenizer accuracy=0.949; part-of-speech tagger accuracy=0.936; shallow parser F-score=0.924; named entity recognizer and system-level evaluation F-score=0.715 for exact spans and 0.824 for overlapping spans. Accuracy for the concept mapping, negation, and status attributes was 0.957, 0.943, and 0.859, respectively, for exact spans and 0.580, 0.939, and 0.839, respectively, for overlapping spans. Overall performance is discussed against five applications. The cTAKES annotations are the foundation for methods and modules for higher-level semantic processing of clinical free text.
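The distinction between exact and overlapping spans in the named entity results can be illustrated with a minimal sketch. It is not the evaluation code used for cTAKES; the function names, the `(start, end)` offset representation, and the example spans are all assumptions chosen for illustration. Under exact matching a predicted span counts only if its boundaries equal those of a gold span, while under overlapping matching any shared character is enough.

```python
from typing import List, Tuple

# Hypothetical representation: (start, end) character offsets, end exclusive.
Span = Tuple[int, int]


def overlaps(a: Span, b: Span) -> bool:
    """Return True if the two spans share at least one character."""
    return a[0] < b[1] and b[0] < a[1]


def f_score(gold: List[Span], predicted: List[Span], exact: bool = True) -> float:
    """Compute the F-score (harmonic mean of precision and recall) over spans.

    exact=True  -> a span matches only if both boundaries are identical.
    exact=False -> a span matches if it overlaps any span on the other side.
    """
    if exact:
        matched_pred = sum(1 for p in predicted if p in gold)
        matched_gold = sum(1 for g in gold if g in predicted)
    else:
        matched_pred = sum(1 for p in predicted if any(overlaps(p, g) for g in gold))
        matched_gold = sum(1 for g in gold if any(overlaps(g, p) for p in predicted))

    precision = matched_pred / len(predicted) if predicted else 0.0
    recall = matched_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Made-up example: one exact hit, one partial hit, one miss.
gold_spans = [(0, 5), (10, 20), (30, 35)]
pred_spans = [(0, 5), (12, 20), (40, 45)]

exact_f = f_score(gold_spans, pred_spans, exact=True)
overlap_f = f_score(gold_spans, pred_spans, exact=False)
```

In this toy example the partially matching span `(12, 20)` is rejected under exact matching but accepted under overlapping matching, which is why an overlapping-span F-score is never lower than the exact-span one for the same output.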
Journal of the American Medical Informatics Association – Oxford University Press
Published: Sep 1, 2010