“Opensource: Natural Language Process”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
Lr讨论 | 贡献
 
第3行: 第3行:
 
*Lingpipe: http://alias-i.com/lingpipe/index.html
 
*Lingpipe: http://alias-i.com/lingpipe/index.html
 
*fudannlp(复旦大学,中文NLP):https://code.google.com/p/fudannlp/
 
*fudannlp(复旦大学,中文NLP):https://code.google.com/p/fudannlp/
*Python NLTK  
+
*Python NLTK[http://nltk.org/]
 
*OpenNLP: http://opennlp.apache.org/
 
*OpenNLP: http://opennlp.apache.org/
*GATE
+
*GATE[http://gate.ac.uk/]
 
*BALIE, anguage identification, tokenization, sentence boundary detection, named-entity recognition.
 
*BALIE, anguage identification, tokenization, sentence boundary detection, named-entity recognition.
 
==Topic Modeling==
 
==Topic Modeling==
*D. Blei homepage, topic modeling
+
*D. Blei homepage, topic modeling[http://www.cs.princeton.edu/~blei/topicmodeling.html]
*Mallet
+
*Mallet[http://chentingpc.me/article/mallet.cs.umass.edu]
*Gensim,一个python写的topic modeling的开源项目:http://radimrehurek.com/gensim/
+
*Gensim[http://balie.sourceforge.net/],一个python写的topic modeling的开源项目:http://radimrehurek.com/gensim/
 
==Academic==
 
==Academic==
*Reference extraction: The cb2Bib is a free, open source, and multiplatform application for rapidly extracting unformatted, or unstandardized bibliographic references from email alerts, journal Web pages, and PDF files.
+
*Reference extraction: The cb2Bib[http://www.molspaces.com/d_cb2bib-overview.php] is a free, open source, and multiplatform application for rapidly extracting unformatted, or unstandardized bibliographic references from email alerts, journal Web pages, and PDF files.
*Crossref lab,crossref好像是搞学术文章索引的,核心点在于DOI? Anyway,它的lab页面收录了不少好的开源工具,比如可以做PDF文件的抽取等。
+
*Crossref lab[http://labs.crossref.org/index.html],crossref好像是搞学术文章索引的,核心点在于DOI? Anyway,它的lab页面收录了不少好的开源工具,比如可以做PDF文件的抽取[http://labs.crossref.org/styled-6/pdf_extract.html]等。
 
*Bible Passage Reference Parser: https://github.com/souliberty/Bible-Passage-Reference-Parser
 
*Bible Passage Reference Parser: https://github.com/souliberty/Bible-Passage-Reference-Parser
  
 
reference site:http://chentingpc.me/article/?id=430
 
reference site:http://chentingpc.me/article/?id=430

2014年10月9日 (四) 09:03的最后版本

Integrated System

Topic Modeling

Academic

  • Reference extraction: The cb2Bib[6] is a free, open source, and multiplatform application for rapidly extracting unformatted, or unstandardized bibliographic references from email alerts, journal Web pages, and PDF files.
  • Crossref lab[7],crossref好像是搞学术文章索引的,核心点在于DOI? Anyway,它的lab页面收录了不少好的开源工具,比如可以做PDF文件的抽取[8]等。
  • Bible Passage Reference Parser: https://github.com/souliberty/Bible-Passage-Reference-Parser

reference site:http://chentingpc.me/article/?id=430