“QA test”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
Lr讨论 | 贡献
Tool
 
(相同用户的68个中间修订版本未显示)
第1行: 第1行:
==140901==
+
==Laboratory==
===TREC TEST===
+
* [[Opensource: Natural Language Process]]
* Data set: http://cogcomp.cs.illinois.edu/Data/QA/QC/
+
* [[open system]]
* Method: vsm-tfidf/No classifier
+
* classes:9-bigclasses,,48-smallclasses
+
* Result:
+
  
{| border="2px"
+
==Tool==
|+ classification result
+
* SEMPRE (QA toolkit) [http://www-nlp.stanford.edu/software/sempre/]
|-
+
* Z-MERT[http://www.cs.jhu.edu/~ozaidan/zmert/]
! Training Set !! 1000 !! 2000 !! 3000 !! 4000 !! 5500
+
* templatemaker[https://github.com/paulsmith/templatemaker]
|-
+
:*可以从一堆输入样板句子中提取他们的不变部分,然后利用模板可以做匹配检查,成分提取等。对清理Web数据,简单的模式学习超级有用
! bigclass
+
* SPMF: A Java Open-Source Pattern Mining Library
| 0.678 || 0.718 || 0.708 || 0.708 || 0.73
+
:* SPMF is a cross-platform library implemented in Java, specialized for discovering patterns in transaction and sequence databases such as frequent itemsets, association rules and sequential patterns.clustering.
|-
+
! smallclass
+
| 0.58 || 0.606 || 0.606 || 0.616 || 0.628
+
|-
+
|}
+
  
==NanShanData==
+
==Paper==
===Data Set===
+
*[[2014-10-08:qa]]
* big class:教育,社保,就业,医疗,住房,婚育收养,证件办理,资质认定,企业开办,经营纳税,公用事业
+
*[[2014-08-22-qalr]]
* small class:
+
* search in ML
:* 教育:学期教育,小学教育,初中教育,高中教育,职业教育,继续教育,特殊教育,教育救助
+
:* ML for Search and Ads(刘铁岩) NLPCC 2014[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:L06-ML_for_Search_and_Ads_-_ADL52.pdf]
 +
:* emantic Matching in Search_ADL [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:L04-Semantic_Matching_in_Search_ADL_Jun_XU_final.pdf]
 +
* 知识图谱
 +
:* Constructing and Mining Web-scale Knowledge Graphs(KDD 2014)[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/4/4c/Kdd2014_gabrilovich_bordes_knowledge_graphs.pdf]
 +
:* 垂直知识图谱工具与应用[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/6f/%E5%9E%82%E7%9B%B4%E7%9F%A5%E8%AF%86%E5%9B%BE%E8%B0%B1%E5%B7%A5%E5%85%B7%E4%B8%8E%E5%BA%94%E7%94%A810%E6%9C%8816%E6%97%A5.pdf]
 +
:* 知识图谱:大数据语义链接的基石[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c6/%E7%9F%A5%E8%AF%86%E5%9B%BE%E8%B0%B1%EF%BC%9A%E5%A4%A7%E6%95%B0%E6%8D%AE%E8%AF%AD%E4%B9%89%E9%93%BE%E6%8E%A5%E7%9A%84%E5%9F%BA%E7%9F%B3-%E6%9D%8E%E6%B6%93%E5%AD%90_%281%29.pdf]
 +
:* Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph[]
  
 +
==huilan==
 +
*[[huilian-work]]
 +
*[[qa-Algorithm]]
 +
*[[others]]
  
===140905===
+
==TEST==
 
+
*[[TREC TEST]]
*Data Set
+
*[[NanShanData]]
:*
+

2014年12月29日 (一) 04:02的最后版本

Laboratory

Tool

  • SEMPRE (QA toolkit) [1]
  • Z-MERT[2]
  • templatemaker[3]
  • 可以从一堆输入样板句子中提取他们的不变部分,然后利用模板可以做匹配检查,成分提取等。对清理Web数据,简单的模式学习超级有用
  • SPMF: A Java Open-Source Pattern Mining Library
  • SPMF is a cross-platform library implemented in Java, specialized for discovering patterns in transaction and sequence databases such as frequent itemsets, association rules and sequential patterns.clustering.

Paper

  • ML for Search and Ads(刘铁岩) NLPCC 2014[4]
  • emantic Matching in Search_ADL [5]
  • 知识图谱
  • Constructing and Mining Web-scale Knowledge Graphs(KDD 2014)[6]
  • 垂直知识图谱工具与应用[7]
  • 知识图谱:大数据语义链接的基石[8]
  • Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph[]

huilan

TEST