1) ATF×PDF
ATF×PDF
1.
This paper presents a keyword extraction method by first calculating word weight with ATF×PDF(Average Term Frequency *Proportional Document Frequency) and then determining the keywords by ajoint weighconsidering the semantic si milarity between words.
该文提出一种多文档关键词抽取方法,该方法提出ATF×PDF(Average Term Frequency×ProportionalDocument Frequency)来计算词语权重,并根据候选关键词之间的语义相似度,采用联合权重方法重新计算候选关键词的权重来抽取关键词。
2) ATF~*PDF
ATF~*PDF
1.
This paper presented a keyword extraction in multi-document based on TextRank method,calculated words weight with ATF~*PDF for selecting candidates,and constructed TextRank model rely on semantic relation between candidates,iterated the graph-based ranking algorithm until convergence,at last generated a list of keyword and extract- ed keyword.
本文提出一种基于TextRaak的多文档关键词抽取方法,该方法利用ATF~*PDF方法计算文档集中的词语权重,抽取权重较大的实词为候选关键词,并根据候选关键诃之问的语义相似关系建立TextRank模型,递归计算至收敛,最后生成关键词序列并抽取关键词。
3) ATF*PDF
ATF*PDF
1.
The method of TF*IDF can only be used for calculating words weight of single document,the paper proposes a method that is named as ATF*PDF (Average Term Frequency*Proportional Document Frequency) for calculating words weight of multi-document First calculate word s weight and select candidates,and then combine with semantic similarity between words so as to extract keywords.
考虑到TF*IDF方法仅适于计算词语在单个文档中的权重,本文提出一种计算词语权重的方法ATF*PDF(Average Term Frequency*Proportional Document Frequency),此方法能计算词语在多文档中的权重。