textrank: Summarize Text by Ranking Sentences and Finding Keywords

The 'textrank' algorithm is an extension of the 'Pagerank' algorithm for text. The algorithm allows to summarize text by calculating how sentences are related to one another. This is done by looking at overlapping terminology used in sentences in order to set up links between sentences. The resulting sentence network is next plugged into the 'Pagerank' algorithm which identifies the most important sentences in your text and ranks them. In a similar way 'textrank' can also be used to extract keywords. A word network is constructed by looking if words are following one another. On top of that network the 'Pagerank' algorithm is applied to extract relevant words after which relevant words which are following one another are combined to get keywords. More information can be found in the paper from Mihalcea, Rada & Tarau, Paul (2004) <https://www.aclweb.org/anthology/W04-3252/>.

Version: 0.3.1
Imports: utils, data.table (≥ 1.9.6), igraph, digest
Suggests: textreuse, knitr, rmarkdown, udpipe (≥ 0.2)
Published: 2020-10-12
Author: Jan Wijffels [aut, cre, cph], BNOSAC [cph]
Maintainer: Jan Wijffels <jwijffels at bnosac.be>
License: MPL-2.0
URL: https://github.com/bnosac/textrank
NeedsCompilation: no
Materials: README NEWS
In views: NaturalLanguageProcessing
CRAN checks: textrank results

Documentation:

Reference manual: textrank.pdf
Vignettes: Textrank for summarizing text

Downloads:

Package source: textrank_0.3.1.tar.gz
Windows binaries: r-devel: textrank_0.3.1.zip, r-release: textrank_0.3.1.zip, r-oldrel: textrank_0.3.1.zip
macOS binaries: r-release (arm64): textrank_0.3.1.tgz, r-oldrel (arm64): textrank_0.3.1.tgz, r-release (x86_64): textrank_0.3.1.tgz
Old sources: textrank archive

Reverse dependencies:

Reverse imports: finnsurveytext

Linking:

Please use the canonical form https://CRAN.R-project.org/package=textrank to link to this page.