bdpar: Big Data Preprocessing Architecture

Provides a tool for building customized data flows that pre-process large volumes of information from different sources. To this end, 'bdpar' allows users to (i) easily use and create new functionalities and (ii) develop new data source extractors according to their needs. Additionally, the package ships with a predefined data flow that extracts and pre-processes the most relevant information (tokens, dates, ...) from several textual sources (SMS, email, YouTube comments).
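
The idea of a data flow built from interchangeable processing steps can be illustrated with a minimal base R sketch. This is not the 'bdpar' API: the names to_lower_pipe, tokenize_pipe and run_flow, as well as the list layout of an instance, are hypothetical and chosen only for illustration. See the reference manual and vignettes for the classes and functions the package actually provides.

```r
# Conceptual sketch only: plain base R, NOT the bdpar API.
# Each "pipe" is a function that receives an instance (a list) and returns it modified.

# Hypothetical pipe: normalise the text to lower case.
to_lower_pipe <- function(instance) {
  instance$data <- tolower(instance$data)
  instance
}

# Hypothetical pipe: split the text into tokens and store them as a property.
tokenize_pipe <- function(instance) {
  tokens <- strsplit(instance$data, "[^[:alnum:]]+")[[1]]
  instance$properties$tokens <- tokens[nzchar(tokens)]  # drop empty strings
  instance
}

# Hypothetical runner: apply the pipes, in order, to a single instance.
run_flow <- function(instance, pipes) {
  Reduce(function(acc, pipe) pipe(acc), pipes, instance)
}

# A toy instance standing in for one SMS message.
sms <- list(source = "sms", data = "Call me at 5 PM, OK?", properties = list())

result <- run_flow(sms, list(to_lower_pipe, tokenize_pipe))
print(result$properties$tokens)
```

Because every step shares the same "instance in, instance out" shape, adding a new functionality amounts to writing one more function and appending it to the list passed to run_flow.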

Version: 3.1.0
Depends: R (≥ 3.5.0)
Imports: digest, parallel, R6, rlist, tools, utils
Suggests: cld2, knitr, rex, rjson, rmarkdown, stringi, stringr, testthat (≥ 2.3.1), tuber
Published: 2023-12-12
Author: Miguel Ferreiro-Díaz [aut, cre], David Ruano-Ordás [aut, ctr], Tomás R. Cotos-Yañez [aut, ctr], José Ramón Méndez Reboredo [aut, ctr], University of Vigo [cph]
Maintainer: Miguel Ferreiro-Díaz <miguel.ferreiro.diaz at gmail.com>
BugReports: https://github.com/miferreiro/bdpar/issues
License: GPL-3
URL: https://github.com/miferreiro/bdpar
NeedsCompilation: no
SystemRequirements: Python (>= 2.7 or >= 3.6)
Materials: NEWS
CRAN checks: bdpar results

Documentation:

Reference manual: bdpar.pdf
Vignettes: A Brief Introduction to bdpar, Basic example using bdpar package, Image processing example using bdpar package

Downloads:

Package source: bdpar_3.1.0.tar.gz
Windows binaries: r-devel: bdpar_3.1.0.zip, r-release: bdpar_3.1.0.zip, r-oldrel: bdpar_3.1.0.zip
macOS binaries: r-release (arm64): bdpar_3.1.0.tgz, r-oldrel (arm64): bdpar_3.1.0.tgz, r-release (x86_64): bdpar_3.1.0.tgz
Old sources: bdpar archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=bdpar to link to this page.