R/Collect.thread.reddit.R
, R/wrappers.R
Collect.thread.reddit.Rd
Collects comments made by users on one or more specified subreddit conversation threads and structures
the data into a dataframe with the class names "datasource"
and "reddit"
.
# S3 method for thread.reddit
Collect(
credential,
endpoint,
threadUrls,
sort = NA,
waitTime = c(6, 8),
ua = getOption("HTTPUserAgent"),
writeToFile = FALSE,
verbose = FALSE,
...
)
collect_reddit_threads(
threadUrls,
sort = "best",
waitTime = c(6, 8),
ua = vsml_ua(),
writeToFile = FALSE,
verbose = FALSE,
...
)
A credential
object generated from Authenticate
with class name "reddit"
.
API endpoint.
Character vector. Reddit thread urls to collect data from.
Character vector. Reddit comment sort order. Options are "best"
, "top"
, "new"
,
"controversial"
, "old"
, and "qa"
. Default is NA
.
Numeric vector. Time range in seconds to select random wait from in-between url collection requests.
Minimum is 3 seconds. Default is c(6, 8)
for a wait time chosen from between 6 and 8 seconds.
Character string. Override User-Agent string to use in Reddit thread requests. Default is
option("HTTPUserAgent")
value as set by vosonSML.
Logical. Write collected data to file. Default is FALSE
.
Logical. Output additional information about the data collection. Default is TRUE
.
Additional parameters passed to function. Not used in this method.
A tibble
object with class names "datasource"
and "reddit"
.
The reddit web endpoint used for collection has maximum limit of 500 comments per thread url.