Package 'thomashardyr'

Title: An R Package for Thomas Hardy's Novels
Description: This package provides access to the full texts of 6 of Thomas Hardy's novels.
Authors: Matthew Henderson [aut, cre]
Maintainer: Matthew Henderson <[email protected]>
License: MIT + file LICENSE
Version: 0.2.1
Built: 2024-10-26 04:10:22 UTC
Source: https://github.com/MHenderson/thomashardyr

Help Index


The text of Thomas Hardy's novel "Under the Greenwood Tree"

Description

A dataset containing the text of Thomas Hardy's novel "Under the Greenwood Tree", published in 1872. The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

greenwood

Format

A character vector with 7003 elements

Source

http://www.gutenberg.org/ebooks/2662


Tidy data frame of 7 of Thomas Hardy's novels

Description

Returns a tidy data frame of 7 of Thomas Hardy's completed, published novels with two columns: text, which contains the text of the novels divided into elements of up to about 70 characters each, and book, which contains the titles of the novels as a factor in order of publication.

Usage

hardy_books()

Value

A data frame with two columns: text and book

Examples

library(dplyr)

hardy_books() %>% group_by(book) %>%
     summarise(total_lines = n())

The text of Thomas Hardy's novel "Jude the Obscure"

Description

A dataset containing the text of Thomas Hardy's novel "Jude the Obscure", published in 1895. The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

jude

Format

A character vector with 15848 elements

Source

http://www.gutenberg.org/ebooks/153


The text of Thomas Hardy's novel "Far from the Madding Crowd"

Description

A dataset containing the text of Thomas Hardy's 1874 novel "Far from the Madding Crowd". The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

maddingcrowd

Format

A character vector with 16293 elements

Source

http://www.gutenberg.org/ebooks/27


The text of Thomas Hardy's novel "The Mayor of Casterbridge"

Description

A dataset containing the text of Thomas Hardy's 1886 novel "The Mayor of Casterbridge". The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

mayor

Format

A character vector with 12464 elements

Source

http://www.gutenberg.org/ebooks/143


The text of Thomas Hardy's novel "The Return of the Native"

Description

A dataset containing the text of Thomas Hardy's 1878 novel "The Return of the Native". The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

native

Format

A character vector with 15021 elements

Source

http://www.gutenberg.org/ebooks/122


The text of Thomas Hardy's novel "Tess of the D'Urbervilles"

Description

A dataset containing the text of Thomas Hardy's novel "Tess of the D'Urbervilles" published in 1891. The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

tess

Format

A character vector with 16091 elements

Source

http://www.gutenberg.org/ebooks/110


Thomas Hardy's Novels

Description

This package contains the complete text of Thomas Hardy's novels.

Author(s)

Maintainer: Matthew Henderson [email protected]

See Also

Useful links:


The text of Thomas Hardy's novel "The Woodlanders"

Description

A dataset containing the text of Thomas Hardy's 1887 novel "The Woodlanders". The UTF-8 plain text was sourced from Project Gutenberg and is divided into elements of up to about 70 characters each. (Some elements are blank.)

Usage

woodlanders

Format

A character vector with 14095 elements

Source

http://www.gutenberg.org/ebooks/482