Facilitates the creation of page layout visualizations in which words are represented as rectangles with sizes relating to the length of the words. Which then is divided in lines and pages for easy overview of up to quite large texts.
ggpage is a package to create pagestyled visualizations of text based
data. It uses ggplot2 and final returns are ggplot2 objects.
In this new version I have worked to include a lot of use cases that wasn’t available in the first version. These new elements are previewed in the vignette.
You can install ggpage from github with:
The package includes The Tinder-box by H.C. Andersen for examples.
library(tidyverse)library(ggpage)head(tinderbox, 10)## # A tibble: 10 x 2## text book## <chr> <chr>## 1 "A soldier came marching along the high road: \"Left, rig… The tinder-…## 2 had his knapsack on his back, and a sword at his side; he… The tinder-…## 3 and was now returning home. As he walked on, he met a ver… The tinder-…## 4 witch in the road. Her under-lip hung quite down on her b… The tinder-…## 5 "and said, \"Good evening, soldier; you have a very fine … The tinder-…## 6 knapsack, and you are a real soldier; so you shall have a… The tinder-…## 7 "you like.\"" The tinder-…## 8 "\"Thank you, old witch,\" said the soldier." The tinder-…## 9 "\"Do you see that large tree,\" said the witch, pointing… The tinder-…## 10 "beside them. \"Well, it is quite hollow inside, and you … The tinder-…
The basic workflow with ggpage is using either
ggpage_quickfor a quick one function call plot or,
ggpage_plotto do analysis (NLP for example) before the final plot is produced.
For a simple demonstration we apply
ggpage_quick to our
object. It is important that the data.frame that is used have the text
in a column named “text”.
# Also pipeable# tinderbox %>% ggpage_quick()
The same result would be achieved by using
tinderbox %>%ggpage_build() %>%ggpage_plot()
But this approach allows us to introduce more code between
ggpage_plot giving us multiple more ways to enhance
tinderbox %>%ggpage_build() %>%mutate(long_word = stringr::str_length(word) > 8) %>%ggpage_plot(aes(fill = long_word)) +labs(title = "Longer words throughout The Tinder-box") +scale_fill_manual(values = c("grey70", "blue"),labels = c("8 or less", "9 or more"),name = "Word length")
And it will work nicely with other tidyverse packages
library(ggpage)library(purrr)library(gganimate)prebuild <- tinderbox %>%ggpage_build() %>%left_join(get_sentiments("afinn"), by = "word")midbuild <- map_df(.x = 0:50 * 10 + 1,~ prebuild %>%mutate(score = ifelse(is.na(score), 0, score),score_smooth = zoo::rollmean(score, .x, 0),score_smooth = score_smooth / max(score_smooth),rolls = .x))p <- midbuild %>%ggpage_plot(aes(fill = score_smooth, frame = rolls)) +scale_fill_gradient2(low = "red", high = "blue", mid = "grey", midpoint = 0) +guides(fill = "none") +labs(title = "Smoothed sentiment of The Tinder-box, rolling average of")gganimate(p, interval = .2)
NEWS.mdfile to track changes to the package.