count_functions | Counting functions from textfeatures |
okc_text | OkCupid Text Data |
step_sequence_onehot | Generate the basic set of text features |
step_stem | Stemming of list-column variables |
step_stopwords | Filtering of stopwords from a list-column variable |
step_textfeature | Generate the basic set of text features |
step_texthash | Term frequency of tokens |
step_tf | Term frequency of tokens |
step_tfidf | Term frequency-inverse document frequency of tokens |
step_tokenfilter | Filter the tokens based on term frequency |
step_tokenize | Tokenization of character variables |
step_tokenmerge | Generate the basic set of text features |
step_untokenize | Untokenization of list-column variables |
step_word2vec | Calculates word2vec dimension estimates |
tidy.step_sequence_onehot | Generate the basic set of text features |
tidy.step_stem | Stemming of list-column variables |
tidy.step_stopwords | Filtering of stopwords from a list-column variable |
tidy.step_textfeature | Generate the basic set of text features |
tidy.step_texthash | Term frequency of tokens |
tidy.step_tf | Term frequency of tokens |
tidy.step_tfidf | Term frequency-inverse document frequency of tokens |
tidy.step_tokenfilter | Filter the tokens based on term frequency |
tidy.step_tokenize | Tokenization of character variables |
tidy.step_tokenmerge | Generate the basic set of text features |
tidy.step_untokenize | Untokenization of list-column variables |
tidy.step_word2vec | Calculates word2vec dimension estimates |