perennial-beginner
perennial-beginner
The Perennial Beginner
7 posts
Don't wanna be here? Send us removal request.
perennial-beginner · 5 years ago
Link
Example of doing tf-idf for Chinese in Python.
0 notes
perennial-beginner · 5 years ago
Text
Create a new column by applying function to existing column
df['new_column'] = df['column'].apply(lambda x: function(x))
0 notes
perennial-beginner · 5 years ago
Link
Instead use the new nltk.parse.corenlp.CoreNLPParser API and NLTK v3.3. 
The same instructions are given on the nltk Github repo: https://github.com/nltk/nltk/wiki/Stanford-CoreNLP-API-in-NLTK
Most recent version, published in 2018.
0 notes
perennial-beginner · 5 years ago
Text
Make a Shiny mobile app.
0 notes
perennial-beginner · 5 years ago
Text
Logit function
Tumblr media
From Essential Statistics for Data Scientists
0 notes
perennial-beginner · 5 years ago
Photo
Tumblr media
## Import libraries library(ClustOfVar) library(PCAmixdata) library(dendextend) ## Split up continuous and categorical varibles split <- splitmix(PimaIndiansDiabetes2) X1 <- split$X.quanti X2 <- split$X.quali ## Hierarchical clustering tree <- hclustvar(X.quanti = X1, X.quali = X2) ## Evaluate the stability of each partition stability(tree, B=40) ## 60 bootstrap samples ## Plot dend <- tree %>% as.dendrogram %>% hang.dendrogram dend %>% color_branches(k=5) %>% color_labels(k=5) %>% plot(horiz=TRUE)
0 notes
perennial-beginner · 5 years ago
Photo
Tumblr media
library(DataExplorer) plot_correlation(df)
0 notes