#and different naming conventions EVEN WITHIN dfs from the same source
Explore tagged Tumblr posts
Text
I am willing to accept a degree of variable repetition when you’ve clearly pieced a dataframe together from multiple sources.
But.
Having Year and Year.Code from the same dataframe, with the EXACT same values, RIGHT next to each other, is too much. Clean your data even SLIGHTLY before making it publicly available, I BEG of you
#im having fun its just that i. have to piece together six seperate dataframes for this project#theres so many redundant columns#and different naming conventions EVEN WITHIN dfs from the same source#i feel so sorry for the people that have to work with my data files after me. but at least i trim my columns
0 notes