Posts Tagged ‘#datamining’

Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing (The Datasaurus Dozen) | Autodesk Research

May 15, 2017

https://www.autodeskresearch.com/publications/samestats

https://twitter.com/JustinMatejka/status/859075295059562498

great viz

An Introduction to Statistical Learning: with Applications in R – Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani – Google Books

February 13, 2016

https://books.google.com/books?id=qcI_AAAAQBAJ&printsec=frontcover&dq=An+Introduction+to+Statistical+Learning:+with+Applications+in+R+type:pdf&hl=en&sa=X&ved=0ahUKEwjy9uaMjfXKAhUKSiYKHStsA0YQ6AEIKzAA#v=onepage&q&f=false

https://books.google.com/books?id=qcI_AAAAQBAJ&printsec=frontcover&dq=An+Introduction+to+Statistical+Learning:+with+Applications+in+R+type:pdf&hl=en&sa=X&ved=0ahUKEwjy9uaMjfXKAhUKSiYKHStsA0YQ6AEIKzAA#v=onepage&q&f=false

They’re Watching You at Work

September 21, 2014

They’re Watching You at Work http://www.theatlantic.com/magazine/archive/2013/12/theyre-watching-you-at-work/354681 Will HR analytics be a corporate big brother or personal coach? #Datamining & #Privacy

My public notes from KDD 2014

August 31, 2014

https://storify.com/markgerstein/tweets-related-to-kdd-2014-i0kdd-kdd2014

https://www.flickr.com/photos/mbgmbg/tags/seriesspacyworldofbloombergbldg

https://linkstream2.gerstein.info/tag/i0kdd/

http://archive.gersteinlab.org/meetings/s/2014/08.28/kdd2014-i0kdd-meeting-materials/ (need password)

http://www.kdd.org/kdd2014/

PLOS Computational Biology: Improving Breast Cancer Survival Analysis through Competition-Based Multidimensional Modeling

August 31, 2014

– apply to metabric consortium
– 17K clin feat. + ~50K gene exp. + ~30K CNVs ==to-predict==> 10yr survival – uses CI instead of AUC for real valued predictions
– combine collaboration & competition to beat the baseline (cox regression on only clinical features)
– mol. feat. on their own don’t work well due to the curse of dimensionality – features more important than the learning method

http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003047

Pandey mentions: Cancer Survival Analysis through
Competition-Based…Modeling, using Human #Ensembles
http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003047 #kdd2014

IEEE Xplore Abstract – A Comparative Analysis of Ensemble Classifiers: Case Studies in Genomics

August 24, 2014

Pandey mentions: Comparative Analysis of #Ensemble Classifiers [eg mean agg. or stacking]…in Genomics
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6729565&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D6729565 #kdd2014

performance-diversity tradeoff: should one incl. higher performance, lower diversity ones…. but still adding diversity is good

related to https://github.com/shwhalen/datasink

Ensemble Methods in Machine Learning. Proceedings of the First International Workshop on Multiple Classifier Systems

July 13, 2014

Rich C, Alexandru N-M, Geoff C, Alex K (2004) Ensemble selection from libraries of
models. Proceedings of the twenty-first international conference on Machine learning. Banff, Alberta, Canada: ACM.
http://www.niculescu-mizil.org/papers/shotgun.icml04.revised.rev2.pdf

Thomas GD (2000) Ensemble Methods in Machine Learning. Proceedings of the First International Workshop on Multiple Classifier Systems: Springer-Verlag.
http://www.eecs.wsu.edu/~holder/courses/CptS570/fall07/papers/Dietterich00.pdf http://dl.acm.org/citation.cfm?id=743935

.@deniseOme Good ref is TG Dietterich #Ensemble Methods in
#MachineLearning MCS ’00
http://www.eecs.wsu.edu/~holder/courses/CptS570/fall07/papers/Dietterich00.pdf Not rel. to @ensembl #ismb #afp14

ref 17 & 18

Information Fiduciary: Solution to Facebook digital gerrymandering | New Republic

June 14, 2014

Facebook Could Decide an Election—Without You Ever Finding Out. @zittrain advocates regulating digital gerrymandering
http://www.newrepublic.com/article/117878/information-fiduciary-solution-facebook-digital-gerrymandering

BIOKDD 2014

June 7, 2014

http://www.kdd.org/kdd2014/
8/24-8/27

13th International Workshop on Data Mining in Bioinformatics (BIOKDD’14) August 24, 2014 * New York City, NY, USA
http://home.biokdd.org/biokdd14

The Wedding Data: What Marriage Notices Say About Social Change – Megan Garber – The Atlantic

September 8, 2013

Interesting site for social #datamining: http://weddingcrunchers.com via @laurahelmuth @Slate

http://www.theatlantic.com/technology/archive/2013/09/the-wedding-data-what-marriage-notices-say-about-social-change/279411/