June 6, 2018

Great talk today @Yale by @MooreJH. He describes flow of calculations in biomed. #DataScience, including feature construction, machine learning & downstream interpretation.

Great slide on ML derived from

Carl Zimmer To Speak At Bio-IT World, Tackle Heredity, Genes, And How Our Understanding Of The Two Is Changing – Bio-IT World

May 11, 2018

“It was a huge amount of fun watching them take that raw data and put it through their own pipelines,” Zimmer told me, but he also felt uncomfortable pointing out discrepancies to the scientists he worked with. “I still remember, I was sitting down with Chris Mason at Weill Cornell. He and his students were so enthusiastically going through their findings with me… and they showed me, among other things, how many SNPs I had. Not too long beforehand I’d gone through the same experience with Mark Gerstein and his team at Yale, and their numbers for my SNPs were off by hundreds of thousands. … It was a little awkward with Chris, but I just said, ‘Hey, I got a very different number from Mark Gerstein,’ and Chris just shrugged and said, ‘Oh yeah, that happens.’”

It turns out, there’s a lot about our current understanding of our genes and how we pass them on that isn’t perfectly clear cut.

Carl Zimmer’s tweets from the last class – for ref.

May 1, 2018

Reporting Grades < Yale University

April 22, 2018

Term grades for the spring term are due seven days after the end of final examination period; in 2017–2018, this date is May 16, 2018. In the spring term, grades are due for seniors within forty-eight hours of the end of final examination period; in 2017–2018, this date is May 11, 2018.

Points of significance: Machine learning: supervised methods

March 3, 2018

Points of significance – #MachineLearning: supervised methods. Nice discussion of the k in k-NN & the slack parameter C (penalizing misclassified points) in SVM, both of which act somewhat analogously as regularizers. Good for #teaching
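For teaching, the regularizing role of k can be shown in a few lines. This is only a rough sketch with made-up 1-D data (the points, labels and the helper name `knn_predict` are all hypothetical, not from the article): with k=1 a single mislabeled neighbor decides the vote, while a larger k smooths it out.

```python
from collections import Counter

def knn_predict(train, query, k):
    """Majority vote among the k training points nearest to query (1-D)."""
    nearest = sorted(train, key=lambda p: abs(p[0] - query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: class "a" near 0, class "b" near 10,
# plus one mislabeled "b" point at x = 1.0 inside the "a" cluster.
train = [(0.0, "a"), (0.5, "a"), (1.5, "a"), (2.0, "a"),
         (1.0, "b"),
         (9.0, "b"), (9.5, "b"), (10.0, "b")]

pred_k1 = knn_predict(train, 1.1, k=1)  # follows the single noisy neighbor
pred_k5 = knn_predict(train, 1.1, k=5)  # vote over the local cluster
print(pred_k1, pred_k5)
```

The SVM analogue runs the other way: a smaller C tolerates more misclassified points and gives a smoother boundary, much as a larger k does here.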

Genic Intolerance to Functional Variation and the Interpretation of Personal Genomes

February 24, 2018

Genic Intolerance to Functional Variation & the Interpretation of Personal Genomes: Nice plot of the number of rare vs. common variants in each gene to find outliers particularly tolerant to impactful (e.g. #LOF) mutations

Petrovski et al ’13
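A loose sketch of the outlier idea, not the paper's actual procedure: assuming each gene has a total variant count and a count of common functional variants, fit a line across genes and look at the residuals. The gene names and counts below are invented purely for illustration, and raw residuals are used for simplicity.

```python
def residuals(xs, ys):
    """Ordinary least-squares fit of ys on xs; returns the raw residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]

# Made-up counts per gene: (total variants, common functional variants).
genes = {"GENE_A": (10, 2), "GENE_B": (20, 4), "GENE_C": (30, 6),
         "GENE_D": (40, 8), "GENE_E": (25, 15)}

totals = [t for t, _ in genes.values()]
common = [c for _, c in genes.values()]
scores = dict(zip(genes, residuals(totals, common)))

# A large positive residual means more common functional variation than
# the trend predicts, i.e. a candidate "tolerant" outlier.
most_tolerant = max(scores, key=scores.get)
print(most_tolerant, scores[most_tolerant])
```

Genes with strongly negative residuals would sit at the other end, carrying less common functional variation than their total variant count suggests.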

Vertabelo for db design

February 10, 2018

How can we effectively and efficiently teach statistical thinking and computation to students with little to no background in either? How can we equip them with the skills and tools for reasoning with various types of data and leave them wanting to learn more?

In this talk we describe an introductory data science course that is our (working) answer to these questions. ….

Training Calendar | Research Data Support

September 16, 2017

The Research Data Support website has published a unified calendar for data and research skills training provided by the Library, Center for Research Computing, Medical Library, and Center for Teaching and Learning.

DataScience related courses at Yale

July 27, 2017

The Research Data Consultation Group has considered aggregating data science training information into a unified calendar.

Also, there’s an instruction calendar at the library