Visualization of Power Analysis http://amarder.GITHUB.io/power-analysis/ Useful sliders giving one a feel of the #statistics

## Posts Tagged ‘ans’

### Visualization of Statistical Power Analysis

July 28, 2016### Software licenses and retraction

November 25, 2015Treefinder Retraction Note [interestingly, done editorially due to change in software-license terms] http://www.biomedcentral.com/1471-2148/15/243 HT @bornalibran

### Panorama of ancient metazoan macromolecular complexes : Nature : Nature Publishing Group

September 26, 2015Panorama of ancient metazoan #macromolecular complexes

http://www.nature.com/nature/journal/v525/n7569/abs/nature14877.html Finding many more #complexes from integrating many co-elutions

http://www.nature.com/nature/journal/v525/n7569/full/nature14877.html#affil-auth

### IBM Research: Preserving Validity in Adaptive Data Analysis

September 23, 2015Preserving Validity in Adaptive Data Analysis http://ibmresearchnews.blogspot.com/2015/08/preserving-validity-in-adaptive-data_6.html Using differential #privacy for correct #stats even w/ test-set reuse

QT:{{"

“A common next step would be to use the least-squares linear regression to check whether a simple linear combination of the three strongly correlated foods can predict the grade. It turns out that a little combination goes a long way: we discover that a linear combination of the three selected foods can explain a significant fraction of variance in the grade (plotted below). The regression analysis also reports that the p-value of this result is 0.00009 meaning that the probability of this happening purely by chance is less than 1 in 10,000.

Recall that no relationship exists in the true data distribution, so this discovery is clearly false. This spurious effect is known to experts as Freedman’s paradox. It arises since the variables (foods) used in the regression were chosen using the data itself.

…

We found that challenges of adaptivity can be addressed using techniques developed for privacy-preserving data analysis. These techniques rely on the notion of differential privacy that guarantees that the data analysis is not too sensitive to the data of any single individual. We rigorously demonstrated that ensuring differential privacy of an analysis also guarantees that the findings will be statistically valid. We then also developed additional approaches to the problem based on a new way to measure how much information an analysis reveals about a dataset.

The Thresholdout Algorithm

Using our new approach we designed an algorithm, called Thresholdout, that allows an analyst to reuse the holdout set of data for validating a large number of results, even when those results are produced by an adaptive analysis.

"}}

### seasonal effects on gene expression

August 9, 2015Widespread [25% genes] seasonal…expression reveals [circ]annual differences in…immunity, relevant for vaccination http://www.nature.com/ncomms/2015/150512/ncomms8000/full/ncomms8000.html

In addition to circadian rhythms, batch effects, now consider seasonal effects on gene expression

### Illumina poster of -seq experiments

August 9, 2015Poster of -Seq expts from @Illumina

http://www.illumina.com/content/dam/illumina-marketing/documents/applications/ngs-library-prep/ForAllYouSeqMethods.pdf Nextgen update to @Roche’s famous biochem. #pathway chart

http://biochemical-pathways.com/#/map/1

Originally Boehringer Mannheim chart of pathways

### Single cell Genome + Transcriptome Sequencing

July 28, 2015G&T-seq: parallel…#singlecell genomes & transcriptomes by @CGATist

lab & others http://www.nature.com/nmeth/journal/vaop/ncurrent/full/nmeth.3370.html low cov. matched data on 130+ cells

https://twitter.com/CGATist/status/604271101678587904

G&T-seq: parallel sequencing of single-cell genomes and transcriptomes

Iain C Macaulay,

Wilfried Haerty,

Parveen Kumar,

Yang I Li,

Tim Xiaoming Hu,

Mabel J Teng,

Mubeen Goolam,

Nathalie Saurat,

Paul Coupland,

Lesley M Shirley,

Miriam Smith,

Niels Van der Aa,

Ruby Banerjee,

Peter D Ellis,

Michael A Quail,

Harold P Swerdlow,

Magdalena Zernicka-Goetz,

Frederick J Livesey,

Chris P Ponting

& Thierry Voet

Nature Methods 12, 519–522 (2015) doi:10.1038/nmeth.3370Received 18 November 2014 Accepted 27 March 2015 Published online 27 April 2015

### CRAM vs BAM

June 10, 2015Benchmarking

http://www.htslib.org/benchmarks/CRAM.html

### Found this review

May 31, 2015Nice graph of seq. machine output v time

http://www.cell.com/molecular-cell/abstract/S1097-2765(15)00340-8