Posts Tagged ‘datascience’

How statistics lost their power – and why we should fear what comes next | William Davies | Politics | Th e Guardian

January 30, 2017

How stats lost their power via @alexvespi
https://www.theguardian.com/politics/2017/jan/19/crisis-of-statistics-big-data-democracy Death of #DataScience in a “post-truth” world; anecdotes v elitist numbers

News Outlets Wonder Where the Predictions Went Wrong – The New York Times

November 12, 2016

News Outlets Wonder Where the Predictions Went Wrong
http://www.nytimes.com/2016/11/10/business/media/news-outlets-wonder-where-the-predictions-went-wrong.html The real loser in the election: #datascience-based polling

Research Parasites

January 23, 2016

Dara sharing http://www.nejm.org/doi/full/10.1056/NEJMe1516564 Deems #datascientists as “research parasites,” using another’s data for their own ends via @dspakowicz

QT:{{”
“A second concern held by some is that a new class of research person will emerge — people who had nothing to do with the design and execution of the study but use another group’s data for their own ends, possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what the original investigators had posited. There is concern among some front-line researchers that the system will be taken over by what some researchers have characterized as “research parasites.””
“}}

10 types of regressions. Which one to use?

December 8, 2015

10 types of #regressions. Which one to use?
http://www.datasciencecentral.com/forum/topics/10-types-of-regressions-which-one-to-use Pitfalls of common approaches, eg linear or logistic via @KirkDBorne

Are Polls Ruining Democracy?

November 29, 2015

Are Polls Ruining Democracy?
http://www.newyorker.com/magazine/2015/11/16/politics-and-the-new-machine “#Datascience is the child of a rocky marriage between the academy & Silicon Valley”

QT:{{”
“If public-opinion polling is the child of a strained marriage between the press and the academy, data science is the child of a rocky marriage between the academy and Silicon Valley. The term “data science” was coined in 1960, one year after the Democratic National Committee hired Simulmatics Corporation, a company founded by Ithiel de Sola Pool, a political scientist from M.I.T., to provide strategic analysis in advance of the upcoming Presidential election. Pool and his team collected punch cards from pollsters who had archived more than sixty polls from the elections of 1952, 1954, 1956, 1958, and 1960, representing more than a hundred thousand interviews, and fed them into a UNIVAC. They then sorted voters into four hundred and eighty possible types (for example, “Eastern, metropolitan,
lower-income, white, Catholic, female Democrat”) and sorted issues into fifty-two clusters (for example, foreign aid). Simulmatics’ first task, completed just before the Democratic National Convention, was a study of “the Negro vote in the North.” Its report, which is thought to have influenced the civil-rights paragraphs added to the Party’s platform, concluded that between 1954 and 1956 “a small but
significant shift to the Republicans occurred among Northern Negroes, which cost the Democrats about 1 per cent of the total votes in 8 key states.” After the nominating convention, the D.N.C. commissioned Simulmatics to prepare three more reports, including one that involved running simulations about different ways in which Kennedy might discuss his Catholicism.”
“}}

NIH approves strategic vision to transform National Library of Medicine

June 17, 2015

Great focus on datascience for the NLMhttp://www.nih.gov/news/health/jun2015/od-11.htm

The Programmer’s Price

December 1, 2014

Programmer’s Price
http://www.newyorker.com/magazine/2014/11/24/programmers-price@10xmgmt: Talent agency for the developer stack, UI guru to datascientist, even a bioinformatician

American Chronicles NYer NOVEMBER 24, 2014 ISSUE
Want to hire a coding superstar? Call the agent.
BY LIZZIE WIDDICOMBE

QT:{{”
Solomon leaned back in his chair and flipped through a mental Rolodex of his clients. “I definitely have some ideas,” he said, after a minute. “The first person who comes to mind, he’s also a
bioinformatician.” He rattled off a dazzling list of accomplishments: the developer does work for the Scripps Research Institute, in La Jolla, where he is attempting to attack complicated biological problems using crowdsourcing, and had created Twitter tools capable of influencing elections. Solomon thought that he might be interested in AuthorBee’s use of Twitter. “He knows the Twitter A.P.I. in his sleep.”

And, like actual rock stars, rock-star developers come in a range of personality types. Guvench had briefed me at the coffee shop: front-end guys—designers and user-interface engineers—make products that interact with what he referred to as “normal” people. As a result, “they’re sort of hip,” he said. “Especially designers—they dress nicely.” The further you get down the “stack,” Guvench explained, “the more . . .” He paused. “ ‘Neckbeard’ is the word that comes to mind.” Back-end engineers, like data scientists and system administrators, “are the most brilliant people,” he said. “They may not be the most fun to talk to at a party, but they’re really fucking good at talking to computers.” Of course, he added, the stereotype doesn’t apply to his clients.
“}}

My public notes from the Yale Day of Data (#ydod2014, i0dataday)

September 30, 2014

https://linkstream2.gerstein.info/tag/i0dataday
http://elischolar.library.yale.edu/dayofdata/2014/
https://storify.com/markgerstein/tweets-related-to-the-yale-day-of-data-2014-ydod20

For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights

September 29, 2014

For #BigData Scientists, Janitor Work Is Key http://www.nytimes.com/2014/08/18/technology/for-big-data-scientists-hurdle-to-insights-is-janitor-work.html Is a #datascientist a digital maid or a data priest? Perhaps a hybrid.

O’Neil talk

Phylomemetics—Evolutionary Analysis beyond the Gene

September 21, 2014

Phylomemetics—Evolutionary Analysis beyond the Gene
http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001069 #Phylogenetics for texts, languages & artifacts (w/ recoding)