Fri, Apr 17, 2020, 8:30 AM
http://fridaycoffeemeetup.com
Random Posts
🔖 Advanced Data Analysis from an Elementary Point of View by Cosma Rohilla Shalizi
Advanced Data Analysis from an Elementary Point of View
by Cosma Rohilla ShaliziThis is a draft textbook on data analysis methods, intended for a one-semester course for advance undergraduate students who have already taken classes in probability, mathematical statistics, and linear regression. It began as the lecture notes for 36-402 at Carnegie Mellon University.
By making this draft generally available, I am not promising to provide any assistance or even clarification whatsoever. Comments are, however, welcome.
The book is under contract to Cambridge University Press; it should be turned over to the press before the end of 2015. A copy of the next-to-final version will remain freely accessible here permanently.
Table of contents:
I. Regression and Its Generalizations
- Regression Basics
- The Truth about Linear Regression
- Model Evaluation
- Smoothing in Regression
- Simulation
- The Bootstrap
- Weighting and Variance
- Splines
- Additive Models
- Testing Regression Specifications
- Logistic Regression
- Generalized Linear Models and Generalized Additive Models
- Classification and Regression Trees
II. Distributions and Latent Structure- Density Estimation
- Relative Distributions and Smooth Tests of Goodness-of-Fit
- Principal Components Analysis
- Factor Models
- Nonlinear Dimensionality Reduction
- Mixture Models
- Graphical Models
III. Dependent Data- Time Series
- Spatial and Network Data
- Simulation-Based Inference
IV. Causal Inference- Graphical Causal Models
- Identifying Causal Effects
- Causal Inference from Experiments
- Estimating Causal Effects
- Discovering Causal StructureAppendices
- Data-Analysis Problem Sets
- Reminders from Linear Algebra
- Big O and Little o Notation
- Taylor Expansions
- Multivariate Distributions
- Algebra with Expectations and Variances
- Propagation of Error, and Standard Errors for Derived Quantities
- Optimization
- chi-squared and the Likelihood Ratio Test
- Proof of the Gauss-Markov Theorem
- Rudimentary Graph Theory
- Information Theory
- Hypothesis Testing
- Writing R Functions
- Random Variable Generation
Planned changes:
- Unified treatment of information-theoretic topics (relative entropy / Kullback-Leibler divergence, entropy, mutual information and independence, hypothesis-testing interpretations) in an appendix, with references from chapters on density estimation, on EM, and on independence testing
- More detailed treatment of calibration and calibration-checking (part II)
- Missing data and imputation (part II)
- Move d-separation material from “causal models” chapter to graphical models chapter as no specifically causal content (parts II and IV)?
- Expand treatment of partial identification for causal inference, including partial identification of effects by looking at all data-compatible DAGs (part IV)
- Figure out how to cut at least 50 pages
- Make sure notation is consistent throughout: insist that vectors are always matrices, or use more geometric notation?
- Move simulation to an appendix
- Move variance/weights chapter to right before logistic regression
- Move some appendices online (i.e., after references)?
(Text last updated 30 March 2016; this page last updated 6 November 2015)
📺 The Daily Show with Trevor Noah: February 14, 2017
National Security Adviser Michael Flynn resigns and Republicans don't know how to handle questions about it, and Laverne Cox discusses the TV show "Doubt."
The Mike Flynn “cartoon” was an interesting method of storytelling.
There was a nice portion just before the interview in which Noah shows several examples of the ways in which the Trump administration tries to have things both ways, and then denies that they’re not telling the truth. Why must they waffle? Why can’t they just stand up and say “this is who we are.” We couldn’t do anything but respect them for that, but lying just ruins it all.
A map of Russia on the episode makes me wonder why I haven’t seen any pundits or comedians take the map and turn Russia’s highlight into Trump’s hair and then superimpose his face over South Asia. This image may help others with the idea:
🔖 Write, Right? Write! – TRU Writer
Welcome to a new experiment in simple but elegant web publishing. This site let’s you quickly publish full formatted and media rich articles, essays, papers — without requiring any logins or tracking of personal information. Don’t take our word for it, explore one piece published here, chosen at random.
A published work includes a header image which you can upload to the TRU Writer. Choose how you wish to credit yourself as an author, or choose to by anonymous.
You should be able to copy the contents of anything you have written in a Word Processor, or already published on a web page, paste it into the TRU Writer editor. Most standard formatting (headers, bold, italic, underline, lists, blockquotes, hypertext links) will be preserved.
You can then edit/augment your work using a rich text editor, including embedding content from social media sites, and you can upload new images to be included within the text of your writing.
So find an essay or article and see what you can do with it by publishing online with the TRU Writer.
This is implemented as a WordPress theme, so it can be created for many different sites. Learn more about TRU Writer and where to find the theme.
📺 The West Wing (NBC, 1999) Season 1, Episodes 1-4
Created by Aaron Sorkin. With Martin Sheen, Rob Lowe, Allison Janney, John Spencer. Inside the lives of staffers in the West Wing of the White House.
“Five Votes Down”
“A Proportional Response”
“Post Hoc, Ergo Propter Hoc”
“Pilot”
Watched on Netflix via Chromecast to Television
2011 USC Viterbi Lecture “Adventures in Coding Theory” by Elwyn Berklekamp
"Adventures in Coding Theory"
Professor Elwyn Berlekamp
University of California, BerkeleyGerontology Auditorium, Thursday, March 3, 4:30 to 5:30 p.m.
>> Click here for live wedcast
Abstract
The inventors of error-correcting codes were initially motivated by problems in communications engineering. But coding theory has since also influenced several other fields, including memory technology, theoretical computer science, game theory, portfolio theory, and symbolic manipulation. This talk will recall some forays into these subjects.--via viterbi.usc.edu
IndieWebCamp 2020 West: June 27-28, 2020
The broad ideas behind DoOO dovetail quite well and the IndieWeb community has a welcoming, inclusive, and helpful atmosphere with a solid code of conduct.
The upcoming event is called IndieWebCamp 2020 West (based roughly on the Pacific time Zone). I’ve already started proposing a few DoOO-related sessions on their organizing Etherpad. I’d encourage others in the community who are interested to register for the free two day camp to talk about what we can do with our websites and how we can improve them. Students, faculty, staff, and even hobbyists of all levels of ability are welcome. If you’ve got ideas for things you’re interested in doing on or with your website, feel free to propose your own topics (either now or the morning of day one).
We’d love to see everyone there.
Day one is a brief introduction followed by various discussion-based sessions on topics of interest to those who attend. (First time attendees are given the first opportunity to schedule topics.) Day two is a creator day on which people write, create, build, code, or otherwise improve something on their website. If you don’t yet have a website, people will be on hand to help you set one up, or get around obstacles you may have for being able to use and manage your website.
Details and RSVP information can be found here: https://indieweb.org/2020/West
If anyone has questions or needs further details or help proposing potential sessions, don’t hesitate to ask.
Publisher of books about cookery, food history and the ethnology of food.
👓 Trump and His Aides Have No Idea What They’re Talking About | The Atlantic
A procession of contradictory statements leaves the public no closer to understanding the president’s deal with Stormy Daniels, but clearly shows the White House’s dishonesty.
👓 What are people using GitHub for besides coding? | InfoWorld
Git made it possible for programmers to coordinate distributed work across teams -- now GitHub makes it possible for everyone else
A new view of the tree of life
An update to the tree of life has revealed a dominance of bacterial diversity in many ecosystems and extensive evolution in some branches of the tree. It also highlights how few organisms we have been able to cultivate for further investigation.
Abstract
The tree of life is one of the most important organizing principles in biology. Gene surveys suggest the existence of an enormous number of branches, but even an approximation of the full scale of the tree has remained elusive. Recent depictions of the tree of life have focused either on the nature of deep evolutionary relationships or on the known, well-classified diversity of life with an emphasis on eukaryotes. These approaches overlook the dramatic change in our understanding of life’s diversity resulting from genomic sampling of previously unexamined environments. New methods to generate genome sequences illuminate the identity of organisms and their metabolic capacities, placing them in community and ecosystem contexts. Here, we use new genomic data from over 1,000 uncultivated and little known organisms, together with published sequences, to infer a dramatically expanded version of the tree of life, with Bacteria, Archaea and Eukarya included. The depiction is both a global overview and a snapshot of the diversity within each major lineage. The results reveal the dominance of bacterial diversification and underline the importance of organisms lacking isolated representatives, with substantial evolution concentrated in a major radiation of such organisms. This tree highlights major lineages currently underrepresented in biogeochemical models and identifies radiations that are probably important for future evolutionary analyses.
Laura A. Hug, Brett J. Baker, Karthik Anantharaman, Christopher T. Brown, Alexander J. Probst, Cindy J. Castelle, Cristina N. Butterfield, Alex W. Hernsdorf, Yuki Amano, Kotaro Ise, Yohey Suzuki, Natasha Dudek, David A. Relman, Kari M. Finstad, Ronald Amundson, Brian C. Thomas & Jillian F. Banfield in Nature Microbiology, Article number: 16048 (2016) doi:10.1038/nmicrobiol.2016.48

Carl Zimmer also has a nice little write up of the paper in today’s New York Times:
Most of the diversity outlined on the new tree has been hiding in plain sight.
in Scientists Unveil New ‘Tree of Life’ from The New York Times 4/11/16
At 52, I was accepted to Yale as a freshman. The students I met there surprised me.