      <h1 id="hi">Hi!</h1>

<p>I am a <a href="#data-science">data scientist</a>, <a href="#social-science">social scientist</a>, <a href="#statistics">statistician</a>, and <a href="#software">software developer</a>. I mostly specialize in methods for solving causal inference and business decision problems, and I am particularly interested in building tools for practitioners working on real-world problems. I’m a generalist: I like to hang out with people from many fields and borrow as many ideas as possible. I have collaborated with computer scientists, economists, political scientists, statisticians, machine learning researchers, and business school scholars. It’s fun for me to jump around a bit, continue learning new things, and make connections between fields.</p>

<p>Here are some useful links:</p>

  <li><a href="">Motif Analytics</a> (my startup)</li>
  <li><a href="">Twitter</a></li>
  <li><a href="">Sign up for my newsletter</a></li>
  <li><a href="">GitHub</a></li>
  <li><a href="">LinkedIn</a></li>
  <li><a href=";authuser=1&amp;user=2VHQIgQAAAAJ">Google Scholar</a></li>
  <li><a href="blog/">My infrequently updated blog</a></li>

<h2 id="background">Background</h2>

  <li>2022-Present, I’m a co-founder and chief scientist at <a href="">Motif Analytics</a>. <a href="">Read more here!</a></li>
  <li><a href="">2019-2022</a>, I was a data scientist and manager on the Rideshare Labs team at <a href="">Lyft</a>.</li>
  <li>2012-2019, I was a research scientist and manager on Facebook’s <a href="">Core Data Science Team</a>.</li>
  <li>2008-2013, I was a Ph.D. student at NYU’s Stern School of Business, concentrating in <a href="">Information Systems</a>. My dissertation was titled <a href="">Social Influence from Online Social Signals</a>. My advisor was <a href="">Sinan Aral</a>.</li>
  <li>2006-2008, I was a software engineer at <a href="">Matrix Group, International</a>.</li>
  <li>2004-2006, I was a research assistant at the <a href="">Federal Reserve Board</a>.</li>
  <li>2000-2004, I was an undergraduate at the University of Pennsylvania. I studied Economics, Finance, and Information Systems.</li>
  <li>I grew up in Philadelphia and I’m a <a href="">huge Eagles fan</a>.</li>

<h2 id="videos-and-podcasts">Videos and Podcasts</h2>

  <li><a href="">What’s New in Data?</a> podcast with <a href="">John Kutay</a>.</li>
  <li><a href=";t=2s">When not to use SQL</a>, a talk at <a href="">NormConf</a>.</li>
  <li><a href="">Minimum Viable Experimentation</a> on the <a href="">Analytics Engineering Podcast</a>.</li>
  <li><a href=";">Causal Inference and Sequence Data</a> on the <a href="">Super Data Science podcast</a>.</li>
  <li><a href="">The Relationship between Experimentation and Causal Inference</a> talk at the Nubank Data Science meetup.</li>
  <li><a href="">Causal Inference Approach to Matching in Two-Sided Marketplaces</a>.</li>
  <li><a href="">Interview on The Data Exchange Podcast</a> with <a href="">Ben Lorica</a> and <a href="">Jenn Webb</a>.</li>
  <li><a href="">When do we actually need Causal Inference</a> talk at the <a href="">New York Open Statistical Programming Meetup</a>.</li>
  <li><a href="">TWIML Interview on Causal Models in Practice</a> with <a href="">Sam Charrington</a>.</li>
  <li><a href="">Interview on the Gradient Dissent Podcast</a> with <a href="">Lukas Biewald</a>.</li>
  <li><a href="">Learning Bayesian Statistics Podcast</a> episode with <a href="">Alex Andorra</a>.</li>
  <li><a href="">My keynote</a>, entitled “In Defense of Curve Fitting,” at the <a href="">2020 Causal Data Science Meeting</a>.</li>
  <li><a href="">A short talk about Prophet</a></li>
  <li><a href="">Another talk about Prophet</a> at StanCon, with my friend <a href="">Ben Letham</a>.</li>
  <li><a href="">Interview on the Casual Inference Podcast</a></li>
  <li><a href="">Appearance on Not So Standard Deviations</a> with my friends <a href="">Hilary Parker</a> and <a href="">Roger Peng</a>.</li>
  <li><a href="">A short interview about exploratory data analysis</a></li>
  <li><a href="">Podcast about my Science paper</a> with my friends <a href="">John Myles White</a> and <a href="">Hilary Mason</a>.</li>
  <li><a href=";t=1s">Finding Nate Silver</a>, an <a href="">Ignite talk</a> about a prediction market I co-developed.</li>

<h2 id="--data-science"><a name="data-science"> </a> Data Science</h2>

<p>Here are some data science posts I’ve written:</p>

  <li><a href="">A Personal Retrospective on Prophet</a></li>
  <li><a href="">Bringing more causality to data science</a></li>
  <li><a href="">Locally Optimal</a></li>
  <li><a href="">Designing and Evaluating Metrics</a></li>
  <li><a href="">The Personality Space of Cartoon Characters</a></li>
  <li><a href="">NFL Fans on Facebook</a></li>
  <li><a href="">Debunking Princeton</a></li>
  <li><a href="/post/39573264781/the-statistics-software-signal.html">The Statistics Software Signal</a></li>
  <li><a href="">Real scientists make their own data</a></li>

<h2 id="--social-science"><a name="social-science"> </a> Social Science</h2>

<p>Here are some of my social science papers. Almost all of them are field experiments on online social platforms.</p>

  <li><a href="">Displaying things in common to encourage friendship formation: A large randomized field experiment</a> (Quantitative Marketing and Economics) <a href="">(pdf of conference version)</a></li>
  <li><a href="">Characterizing online public discussions through patterns of participant interactions</a> (CSCW 2018)</li>
  <li><a href="">Social Influence Bias: A Randomized Experiment</a> (Science)</li>
  <li><a href="">Discussion quality diffuses in the digital public square</a> (WWW 2017)</li>
  <li><a href="">Selection Effects in Online Sharing: Consequences for Peer Adoption</a> (EC 2013)</li>

<h2 id="--experimentation-and-statistics"><a name="statistics"> </a> Experimentation and Statistics</h2>

<p>Here are some of my papers on experimentation and statistics. I’m relatively new to this field and mostly a consumer of statistics research, rather than a producer.</p>

  <li><a href="">Variance-Weighted Estimators to Improve Sensitivity in Online Experiments</a> (EC 2020) <a href="">(pdf)</a>. You can watch <a href="">a presentation</a> by my co-author <a href="">Kevin Liou</a>.</li>
  <li><a href="">Randomized experiments to detect and estimate social influence in networks</a> (Complex Spreading Phenomena in Social Systems). This is a book chapter with my friend <a href="">Dean Eckles</a>.</li>
  <li><a href="">Active Matrix Factorization for Surveys</a> (Annals of Applied Statistics)</li>
  <li><a href=";">Forecasting at Scale</a> (The American Statistician) <a href="">(pdf)</a></li>

<h2 id="--forecasting-software"><a name="software"> </a> Forecasting Software</h2>

  <li><a href="">Prophet</a> is an open-source forecasting package available in R and Python. You can watch <a href="">my talk about Prophet</a>. You can also read my <a href="">explainer thread</a>.</li>

