Institute for Retailing & Data Science
Date | Time | Room | Topics | Slides | Recommended Readings |
---|---|---|---|---|---|
01-08-2024 | 1:00pm - 6:00pm | LC.-1.038 | Introduction | Introduction | Lost in Data Translation, R for Data Science |
01-10-2024 | 1:00pm - 6:00pm | LC.-1.038 | Modelling and Visualization Theory |
Modelling Visualization |
Causal Pitchfork Visualization A Crash Course in Good and Bad Controls The Psychology behind Data Visualization Techniques |
01-15-2024 | 1:00pm - 6:00pm | LC.2.064 | Empirical Model Building I + Coaching | ||
01-17-2024 | 1:00pm - 6:00pm | LC.-1.038 | Empirical Model Building II + Coaching | ||
01-24-2024 | 1:00pm - 5:00pm | LC.2.064 | Project Coaching | ||
01-31-2024 | 1:00pm - 3:00pm | Online | Take home exam | FAQs |
Nils Wlömert
Daniel Winkler
Gain the ability to create & communicate valuable insight from data
R
programming skills to help implementation[…] associate “winning” with the effort process itself. That’s the holy grail of dopamine management for success. It won’t make you dull or unhappy; it will make everything easier and more pleasurable […].
Hire as many data scientists as you can find you’ll still be lost without translators to connect analytics with real business value. […] By 2025 Chief Data Officers and their teams function as a business unit with profit-and-loss responsibilities. The unit, in partnership with business teams, is responsible for ideating new ways to use data, developing a holistic enterprise data strategy (and embedding it as part of a business strategy), and incubating new sources of revenue by monetizing data services and data sharing.
The empirics-first approach is not antagonistic to theory but rather can serve as a stepping-stone to theory. The approach lends itself well to today’s data-rich environment, which can reveal novel research questions untethered to theory. […] we argue that [empirics first] has a natural arc that bends more easily back to real-world implications.
Golder et al. (2022)
How old is this person?
Thinking in school
Thinking in school
Thinking in school
Thinking in school
Thinking in school
Thinking in life
Thinking in school
Thinking in life
“Who does what better now?”
If you torture the data long enough, it will confess to anything
Ronald Coase
e.g., Simpson’s Paradox
OLS 1 | |
---|---|
(Intercept) | 5.565*** |
x | -0.308*** |
+ p < 0.1, * p < 0.05, ** p < 0.01, *** p < 0.001 |
e.g., Simpson’s Paradox
OLS 2 | |
---|---|
(Intercept) | 4.315*** |
x | 0.334*** |
g B | -4.403*** |
+ p < 0.1, * p < 0.05, ** p < 0.01, *** p < 0.001 |
Failed robustness checks should be viewed as learning opportunities that lead to an even broader exploration that considers why a finding obtains in one context but not another
Golder et al. (2022)
Traditionally theory-agnostic predictive analytics tools are likely to have larger impact and lesser bias if they are able to smartly combine theoretical insights […] with large troves of data.
Bradlow et al. (2017)
make use of (quasi-)experiments (Goldfarb, Tucker, and Wang 2022)
Good natural experiments are studies in which there is a transparent exogenous source of variation in the explanatory variables that determine treatment assignment.
Meyer (1995)
choose appropritate models (Cunningham 2021)
think about good and bad controls (Cinelli, Forney, and Pearl 2020)
explore multiple causal pathways
Observation: The floor is wet
Necessary Condition
Without such a condition the observation cannot happen.
e.g., Water was “applied” to the floor.
Sufficient Condition
If such a condition is met the observation happens.
e.g., It is raining right now.
Can you think of counter examples for each?
e.g., Being at least 1.5m tall is not a necessary condition to become Austrian president
What happens if you “flip” conditions?
e.g., Getting a grade is … to know you are signed up in LPIS
What happens if you negate conditions?
e.g., Not being signed up on LPIS is … to know you will not receive a grade
Which of the examples are necessary and sufficient?
Can you come up with more?
Setup
Confrontation
Resolution
Prepare a 1-Minute elevator pitch for your thesis (or some other project)
15 min.
A data analysis can [consist of] importing, cleaning, transforming, and modeling data with a goal to build a machine learning algorithm to decide which product a company should sell.
McGowan, Peng, and Hicks (2022)
6 Principles
tibble [344 × 8] (S3: tbl_df/tbl/data.frame)
$ species : Factor w/ 3 levels "Adelie","Chinstrap",..: 1 1 1 1 1 1 1 1 1 1 ...
$ island : Factor w/ 3 levels "Biscoe","Dream",..: 3 3 3 3 3 3 3 3 3 3 ...
$ bill_length_mm : num [1:344] 39.1 39.5 40.3 NA 36.7 39.3 38.9 39.2 34.1 42 ...
$ bill_depth_mm : num [1:344] 18.7 17.4 18 NA 19.3 20.6 17.8 19.6 18.1 20.2 ...
$ flipper_length_mm: int [1:344] 181 186 195 NA 193 190 181 195 193 190 ...
$ body_mass_g : int [1:344] 3750 3800 3250 NA 3450 3650 3625 4675 3475 4250 ...
$ sex : Factor w/ 2 levels "female","male": 2 1 1 NA 1 2 1 2 NA NA ...
$ year : int [1:344] 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 ...
Component | Date |
---|---|
Preliminary story outline | Jan. 21 |
Last coaching | Jan. 24 |
Website submission | Feb. 18 |
Preliminary story outline
Final Result
Links
Five Fifty: Lost in translation
LEADERSHIP LAB: The Craft of Writing Effectively
The age of analytics: Competing in a data-driven world
The data-driven enterprise of 2025
Academic References