Insegnamento a.a. 2025-2026

20867 - BUSINESS ECONOMICS - MODULE I (EMPIRICAL METHODS)

Department of Management and Technology

Course taught in English
Go to class group/s: 27 - 28
EMIT (8 credits - I sem. - OB  |  SECS-P/06)
Course Director:
STEFANO BRESCHI

Classes: 27 (I sem.) - 28 (I sem.)
Instructors:
Class 27: STEFANO BRESCHI, Class 28: STEFANO BRESCHI


Mission & Content Summary

MISSION

In today’s data-driven landscape, understanding the dynamics of high-tech ventures requires mastery of complex, multi-source datasets and advanced analytical tools. This course empowers students to streamline and reshape raw startup ecosystem data—from fundraising rounds to performance metrics—into robust, analysis-ready formats. Through hands-on labs in Python and Stata, students will develop reproducible pipelines for data cleaning, integration, and visualization, then apply core econometric techniques (OLS, IV, panel models) to test hypotheses on growth, valuation, and investment trends. Throughout the lab, we emphasize critical thinking and problem-solving as essential skills for navigating complex data workflows and econometric challenges. By the end of the course, students will be fully prepared to carry out robust data transformations and econometric analyses across a range of academic and professional settings.

CONTENT SUMMARY

Part I – Data Engineering Lab

 

A. Python Foundations & Environment

  • IDEs and notebook workflows (Jupyter, VS Code, remote servers)
  • Core Python syntax: variables, data types (numeric, boolean, string), containers (lists, dicts, tuples, sets)
  • Control flow: if/elif/else, for and while loops, error handling (try/except)
  • Functions and modules: defining, scoping, lambdas, package management

 

B. Data Acquisition & Import

  • Reading and writing CSV, JSON, Excel, and Stata files
  • Managing large datasets with chunked loading and memory optimization
  • Ensuring data quality at import: type conversions and error reporting

 

C. Data Wrangling & Cleaning

  • Pandas Series and DataFrame fundamentals: indexing, slicing, filtering
  • Detecting and treating missing values, duplicates, and outliers
  • String operations and date–time parsing
  • Grouped aggregations and transformations (groupby, agg, transform)

 

D. Data Integration & Reshaping

  • Merging, joining, concatenating multiple tables (e.g., rounds, valuations, company profiles)
  • Pivoting and melting for “wide” vs. “long” formats
  • Building event‐time panels and longitudinal structures
  • Chaining operations into reproducible pipelines

 

E. Exploratory Visualization

  • Plotting with Matplotlib (line, bar, scatter, box)
  • Customizing plots: titles, labels, legends, and annotations
  • Exporting figures for reports and presentations

 

F. Introduction to Text Data

  • Regular expressions for parsing term sheets and news feeds
  • Tokenization and basic NLP workflows (TF-IDF)

 

Part II – Econometric Methods

 

A. Econometric Data Structures

  • Cross-section vs. time series vs. panel data
  • Constructing balanced and unbalanced panels from transaction logs

 

B. Simple Linear Regression

  • Economic vs. econometric model specification
  • Ordinary least squares estimation and interpretation
  • Incorporating non-linear terms and interaction dummies

 

C. Inference & Hypothesis Testing

  • Confidence intervals and p-values
  • Robust standard errors and heteroskedasticity tests

 

D. Multiple Regression & Diagnostics

  • Multicollinearity, specification tests, omitted-variable bias
  • Model selection and goodness-of-fit measures

 

E. Binary Outcome Models

  • Linear probability model, Logit, and Probit for exit or follow-on rounds
  • Marginal effects and interpretation

 

F. Panel Data Techniques

  • Pooled OLS, fixed effects, random effects

Hausman test and model choice


Intended Learning Outcomes (ILO)

KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...
  • Describe how to structure and harmonize data from multiple sources into tidy, analysis-ready formats.
  • Demonstrate proficiency in Python and Stata for data ingestion, cleaning, and management.
  • Apply advanced data‐wrangling techniques to detect and correct missing values, outliers, and inconsistencies.
  • Transform and merge disparate tables into cross-sectional and panel datasets ready for analysis.
  • Interpret the output of regression models within the context of venture financing.
  • Evaluate model diagnostics—such as heteroskedasticity tests and endogeneity checks—to ensure valid inference.
  • Present empirical results effectively through concise written reports and visualizations.

APPLYING KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...
  • Construct end-to-end data workflows to ingest, clean, and prepare complex startup datasets for analysis.

  • Design and implement regression models in Python and Stata to test real-world hypotheses.

  • Communicate analytical results through clear visualizations and concise written reports.

  • Adapt analytical techniques to novel data challenges encountered in professional research, policy, or finance settings.


Teaching methods

  • Lectures
  • Guest speaker's talks (in class or in distance)
  • Practical Exercises
  • Individual works / Assignments
  • Collaborative Works / Assignments

DETAILS

The following methods will be employed:

  • Lectures
    • Core concepts in Python, data handling, and econometrics
  • Guest Speaker Seminars
    • Talks by industry practitioners on data sources, startup finance, and market insights
  • Hands-On Sessions
    • Guided lab sessions focused on coding exercises, data cleaning tasks, and visualization
  • Problem-Solving Sessions
    • Instructor-led exercises and self-assessment quizzes to reinforce key techniques
  • Group Project Work
    • Team-based empirical research: hypothesis development, data pipeline construction, model estimation, and presentations

Assessment methods

  Continuous assessment Partial exams General exam
  • Written individual exam (traditional/online)
    x
  • Collaborative Works / Assignment (report, exercise, presentation, project work etc.)
x    

ATTENDING STUDENTS

Assessment of learning:

 

 

  1. Written Examination (60%)
  • A 90-minute exam at the end of the course covering data engineering and econometric methods.
    • Assesses theoretical understanding and practical problem-solving skills across both parts of the curriculum.

2. Group Project (40%)

  • Teams design and execute an empirical study, culminating in an in-class presentation.
    • Evaluation criteria include research design, data pipeline robustness, econometric analysis and presentation effectiveness.
    • Individual contributions and presentation skills are assessed for each team member.

 


NOT ATTENDING STUDENTS

Written Examination (100%)

  • A single, 2½-hour individual exam at the end of the course.
  • Covers all topics from data preparation through econometric modeling.

Teaching materials


ATTENDING STUDENTS

Course materials for Part I will consist of tutorials, code notebooks, and reference readings distributed at the start of the term. For Part II, students will receive lecture slides and a selection of research papers in advance. In addition, the econometrics module will draw on specified chapters from:
Hill, R. C., Griffiths, W. E., & Lim, G. C. (2018). Principles of Econometrics. John Wiley & Sons.


NOT ATTENDING STUDENTS

Hill, R. C., Griffiths, W. E., & Lim, G. C. (2018). Principles of Econometrics. John Wiley & Sons (selected chapters). Additional readings for part I and II will be specified at the beginning of the course.

Last change 27/05/2025 11:33