Python Basics for Statistics (August 10th and 11th, 2026)

Introduction to the Python environment, basic commands and data structures, data management and statistical programming (numpy, scipy, pandas), descriptive statistics, inference statistics (tests and linear regression), graphics, and applied exercises.

Instructor	Lukas Fink, Jay Deshpande
Number of Places	about 25 participants (fu:stat reserves the right to rescedule or cancel the course if the number of participants lies below 15)
Registration	→ Register online
Registration Mode	General information about our course offers (questions about the group of participants, registration or cancelation, about payment, organsation, certificates of participation etc.) you can find here.
Participation Fee	100 € for students (incl. PhD), 200 € for employees, for members of Potsdam Graduate School: Please register through the website of the PoGS, Financial support of Dahlem Research School: Doctoral candidates of the Berlin University Alliance can participate in this course cost free as long as budget lasts.
Room	FB Wirtschaftswissenschaft, Garystr. 21, 14195 Berlin PC-Pool 1
Time	Monday, March 10th, 2026, 9:00 a.m. to 5:00 p.m. Tuesday, March 11th, 2026, 9:00 a.m. to 5:00 p.m.

Student Profile

Students, PhD candidates and academic staff from all universities.

Requirements

Basic knowledge of descriptive and inferential statistics at the level of our courses “Statistik-Kompakt” or “’Statistik-Grundlagen”. No prior knowledge of Python required.

Description:

Python is one of the world’s most widely used programming languages. It is accessible to beginners due to its simple syntax, and it is used in a large variety of applications - from microcontrollers to web programming. Python is open-source and freely available, unlike other common statistical solutions such as SAS, SPSS, MATLAB and STATA. An advantage of Python is the comprehensive standard library, which includes many common functions, as well as the availability of many high-quality libraries for different use cases.

In the last couple of years, Python has been adopted increasingly for scientific programming in fields like economics, mathematics, physics, statistics, psychology, and data science. The scientific programming ecosystem comprises packages like numpy and scipy for numerical computing, pandas for data transformation, scikit-learn for machine learning, TensorFlow for deep learning, and OpenCV for computer vision.

This course provides the basics of the scientific programming environment in Python. It introduces programming concepts, before explaining how to do work with scientific packages like numpy, scipy and pandas. We learn the basic procedures of an empirical analysis like descriptive statistics, statistical tests, linear regression, and data visualization. After the course, participants should be able to use the Python documentation independently and apply the tools to answer research questions.

Topics:

Using Python with Conda and Jupyter
Python programming basics (data structures, functions, importing libraries)
Numerical computing with numpy, scipy
Data editing with pandas
Descriptive statistics
Data visualization in Python
Statistical tests
Linear regression

Important information for the registration

After filling out the registration form below, you will receive a within 7 days to your specified email address.

Statistical Consulting fu:stat