Pandas
and Numpy
, the first packages of our introductory journey,
are
essential for manipulating data.
However, it is important not to overlook the fundamentals of the Python
language when discovering it. A good understanding of the fundamental elements of the language helps to better grasp the logic of data science packages, understand the errors encountered, and results in greater productivity and freedom.
To explore basic objects and the structure of the language, a series of notebooks is provided below. The course is flexible; you can work through these notebooks in any order or only complete parts of them if you are already familiar with some of the content.
After spending some time reviewing your Python
skills, you can find the next part of the course in the “Data wrangling” section.
Review notebooks
Informations additionnelles
This site was built automatically through a Github
action using the Quarto
The environment used to obtain the results is reproducible via uv. The pyproject.toml
file used to build this environment is available on the linogaliana/python-datascientist
repository
pyproject.toml
[project]= "python-datascientist"
name = "0.1.0"
version = "Source code for Lino Galiana's Python for data science course"
description = "README.md"
readme -python = ">=3.12,<3.13"
requires= [
dependencies "altair==5.4.1",
"black==24.8.0",
"cartiflette",
"contextily==1.6.2",
"duckdb>=0.10.1",
"folium>=0.19.6",
"geoplot==0.5.1",
"graphviz==0.20.3",
"great-tables==0.12.0",
"ipykernel>=6.29.5",
"jupyter>=1.1.1",
"jupyter-cache==1.0.0",
"kaleido==0.2.1",
"langchain-community==0.3.9",
"loguru==0.7.3",
"markdown>=3.8",
"nbclient==0.10.0",
"nbformat==5.10.4",
"nltk>=3.9.1",
"pip>=25.1.1",
"plotly>=6.1.2",
"plotnine==0.13.6",
"polars==1.8.2",
"pyarrow==17.0.0",
"pynsee==0.1.8",
"python-dotenv==1.0.1",
"pywaffle==1.1.1",
"requests>=2.32.3",
"scikit-image==0.24.0",
"scipy==1.13.0",
"spacy==3.8.4",
"webdriver-manager==4.0.2",
"wordcloud==1.9.3",
"xlrd==2.0.1",
"yellowbrick==1.5",
]
[tool.uv.sources]= { git = "https://github.com/inseefrlab/cartiflette" } cartiflette
To use exactly the same environment (version of Python
and packages), please refer to the documentation for uv
.
Citation
@book{galiana2023,
author = {Galiana, Lino},
title = {Python Pour La Data Science},
date = {2023},
url = {https://pythonds.linogaliana.fr/},
doi = {10.5281/zenodo.8229676},
langid = {en}
}