The contents of PyPI, in numbers

Total files: 1.35 billion (95,328,294 unique)
Total lines of text: 429 billion (428,967,879,093 to be precise)
Total uncompressed size: 78.0 TiB (that is ~58,437,633.19 floppy disks)
Lines of code added per second: 5,341 (in the month beginning 2024-04-01)

This page breaks down the contents of PyPI, based on parsing the contents of its packages. You can download PyPI locally to do your own analysis, or run SQL queries on the data in your browser.

Project Contents

This data only counts unique projects, not versions. For example, if a project has published 10 versions in a month, each with a setup.py file, it is counted only once.
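The counting rule above can be illustrated with a minimal sketch. The `releases` records and the `projects_with_file` helper are hypothetical and exist only for this example; they are not the page's actual pipeline or schema.

```python
from collections import defaultdict

# Hypothetical release records: (month, project, version, files in that release).
releases = [
    ("2024-04", "alpha", "1.0", ["setup.py", "alpha/__init__.py"]),
    ("2024-04", "alpha", "1.1", ["setup.py", "alpha/__init__.py"]),
    ("2024-04", "beta", "0.1", ["pyproject.toml", "beta.py"]),
    ("2024-05", "alpha", "2.0", ["pyproject.toml", "alpha/__init__.py"]),
]


def projects_with_file(records, filename):
    """Count distinct projects per month that shipped `filename` in any version."""
    seen = defaultdict(set)
    for month, project, _version, files in records:
        if filename in files:
            seen[month].add(project)  # the set ignores repeated versions
    return {month: len(projects) for month, projects in seen.items()}


counts = projects_with_file(releases, "setup.py")
print(counts)
```

Here `alpha` ships `setup.py` in two versions during 2024-04 but contributes a single count for that month.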

Language Features

This data only counts unique projects, not versions. For example, if a project has published 10 versions in a month, each containing an async function, it is counted only once.

Breakdown

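| Name | Projects | Percent |
|---|---:|---:|
| list comp | 226,072 | 49 |
| fstring | 155,226 | 34 |
| annotations | 131,143 | 29 |
| generator expression | 121,229 | 26 |
| dict comp | 89,232 | 19 |
| async | 29,329 | 6 |
| dataclasses | 26,935 | 6 |
| set comp | 21,746 | 5 |
| walrus | 9,728 | 2 |
| match | 2,676 | 1 |
| async comp | 1,064 | 0 |
| try star | 22 | 0 |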
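One plausible way to detect features like these is to walk a file's syntax tree with Python's standard `ast` module. The sketch below is an assumption about the approach, not the page's actual detector: the `FEATURES` mapping and `detect_features` helper are hypothetical, and only a subset of the table's rows is covered.

```python
import ast

# Map AST node types to the feature names used in the table.
# This mapping is an illustrative assumption and covers only some rows.
FEATURES = {
    ast.ListComp: "list comp",
    ast.DictComp: "dict comp",
    ast.SetComp: "set comp",
    ast.GeneratorExp: "generator expression",
    ast.JoinedStr: "fstring",
    ast.NamedExpr: "walrus",
    ast.AsyncFunctionDef: "async",
}


def detect_features(source: str) -> set:
    """Return the names of the features that appear anywhere in `source`."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        name = FEATURES.get(type(node))
        if name:
            found.add(name)
    return found


sample = '''
async def fetch(items):
    squares = [i * i for i in items]
    if (n := len(squares)) > 2:
        return f"{n} items"
'''

features = detect_features(sample)
print(sorted(features))
```

Rows such as `match` and `try star` correspond to the `ast.Match` (Python 3.10+) and `ast.TryStar` (Python 3.11+) nodes, which are left out here so the sketch also runs on older interpreters.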

Secrets Detected

PyPI contains a lot of secrets.

| Type | Count |
|---|---:|
| Google API Key | 5,178 |
| OpenAI API Key | 3,982 |
| Tencent Cloud Secret ID | 2,529 |
| Amazon AWS Secret Access Key | 1,926 |
| Amazon AWS Access Key ID | 1,621 |
| Google Cloud Private Key ID | 1,235 |
| Slack API Token | 1,105 |
| Telegram Bot Token | 1,065 |
| Slack Incoming Webhook URL | 914 |
| PyPI API Token | 846 |
| Google OAuth Access Token | 768 |
| SendGrid API Key | 758 |
| Tencent WeChat API App ID | 739 |
| Mailgun API Key | 728 |
| Twilio Account String Identifier | 702 |
| Total | 31,357 |
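Secrets of this kind are commonly found by pattern matching on well-known key prefixes. The sketch below is a simplified illustration, not the scanner used to produce the table: the two regular expressions follow the widely documented `AKIA` and `AIza` prefixes, the `scan` helper is hypothetical, and the sample keys are fabricated placeholders.

```python
import re

# Simplified patterns based on publicly known key prefixes (assumption:
# a real scanner uses many more rules and validates its matches).
PATTERNS = {
    "Amazon AWS Access Key ID": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "Google API Key": re.compile(r"AIza[0-9A-Za-z_\-]{35}"),
}


def scan(text):
    """Return (type, match) pairs for every pattern hit in `text`."""
    hits = []
    for kind, pattern in PATTERNS.items():
        hits.extend((kind, m.group(0)) for m in pattern.finditer(text))
    return hits


# Fabricated placeholder values, built programmatically so that no
# real-looking credential appears in the example.
fake_aws = "AKIA" + "A" * 16
fake_google = "AIza" + "x" * 35
sample = f'AWS_KEY = "{fake_aws}"\nMAPS_KEY = "{fake_google}"\nname = "demo"\n'

hits = scan(sample)
for kind, value in hits:
    print(kind, value[:8] + "...")
```

Prefix matching alone produces false positives, so a match is best treated as a candidate to verify rather than a confirmed leak.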

Growth

11.06 million

85TB

1.3 billion

PyPI is growing fast. If this dangerous expansion is not stopped, our advanced machine learning models predict that in only 8 years the number of packages will outnumber human beings.

Witness this inevitable future

Binary files

This table breaks down the binary files on PyPI by extension. Binary files make up the vast majority of the content on PyPI, accounting for nearly 75% of the uncompressed size.

| Extension | Total files | Total size | Unique files |
|---|---:|---:|---:|
| .so | 8,778,106 | 27.6 TiB | 4,814,612 |
| .pyd | 2,167,926 | 4.9 TiB | 1,977,710 |
| .dylib | 1,440,860 | 4.4 TiB | 448,641 |
| .2 | 182,797 | 3.5 TiB | 22,465 |
| No extension | 5,689,743 | 2.6 TiB | 1,978,349 |
| .dll | 1,664,072 | 2.5 TiB | 463,405 |
| .0 | 794,895 | 1.8 TiB | 79,687 |
| .jar | 580,471 | 1.1 TiB | 53,248 |
| .1 | 391,271 | 668.1 GiB | 51,973 |
| .png | 27,598,464 | 609.5 GiB | 850,259 |
| .exe | 243,582 | 551.7 GiB | 65,234 |
| .lib | 145,930 | 548.3 GiB | 41,901 |
| .gz | 4,721,524 | 452.0 GiB | 692,964 |
| .tgz | 367,369 | 414.4 GiB | 167,969 |
| .a | 187,733 | 408.3 GiB | 89,022 |
| Total | 54,954,743 | 52.0 TiB | 11,797,439 |
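The sizes in this table mix GiB and TiB, so summing them needs a unit conversion first. As a small worked check, the sketch below converts the listed sizes to bytes and compares their sum with the 78.0 TiB total quoted at the top of the page. The `to_bytes` helper is hypothetical, and the inputs are the rounded values from the table, so the result is approximate.

```python
UNITS = {"GiB": 1024**3, "TiB": 1024**4}


def to_bytes(size: str) -> float:
    """Convert a string such as '668.1 GiB' to a number of bytes."""
    value, unit = size.split()
    return float(value) * UNITS[unit]


# Rounded per-extension sizes copied from the table, in row order.
sizes = [
    "27.6 TiB", "4.9 TiB", "4.4 TiB", "3.5 TiB", "2.6 TiB",
    "2.5 TiB", "1.8 TiB", "1.1 TiB", "668.1 GiB", "609.5 GiB",
    "551.7 GiB", "548.3 GiB", "452.0 GiB", "414.4 GiB", "408.3 GiB",
]

total_tib = sum(to_bytes(s) for s in sizes) / UNITS["TiB"]
share = total_tib / 78.0  # 78.0 TiB is the total uncompressed size of PyPI

print(f"{total_tib:.1f} TiB, {share:.0%} of the total")
```

The listed extensions sum to about 52.0 TiB, matching the Total row, which is roughly two thirds of all uncompressed data. The "nearly 75%" figure presumably also includes binary extensions that are not listed here.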

Largest Projects by size

TensorFlow dominates this list with 12.9 TiB of uncompressed data, 17% of all data on PyPI.

| Project name | Unique files | Total files | Total lines | Total size |
|---|---:|---:|---:|---:|
| tf-nightly | 131,257 | 48,022,472 | 20,623,931,848 | 4.7 TiB |
| tf-nightly-cpu | 105,426 | 28,958,950 | 12,049,464,903 | 2.6 TiB |
| lalsuite | 2,028,527 | 12,473,835 | 5,765,281,832 | 1.4 TiB |
| tf-nightly-gpu | 71,119 | 11,976,657 | 4,945,537,352 | 1.4 TiB |
| fbgemm-gpu-nightly | 2,668 | 44,842 | 8,935,245 | 1.1 TiB |
| tf-nightly-cpu-aws | 62,014 | 13,076,056 | 5,130,252,053 | 1.0 TiB |
| tensorflow | 109,945 | 8,757,327 | 3,369,233,736 | 965.7 GiB |
| paddlepaddle-gpu | 33,976 | 2,072,060 | 453,389,542 | 921.7 GiB |
| graphscope-client | 39,577 | 2,417,047 | 283,962,015 | 876.2 GiB |
| tf-nightly-intel | 123,046 | 11,553,256 | 4,875,051,602 | 792.5 GiB |
| tensorflow-io-nightly | 14,020 | 927,623 | 116,109,192 | 742.5 GiB |
| tensorflow-gpu | 83,932 | 4,284,414 | 1,578,633,349 | 639.2 GiB |
| catboost-dev | 32,526 | 256,620 | 66,348,683 | 582.2 GiB |
| tf-nightly-macos | 30,561 | 6,141,121 | 2,881,268,907 | 575.1 GiB |
| tensorflow-cpu | 62,743 | 5,527,031 | 2,182,125,715 | 533.4 GiB |
| Total | 2,931,337 | 156,489,311 | 64,329,525,974 | 18.7 TiB |

Stats By Extensions

This only considers the last suffix of the file path as the extension.

| Extension | Total files | Total lines | Total size | Unique files |
|---|---:|---:|---:|---:|
| .py | 584,486,954 | 154,693,972,005 | 5.7 TiB | 36,935,177 |
| .h | 126,256,918 | 37,720,208,100 | 1.4 TiB | 803,283 |
| No extension | 70,615,331 | 9,897,343,381 | 3.1 TiB | 18,446,358 |
| .json | 66,757,561 | 29,099,361,863 | 1.4 TiB | 1,829,170 |
| .hpp | 49,937,656 | 10,600,751,310 | 425.6 GiB | 311,678 |
| .txt | 44,240,609 | 20,184,096,986 | 769.9 GiB | 3,686,436 |
| .js | 35,376,291 | 14,723,489,694 | 1.3 TiB | 1,567,727 |
| .png | 27,637,863 | 913,102 | 609.6 GiB | 852,364 |
| .rst | 24,586,597 | 1,528,364,396 | 58.3 GiB | 1,302,090 |
| .pyi | 22,755,142 | 4,340,249,876 | 143.1 GiB | 555,727 |
| .svg | 21,843,999 | 1,520,598,184 | 195.5 GiB | 406,964 |
| .yaml | 17,398,756 | 1,560,837,179 | 55.9 GiB | 409,882 |
| .html | 17,092,074 | 3,399,616,387 | 250.4 GiB | 1,706,219 |
| .md | 14,621,588 | 1,420,414,290 | 53.3 GiB | 1,518,061 |
| .pyc | 11,904,900 | 276,739 | 79.7 GiB | 5,522,568 |
| Total | 1,135,512,239 | 290,690,493,492 | 15.5 TiB | 75,853,704 |
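The "last suffix" rule for extensions can be sketched with the standard `pathlib` module, whose `suffix` attribute returns only the final dotted component of a file name. The `extension` helper and the example paths are hypothetical, and the page's own implementation may differ.

```python
from pathlib import PurePosixPath


def extension(path: str) -> str:
    """Return the last suffix of `path`, or 'No extension' if there is none."""
    suffix = PurePosixPath(path).suffix
    return suffix if suffix else "No extension"


examples = [
    "pkg/module.py",
    "dist/archive.tar.gz",
    "lib/libfoo.so.2",
    "bin/run",
]

for path in examples:
    print(f"{path} -> {extension(path)}")
```

Under this rule `archive.tar.gz` is counted as `.gz` rather than `.tar.gz`, and a versioned shared library such as `libfoo.so.2` is counted as `.2`, which would plausibly explain numeric extensions like `.0`, `.1`, and `.2` in the binary files table.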