The leak contains the source code for most of the company’s services, from mail and taxis to music and the cloud.

🔎 Leak of Yandex services’ source code

On January 25, 2023, the source code and accompanying data for many Yandex services and programs appeared on the web. The distribution consists of separate archives (.tar.bz2) whose names identify the corresponding Yandex services.
The total size of the compressed archives is more than 44.7 GB.

On January 26, 2023, Yandex confirmed the publication of the source code of some projects from its internal repository.

The hackers released the archive to the public and claim that in July 2022 they downloaded the source code of the company’s projects, in addition to the anti-spam rules.

😀 “There was no Yandex hack. The Yandex security service has discovered code fragments from an internal repository in the public domain. However, their content differs from the current version of the repository used in Yandex services.

The repository is one of the development tools within most companies that is available to their developers. Repositories are needed to work with code and are not intended to store personal data of users. We are conducting an internal investigation,” the company’s press service told Habr.

Developer Arseniy Shestakov explained that the archive contains only the contents of the git repositories and no personal data. There are several API keys, but they were most likely used only for test deployments. Some of the archives contain source code for part of the company’s services, as well as documentation pointing to real intranet URLs.

Yandex GIT Source tree got leaked!




Just a few hours ago I found a mention on Twitter that proprietary source code of the Russian giant Yandex had been leaked on an online community called BreachForums. In this post I’ll share the results of my friend’s digging into said archives.

Important details about the torrent:
It contains just the contents of the repository, without anything else.
All files are dated 24 February 2022.
It does not contain git history, mostly just code.
There are no pre-built binaries for most of the software, with only a few exceptions.
There are no pre-trained ML models, with some exceptions.

This post is a work in progress and will be updated with more details.

Why is this big?
Yandex is one of the largest IT companies in Russia. Within the country it provides a wider range of services than Google. Imagine one company that replaces Google, Uber, Amazon, Netflix and Spotify.

Is this leak real?
I have never personally worked at Yandex, but I know several people who worked there at different times or still work there. I verified that at least some of the archives definitely contain modern source code for company services, as well as documentation pointing to real intranet URLs.

What’s inside
It looks like the source code for at least all major Yandex services has been leaked:

Search Engine and Indexing Bot
Maps – Like Google Maps and Street View
Alice – AI assistant like Siri / Alexa
Taxi – Uber-like taxi service
Direct – Ads service like Google Ads / AdWords
Mail – Mail service like Gmail
Disk – File storage service like Google Drive
Market – Marketplace like Amazon
Travel – Travel booking service, plus airplane, train and bus tickets
Yandex360 – Like Google Workspace for services on your own domain
Cloud – Probably not all infrastructure code was leaked.
Pay – Payment processing like Stripe, but with limited set of features
Metrika – Like Google Analytics
At least the backend part of the majority of the company’s other services is there as well. The largest archive, called “frontend”, is yet to be explored.

Full list of files:
If you don’t want to download the torrent but are curious about what’s inside, you can get the list of files from the following gist:

You can also clone it like a normal git repository:

git clone
The list of all files can be obtained with the following commands.
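One way to produce such a listing without extracting anything is sketched below in Python. This is an illustration only, not the commands from the original post; the helper names `list_archive` and `make_demo_archive` and the demo file path are hypothetical, and the tiny demo archive exists only so the sketch can be run without the torrent.

```python
import io
import tarfile
import tempfile
from pathlib import Path


def list_archive(path):
    """Return the sorted names of regular files in a .tar.bz2 archive.

    The archive is only read, never extracted to disk.
    """
    with tarfile.open(path, mode="r:bz2") as archive:
        return sorted(member.name for member in archive if member.isfile())


def make_demo_archive(path):
    """Create a tiny .tar.bz2 archive with one file (demo data only)."""
    payload = b"print('hello')\n"
    with tarfile.open(path, mode="w:bz2") as archive:
        info = tarfile.TarInfo(name="demo/src/main.py")
        info.size = len(payload)
        archive.addfile(info, io.BytesIO(payload))


def demo():
    """Build the demo archive in a temporary directory and list it."""
    with tempfile.TemporaryDirectory() as tmp:
        archive_path = Path(tmp) / "demo.tar.bz2"
        make_demo_archive(archive_path)
        return list_archive(archive_path)


for name in demo():
    print(name)
```

Reading member headers this way still has to decompress the bzip2 stream, so it can be slow on multi-gigabyte archives, but it never writes untrusted files to disk.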

Full list of files in torrent
aapi.tar.bz2 client_method.tar.bz2 gencfg.tar.bz2 mobile-WARNING-notfull.tar.bz2.part skynet.tar.bz2
admins.tar.bz2 cloud.tar.bz2.part groups.tar.bz2 nginx.tar.bz2 smart_devices.tar.bz2.part
ads.tar.bz2 commerce.tar.bz2.part helpdesk.tar.bz2 noc.tar.bz2.part smarttv.tar.bz2
alice.tar.bz2.part config.tar.bz2 infra.tar.bz2 partner.tar.bz2 solomon.tar.bz2.part
analytics.tar.bz2.part connect.tar.bz2.part intranet.tar.bz2 passport.tar.bz2.part stocks.tar.bz2
antiadblock.tar.bz2 crm.tar.bz2.part investors.tar.bz2 pay.tar.bz2 switch.tar.bz2
antirobot.tar.bz2 crypta.tar.bz2 it-office.tar.bz2 payplatform.tar.bz2.part tasklet.tar.bz2
autocheck.tar.bz2 customer_service.tar.bz2 jupytercloud.tar.bz2 paysys.tar.bz2 taxi.tar.bz2.part
balancer.tar.bz2 datacloud.tar.bz2 kernel.tar.bz2.part portal.tar.bz2.part tools.tar.bz2
billing.tar.bz2 delivery.tar.bz2.part library.tar.bz2.part privacy_office.tar.bz2 travel.tar.bz2.part
bindings.tar.bz2 direct.tar.bz2.part load.tar.bz2.part products.tar.bz2 wmconsole.tar.bz2
captcha.tar.bz2 disk.tar.bz2 mail.tar.bz2.part robot.tar.bz2 yandex360.tar.bz2.part
cdn.tar.bz2 docs.tar.bz2 maps.tar.bz2.part rt-research.tar.bz2 yandex_io.tar.bz2.part
certs.tar.bz2 drive.tar.bz2.part maps_2.tar.bz2.part saas.tar.bz2 yaphone.tar.bz2
ci.tar.bz2.part extsearch.tar.bz2.part maps_adv.tar.bz2 sandbox.tar.bz2 yawe.tar.bz2
classifieds.tar.bz2.part frontend.tar.bz2.part market.tar.bz2.part search.tar.bz2
client_analytics.tar.bz2.part fuzzing.tar.bz2 metrika.tar.bz2.part security.tar.bz2
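
The listing above mixes complete archives (`.tar.bz2`) with entries ending in `.part`. Below is a minimal Python sketch for separating the two. It assumes, which the post does not state, that `.part` marks archives that had not finished downloading when the listing was taken. `split_archives` is a hypothetical helper name, and the sample is a small subset of the names shown above.

```python
PARTIAL_SUFFIX = ".tar.bz2.part"
COMPLETE_SUFFIX = ".tar.bz2"


def split_archives(names):
    """Split archive file names into (complete, partial) service names.

    Assumption: a trailing ".part" means the archive is incomplete.
    Names that match neither suffix are ignored.
    """
    complete, partial = [], []
    for name in names:
        # Check the longer suffix first, since it also contains ".tar.bz2".
        if name.endswith(PARTIAL_SUFFIX):
            partial.append(name[: -len(PARTIAL_SUFFIX)])
        elif name.endswith(COMPLETE_SUFFIX):
            complete.append(name[: -len(COMPLETE_SUFFIX)])
    return sorted(complete), sorted(partial)


# A small subset of the listing, whitespace-separated as in the post.
sample = "aapi.tar.bz2 cloud.tar.bz2.part nginx.tar.bz2 taxi.tar.bz2.part".split()
complete, partial = split_archives(sample)
print("complete:", complete)
print("partial:", partial)
```

Stripping the suffix leaves the bare archive name, which, as noted earlier, is what identifies the corresponding service.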

Security implications.
Since this leak contains only the contents of git repositories, there is no personal data. There are at least some API keys, but they were likely used only for test deployments.

By Treadstone 71
