View all newsletters
Sign up for our free email newsletters

Fighting for quality news media in the digital age.

  1. Comment
April 5, 2016updated 06 Apr 2016 11:49am

Panama Papers: How 400 journalists made sense of biggest ever data leak to media

By Dominic Ponsford

German newspaper Suddeutsche Zeitung has provided a fascinating insight into what may be the biggest journalistic investigation in history – the Panama Papers (sub-titled The Secrets of Dirty Money).
The leak of 11 million documents, dating back 40 years, came from an anonymous source who never met journalists and only made contact via encrypted digital communications.
Coordinated by the US-based International Consortium of Investigative Journalists, some 400 journalists from 109 news organisations based in 80 countries have spent the last year sifting through the information.
The ICIJ says the documents from Panama based law firm Mossack Fonseca reveal "the offshore holdings of world political leaders, links to global scandals, and details of the hidden financial dealings of fraudsters, drug traffickers, billionaires, celebrities, sports stars and more".
The UK partners are The Guardian and BBC Panorama.
The source reportedly told Suddeutsche Zeitung "my life is in danger" and said the reason they were releasing the files was because "I want to make these crimes public".
Several UK national newspapers today lead on the revelation by The Guardian that Prime Minister David Cameron's father "ran an offshore fund that avoided ever having to pay tax in Britain by hiring a small army of Bahamas residents – including a part-time bishop – to sign its paperwork".
The leaked data is structured as follows: Mossack Fonseca created a folder for each shell firm. Each folder contains e-mails, contracts, transcripts, and scanned documents. In some instances, there are several thousand pages of documentation. First, the data had to be systematically indexed to make searching through this sea of information possible.

To this end, the Süddeutsche Zeitung used Nuix, the same program that international investigators work with. Süddeutsche Zeitung and ICIJ uploaded millions of documents onto high-performance computers. They applied optical character recognition (OCR) to transform data into machine-readable and easy to search files. The process turned images – such as scanned IDs and signed contracts – into searchable text. This was an important step: it enabled journalists to comb through as large a portion of the leak as possible using a simple search mask similar to Google.

The journalists compiled lists of important politicians, international criminals, and well-known professional athletes, among others. The digital processing made it possible to then search the leak for the names on these lists. The "party donations scandal" list contained 130 names, and the UN sanctions list more than 600. In just a few minutes, the powerful search algorithm compared the lists with the 11.5 million documents.

Content from our partners
MHP Group's 30 To Watch awards for young journalists open for entries
How PA Media is helping newspapers make the digital transition
Publishing on the open web is broken, how generative AI could help fix it

Topics in this article :

Email pged@pressgazette.co.uk to point out mistakes, provide story tips or send in a letter for publication on our "Letters Page" blog

Select and enter your email address Weekly insight into the big strategic issues affecting the future of the news industry. Essential reading for media leaders every Thursday. Your morning brew of news about the world of news from Press Gazette and elsewhere in the media. Sent at around 10am UK time. Our weekly does of strategic insight about the future of news media aimed at US readers. A fortnightly update from the front-line of news and advertising. Aimed at marketers and those involved in the advertising industry.
  • Business owner/co-owner
  • CEO
  • COO
  • CFO
  • CTO
  • Chairperson
  • Non-Exec Director
  • Other C-Suite
  • Managing Director
  • President/Partner
  • Senior Executive/SVP or Corporate VP or equivalent
  • Director or equivalent
  • Group or Senior Manager
  • Head of Department/Function
  • Manager
  • Non-manager
  • Retired
  • Other
Visit our privacy Policy for more information about our services, how New Statesman Media Group may use, process and share your personal data, including information on your rights in respect of your personal data and how you can unsubscribe from future marketing communications.
Thank you

Thanks for subscribing.

Websites in our network