SlideShare a Scribd company logo
1 of 19
DATA JOURNALISM
Making news more meaningful
CONTENT
• Introduction
• What is data journalism
• Processes in data journalism
• Data journalism tools
• How to become a data journalist
• Benefits of using data journalism skills
INTRODUCTION
• The aim of this presentation is to provide a starting place for
introducing data journalism into the newsroom. It aims to help
journalists to start to develop the skills to write data stories.
• Data journalism is a journalism specialty reflecting the
increased role that numerical data is used in the production and
distribution of information in the digital era. It reflects the
increased interaction between content producers (journalist)
and several other fields such as design, computer
science and statistics. From the point of view of journalists, it
represents "an overlapping set of competencies drawn from
disparate fields".
WHAT IS DATA JOURNALISM?
• What makes data journalism different to the rest of
journalism? Perhaps it is the new possibilities that open up
when you combine the traditional ‘nose for news’ and ability
to tell a compelling story, with the sheer scale and range of
digital information now available
• Data journalism can help a journalist tell a complex story
through engaging infographics.
• Getting started with data journalism does not require huge
resources.
PROCESSES IN DATA JOURNALISM
• 1. Finding data
expert knowledge, contacts and databases
Ability to use computer assisted reporting skills, technical skills such as
MySQL or Python in data gathering.
• 2. Interrogating data
A good understanding of jargon and the wider context within which data
sits, statistics, - a familiarity with requisite tools.
• 3. Cleaning Data
Raw data can be difficult to make meaning of, therefore you need to
process data set into a format which is easily analysed.
Processes
4. Getting The Story Out of Data
The most useful output of analysis is getting a story out of the data through
statistical or mathematical processing.
5. Visualising data
At this stage, data is converted into information through the use of graphics,
pictures, diagrams and charts.
6. Mashing data
The output at this stage is integrated with web page or complementary
elements from two or more sources. Mash-ups are often created by using a
development approach called Ajax.
7. Publish, Distribute, Measure:
Example of data visualization
Examples of Data Visualization
Examples of Data Visualization
Examples of Data Visualization
DATA JOURNALISM TOOLS
1. The spreadsheet
Almost every data journalist begins with the spreadsheet.
2. SQL
SQL allows you to describe exactly the subset of data you want to extract or the exact
changes you want to make, and it allows you to perform these queries across related
data sets.
3. Data cleaning tools
Most data sets are “dirty.” To clean the data and get it into a useful format, you will
need a variety of tools including Google Refine.
4. Visualization tools
A good visualization will allow you to see outliers and trends in ways that can
profoundly alter your understanding of the data. A couple of Web-based visualization
tools which are used in data journalism Google Fusion Tables and Tableau Public.
DATA JOURNALISM TOOLS
5. Mapping software
Google Fusion tables and Tableau Public both include quick and intuitive mapping
capabilities.
6. Scripting language
Python and Ruby seem to be the current favourites among journalists.
7. Web framework
A framework will keep the boring, repetitive work out of your way, help you adopt
best practices, keep you organized and make it easier to collaborate with others.
8. A Flexible editor
To write code, you need a code editor
9. Revision control
10. Document analysis tools
HOW TO BECOME A DATA JOURNALIST
• Journalists have to balance their role in responding to events
with their role as an active seeker of stories - and data is no
different. The New York Times' Aron Pilhofer recommends that
you "Start small, and start with something you already know
and already do. And always, always, always remember that the
goal here is journalism." It is better to find a story that will be
best told through numbers.
• There is no shortage of data being released that you can get
your journalistic teeth into. The open data movement in Ghana
and internationally is seeing a continual release of newsworthy
data, and it's relatively easy to find datasets being released by
MDAs.
HOW TO BECOME A DATA JOURNALIST
• A second approach, however, is to start with a question - "Do speed cameras cost
or save money?" for example, and then search for the data that might answer it.
• Whichever approach you take, it's likely that the real work will lie in finding the
further bits of information and data to fill out the picture you're trying to
clarify. Government data, for example, will often come littered with jargon and
codes you'll need to understand. A call to the relevant organisation can shed some
light. If that's taking too long, an advanced search for one of the more obscure
codes can help too.
• You'll also need to contextualise the initial data with further data. Say you have
some information about a government department's changing wage bill, for
example: has the department workforce expanded? How does it compare to other
government departments? What about wider wages within the industry? What
about inflation and changes in the cost of living? This context can make a
difference between missing and spotting a story.
HOW TO BECOME A DATA JOURNALIST
• Quite often your data will need cleaning up: look out for different names for the
same thing, spelling and punctuation errors, poorly formatted fields (e.g. dates that
are formatted as text), incorrectly entered data and information that is missing
entirely. Tools like Freebase Gridworks can help here.
• At other times the dataset you need will come in an inconvenient format, such as a
PDF, Powerpoint, or a rather ugly webpage. If you're lucky, you may be able to copy
and paste the data into a spreadsheet. But you won't always be lucky.
• At these moments some programming knowledge comes in handy. There's a sliding
scale here: at one end are those who can write scripts from scratch that scrape a
webpage and store the information in a spreadsheet. Alternatively, you can use a
website like Scraperwiki which already has example scripts that you can customise
to your ends - and a community to help. Then there are online tools like Yahoo!
Pipes and the Firefox plugin OutWit Hub. If the data is in a HTML table you can
evenwrite a one-line formula in Google Spreadsheets to pull it in. Failing all the
above, you might just have to record it by hand - but whatever you do, make sure
you publish your spreadsheet online and blog about it so others don't have to
repeat your hard work.
HOW TO BECOME A DATA JOURNALIST
• Once you have the data you need to tell the story, you need to get it ready
to visualise. Trim off everything peripheral to what you need in order to
visualise your story. There are dozens of free online tools you can use to
do this. ManyEyes and Tableau Public are good places to start for charts.
• Play around. If you're good with a graphics package, try making the
visualisation clearer through colour and labelling. And always include a
piece of text giving a link to the data and its source - because infographics
tend to become separated from their original context as they make their
way around the web.
• For maps, the wonderful OpenHeatMap is very easy to use - as long as
your data is categorised by country, local authority, constituency, region or
county. Or you can use Yahoo! Pipes to map the points of interest. Both of
these are actually examples of mashups.
HOW TO BECOME A DATA JOURNALIST
• Data literacy includes statistical literacy but
also understanding how to work with large
data sets, how they were produced, how to
connect various data sets and how to interpret
them.
• You can easily become a seasoned data-
journalist by asking 3 very simple questions.
HOW TO BECOME A DATA JOURNALIST
1. You need to ask your self: How was the data collected?
• When in doubt about a number’s credibility, always double check
2. The second question you need to ask yourself is: What’s in there to learn?
• Always take the distribution and base rate into account. Checking for the mean
and median, as well as mode (the most frequent value in the distribution) helps
you gain insights in the data. Knowing the order of magnitude makes
contextualization easier. Finally, reporting in natural frequencies (1 in 100) is way
easier for readers to understand that using percentage (1%).
BENEFITS OF USING DATA
JOURNALISM SKILLS
• It helps you find and write stories from the huge amount of
data released by governments every day
• It helps you find the strongest stories more quickly.
• Data journalism can help a journalist tell a complex story
through engaging visualizations.
• It can help explain why and how a story relates to the
individual
• Data journalism helps open up the news gathering process
itself

More Related Content

What's hot

What's hot (20)

Politcal reporting in india
Politcal reporting in indiaPolitcal reporting in india
Politcal reporting in india
 
History of Online Journalism 2014
History of Online Journalism 2014History of Online Journalism 2014
History of Online Journalism 2014
 
What is Digital Journalism?
What is Digital Journalism?What is Digital Journalism?
What is Digital Journalism?
 
Role of journalism ppt
Role of journalism pptRole of journalism ppt
Role of journalism ppt
 
News source
News sourceNews source
News source
 
Speeches, News Conferences & Meetings
Speeches, News Conferences & MeetingsSpeeches, News Conferences & Meetings
Speeches, News Conferences & Meetings
 
Online journalism
Online journalismOnline journalism
Online journalism
 
Newsroom 3.0
Newsroom 3.0Newsroom 3.0
Newsroom 3.0
 
Citizen journalism slides
Citizen journalism slidesCitizen journalism slides
Citizen journalism slides
 
Online Journalism
Online JournalismOnline Journalism
Online Journalism
 
Yellow journalism
Yellow journalismYellow journalism
Yellow journalism
 
Investigative reporting
Investigative reportingInvestigative reporting
Investigative reporting
 
Journalists and their Sources
Journalists and their SourcesJournalists and their Sources
Journalists and their Sources
 
What are news values
What are news valuesWhat are news values
What are news values
 
Mechanics of news editing
Mechanics of news editingMechanics of news editing
Mechanics of news editing
 
How to write a news package
How to write a news packageHow to write a news package
How to write a news package
 
Reporting types of beat unit 3
Reporting types of beat unit 3Reporting types of beat unit 3
Reporting types of beat unit 3
 
Sources of news
Sources of newsSources of news
Sources of news
 
Ethics In Journalism
Ethics In JournalismEthics In Journalism
Ethics In Journalism
 
Writing For Public Relations: On Writing And Editing
Writing For Public Relations: On Writing And EditingWriting For Public Relations: On Writing And Editing
Writing For Public Relations: On Writing And Editing
 

Similar to Data journalism presentation

Uses of maps and illustrations in newspaper
Uses of maps and illustrations in newspaperUses of maps and illustrations in newspaper
Uses of maps and illustrations in newspaperRoshan Mastana
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3varshakumar21
 
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...News Leaders Association's NewsTrain
 
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...News Leaders Association's NewsTrain
 
MPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisMPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisShawn Day
 
August Designstorm: Alternative Reporting Formats
August Designstorm: Alternative Reporting FormatsAugust Designstorm: Alternative Reporting Formats
August Designstorm: Alternative Reporting FormatsAmanda Makulec
 
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...News Leaders Association's NewsTrain
 
Aftros
Aftros Aftros
Aftros Sezzar
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Marianijessicamariani
 
Unit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxUnit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxAnusuya123
 
Spreadsheet problems
Spreadsheet problemsSpreadsheet problems
Spreadsheet problemsAugust Severn
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Thinkful
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
open-data-presentation.pptx
open-data-presentation.pptxopen-data-presentation.pptx
open-data-presentation.pptxDennicaRivera
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analyticsGahya Pandian
 
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)Stefan Popowycz
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
 
CHAPTER5Database Systemsand Big DataRafal Olechows
CHAPTER5Database Systemsand Big DataRafal OlechowsCHAPTER5Database Systemsand Big DataRafal Olechows
CHAPTER5Database Systemsand Big DataRafal OlechowsJinElias52
 

Similar to Data journalism presentation (20)

Uses of maps and illustrations in newspaper
Uses of maps and illustrations in newspaperUses of maps and illustrations in newspaper
Uses of maps and illustrations in newspaper
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
Bringing a data mindset to your reporting - Brant Houston - Illinois NewsTrai...
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...
Data-driven enterprise off your beat - Aaron Mendelson - Fresno NewsTrain 4.2...
 
MPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for AnalysisMPhil Lecture on Data Vis for Analysis
MPhil Lecture on Data Vis for Analysis
 
August Designstorm: Alternative Reporting Formats
August Designstorm: Alternative Reporting FormatsAugust Designstorm: Alternative Reporting Formats
August Designstorm: Alternative Reporting Formats
 
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...
Data-driven enterprise off your beat - Todd Wallack - New England NewsTrain -...
 
Aftros
Aftros Aftros
Aftros
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Mariani
 
Unit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxUnit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptx
 
Spreadsheet problems
Spreadsheet problemsSpreadsheet problems
Spreadsheet problems
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
open-data-presentation.pptx
open-data-presentation.pptxopen-data-presentation.pptx
open-data-presentation.pptx
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analytics
 
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)
Visualizing Healthcare Data with Tableau (Toronto Central LHIN Presentation)
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
CHAPTER5Database Systemsand Big DataRafal Olechows
CHAPTER5Database Systemsand Big DataRafal OlechowsCHAPTER5Database Systemsand Big DataRafal Olechows
CHAPTER5Database Systemsand Big DataRafal Olechows
 

Recently uploaded

12042024_First India Newspaper Jaipur.pdf
12042024_First India Newspaper Jaipur.pdf12042024_First India Newspaper Jaipur.pdf
12042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Transforming Andhra Pradesh: TDP's Legacy in Road Connectivity
Transforming Andhra Pradesh: TDP's Legacy in Road ConnectivityTransforming Andhra Pradesh: TDP's Legacy in Road Connectivity
Transforming Andhra Pradesh: TDP's Legacy in Road Connectivitynarsireddynannuri1
 
Political-Ideologies-and-The-Movements.pptx
Political-Ideologies-and-The-Movements.pptxPolitical-Ideologies-and-The-Movements.pptx
Political-Ideologies-and-The-Movements.pptxSasikiranMarri
 
11042024_First India Newspaper Jaipur.pdf
11042024_First India Newspaper Jaipur.pdf11042024_First India Newspaper Jaipur.pdf
11042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Emerging issues in migration policies.ppt
Emerging issues in migration policies.pptEmerging issues in migration policies.ppt
Emerging issues in migration policies.pptNandinituteja1
 
13042024_First India Newspaper Jaipur.pdf
13042024_First India Newspaper Jaipur.pdf13042024_First India Newspaper Jaipur.pdf
13042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
15042024_First India Newspaper Jaipur.pdf
15042024_First India Newspaper Jaipur.pdf15042024_First India Newspaper Jaipur.pdf
15042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
16042024_First India Newspaper Jaipur.pdf
16042024_First India Newspaper Jaipur.pdf16042024_First India Newspaper Jaipur.pdf
16042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
14042024_First India Newspaper Jaipur.pdf
14042024_First India Newspaper Jaipur.pdf14042024_First India Newspaper Jaipur.pdf
14042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Geostrategic significance of South Asian countries.ppt
Geostrategic significance of South Asian countries.pptGeostrategic significance of South Asian countries.ppt
Geostrategic significance of South Asian countries.pptUsmanKaran
 
lok sabha Elections in india- 2024 .pptx
lok sabha Elections in india- 2024 .pptxlok sabha Elections in india- 2024 .pptx
lok sabha Elections in india- 2024 .pptxdigiyvbmrkt
 
Power in International Relations (Pol 5)
Power in International Relations (Pol 5)Power in International Relations (Pol 5)
Power in International Relations (Pol 5)ssuser583c35
 
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...The Lifesciences Magazine
 
Foreign Relation of Pakistan with Neighboring Countries.pptx
Foreign Relation of Pakistan with Neighboring Countries.pptxForeign Relation of Pakistan with Neighboring Countries.pptx
Foreign Relation of Pakistan with Neighboring Countries.pptxunark75
 

Recently uploaded (14)

12042024_First India Newspaper Jaipur.pdf
12042024_First India Newspaper Jaipur.pdf12042024_First India Newspaper Jaipur.pdf
12042024_First India Newspaper Jaipur.pdf
 
Transforming Andhra Pradesh: TDP's Legacy in Road Connectivity
Transforming Andhra Pradesh: TDP's Legacy in Road ConnectivityTransforming Andhra Pradesh: TDP's Legacy in Road Connectivity
Transforming Andhra Pradesh: TDP's Legacy in Road Connectivity
 
Political-Ideologies-and-The-Movements.pptx
Political-Ideologies-and-The-Movements.pptxPolitical-Ideologies-and-The-Movements.pptx
Political-Ideologies-and-The-Movements.pptx
 
11042024_First India Newspaper Jaipur.pdf
11042024_First India Newspaper Jaipur.pdf11042024_First India Newspaper Jaipur.pdf
11042024_First India Newspaper Jaipur.pdf
 
Emerging issues in migration policies.ppt
Emerging issues in migration policies.pptEmerging issues in migration policies.ppt
Emerging issues in migration policies.ppt
 
13042024_First India Newspaper Jaipur.pdf
13042024_First India Newspaper Jaipur.pdf13042024_First India Newspaper Jaipur.pdf
13042024_First India Newspaper Jaipur.pdf
 
15042024_First India Newspaper Jaipur.pdf
15042024_First India Newspaper Jaipur.pdf15042024_First India Newspaper Jaipur.pdf
15042024_First India Newspaper Jaipur.pdf
 
16042024_First India Newspaper Jaipur.pdf
16042024_First India Newspaper Jaipur.pdf16042024_First India Newspaper Jaipur.pdf
16042024_First India Newspaper Jaipur.pdf
 
14042024_First India Newspaper Jaipur.pdf
14042024_First India Newspaper Jaipur.pdf14042024_First India Newspaper Jaipur.pdf
14042024_First India Newspaper Jaipur.pdf
 
Geostrategic significance of South Asian countries.ppt
Geostrategic significance of South Asian countries.pptGeostrategic significance of South Asian countries.ppt
Geostrategic significance of South Asian countries.ppt
 
lok sabha Elections in india- 2024 .pptx
lok sabha Elections in india- 2024 .pptxlok sabha Elections in india- 2024 .pptx
lok sabha Elections in india- 2024 .pptx
 
Power in International Relations (Pol 5)
Power in International Relations (Pol 5)Power in International Relations (Pol 5)
Power in International Relations (Pol 5)
 
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...
Mitochondrial Fusion Vital for Adult Brain Function and Disease Understanding...
 
Foreign Relation of Pakistan with Neighboring Countries.pptx
Foreign Relation of Pakistan with Neighboring Countries.pptxForeign Relation of Pakistan with Neighboring Countries.pptx
Foreign Relation of Pakistan with Neighboring Countries.pptx
 

Data journalism presentation

  • 1. DATA JOURNALISM Making news more meaningful
  • 2. CONTENT • Introduction • What is data journalism • Processes in data journalism • Data journalism tools • How to become a data journalist • Benefits of using data journalism skills
  • 3. INTRODUCTION • The aim of this presentation is to provide a starting place for introducing data journalism into the newsroom. It aims to help journalists to start to develop the skills to write data stories. • Data journalism is a journalism specialty reflecting the increased role that numerical data is used in the production and distribution of information in the digital era. It reflects the increased interaction between content producers (journalist) and several other fields such as design, computer science and statistics. From the point of view of journalists, it represents "an overlapping set of competencies drawn from disparate fields".
  • 4. WHAT IS DATA JOURNALISM? • What makes data journalism different to the rest of journalism? Perhaps it is the new possibilities that open up when you combine the traditional ‘nose for news’ and ability to tell a compelling story, with the sheer scale and range of digital information now available • Data journalism can help a journalist tell a complex story through engaging infographics. • Getting started with data journalism does not require huge resources.
  • 5. PROCESSES IN DATA JOURNALISM • 1. Finding data expert knowledge, contacts and databases Ability to use computer assisted reporting skills, technical skills such as MySQL or Python in data gathering. • 2. Interrogating data A good understanding of jargon and the wider context within which data sits, statistics, - a familiarity with requisite tools. • 3. Cleaning Data Raw data can be difficult to make meaning of, therefore you need to process data set into a format which is easily analysed.
  • 6. Processes 4. Getting The Story Out of Data The most useful output of analysis is getting a story out of the data through statistical or mathematical processing. 5. Visualising data At this stage, data is converted into information through the use of graphics, pictures, diagrams and charts. 6. Mashing data The output at this stage is integrated with web page or complementary elements from two or more sources. Mash-ups are often created by using a development approach called Ajax. 7. Publish, Distribute, Measure:
  • 7. Example of data visualization
  • 8. Examples of Data Visualization
  • 9. Examples of Data Visualization
  • 10. Examples of Data Visualization
  • 11. DATA JOURNALISM TOOLS 1. The spreadsheet Almost every data journalist begins with the spreadsheet. 2. SQL SQL allows you to describe exactly the subset of data you want to extract or the exact changes you want to make, and it allows you to perform these queries across related data sets. 3. Data cleaning tools Most data sets are “dirty.” To clean the data and get it into a useful format, you will need a variety of tools including Google Refine. 4. Visualization tools A good visualization will allow you to see outliers and trends in ways that can profoundly alter your understanding of the data. A couple of Web-based visualization tools which are used in data journalism Google Fusion Tables and Tableau Public.
  • 12. DATA JOURNALISM TOOLS 5. Mapping software Google Fusion tables and Tableau Public both include quick and intuitive mapping capabilities. 6. Scripting language Python and Ruby seem to be the current favourites among journalists. 7. Web framework A framework will keep the boring, repetitive work out of your way, help you adopt best practices, keep you organized and make it easier to collaborate with others. 8. A Flexible editor To write code, you need a code editor 9. Revision control 10. Document analysis tools
  • 13. HOW TO BECOME A DATA JOURNALIST • Journalists have to balance their role in responding to events with their role as an active seeker of stories - and data is no different. The New York Times' Aron Pilhofer recommends that you "Start small, and start with something you already know and already do. And always, always, always remember that the goal here is journalism." It is better to find a story that will be best told through numbers. • There is no shortage of data being released that you can get your journalistic teeth into. The open data movement in Ghana and internationally is seeing a continual release of newsworthy data, and it's relatively easy to find datasets being released by MDAs.
  • 14. HOW TO BECOME A DATA JOURNALIST • A second approach, however, is to start with a question - "Do speed cameras cost or save money?" for example, and then search for the data that might answer it. • Whichever approach you take, it's likely that the real work will lie in finding the further bits of information and data to fill out the picture you're trying to clarify. Government data, for example, will often come littered with jargon and codes you'll need to understand. A call to the relevant organisation can shed some light. If that's taking too long, an advanced search for one of the more obscure codes can help too. • You'll also need to contextualise the initial data with further data. Say you have some information about a government department's changing wage bill, for example: has the department workforce expanded? How does it compare to other government departments? What about wider wages within the industry? What about inflation and changes in the cost of living? This context can make a difference between missing and spotting a story.
  • 15. HOW TO BECOME A DATA JOURNALIST • Quite often your data will need cleaning up: look out for different names for the same thing, spelling and punctuation errors, poorly formatted fields (e.g. dates that are formatted as text), incorrectly entered data and information that is missing entirely. Tools like Freebase Gridworks can help here. • At other times the dataset you need will come in an inconvenient format, such as a PDF, Powerpoint, or a rather ugly webpage. If you're lucky, you may be able to copy and paste the data into a spreadsheet. But you won't always be lucky. • At these moments some programming knowledge comes in handy. There's a sliding scale here: at one end are those who can write scripts from scratch that scrape a webpage and store the information in a spreadsheet. Alternatively, you can use a website like Scraperwiki which already has example scripts that you can customise to your ends - and a community to help. Then there are online tools like Yahoo! Pipes and the Firefox plugin OutWit Hub. If the data is in a HTML table you can evenwrite a one-line formula in Google Spreadsheets to pull it in. Failing all the above, you might just have to record it by hand - but whatever you do, make sure you publish your spreadsheet online and blog about it so others don't have to repeat your hard work.
  • 16. HOW TO BECOME A DATA JOURNALIST • Once you have the data you need to tell the story, you need to get it ready to visualise. Trim off everything peripheral to what you need in order to visualise your story. There are dozens of free online tools you can use to do this. ManyEyes and Tableau Public are good places to start for charts. • Play around. If you're good with a graphics package, try making the visualisation clearer through colour and labelling. And always include a piece of text giving a link to the data and its source - because infographics tend to become separated from their original context as they make their way around the web. • For maps, the wonderful OpenHeatMap is very easy to use - as long as your data is categorised by country, local authority, constituency, region or county. Or you can use Yahoo! Pipes to map the points of interest. Both of these are actually examples of mashups.
  • 17. HOW TO BECOME A DATA JOURNALIST • Data literacy includes statistical literacy but also understanding how to work with large data sets, how they were produced, how to connect various data sets and how to interpret them. • You can easily become a seasoned data- journalist by asking 3 very simple questions.
  • 18. HOW TO BECOME A DATA JOURNALIST 1. You need to ask your self: How was the data collected? • When in doubt about a number’s credibility, always double check 2. The second question you need to ask yourself is: What’s in there to learn? • Always take the distribution and base rate into account. Checking for the mean and median, as well as mode (the most frequent value in the distribution) helps you gain insights in the data. Knowing the order of magnitude makes contextualization easier. Finally, reporting in natural frequencies (1 in 100) is way easier for readers to understand that using percentage (1%).
  • 19. BENEFITS OF USING DATA JOURNALISM SKILLS • It helps you find and write stories from the huge amount of data released by governments every day • It helps you find the strongest stories more quickly. • Data journalism can help a journalist tell a complex story through engaging visualizations. • It can help explain why and how a story relates to the individual • Data journalism helps open up the news gathering process itself