A | B | C | D | E | F | G | H | I | J | |
---|---|---|---|---|---|---|---|---|---|---|
1 | National Poverty lines Dataset - v1 via Gapminder | |||||||||
2 | About this file | This file has multiple sheets with data for one or more indicators used by Gapminder. In this sheet below you'll first find an overview of the indicators (measures) and the list of underlying sources. The actual data we use is found in the sheet(s) labeled "data-...". This file also documents the data process. To follow how the data was transformed from the original sources, start in the sheet to the far right, which holds the input data. You can then follow the process step by step, by looking at the formulas in the sheets from right to left, until you reach the output in the "data-..." sheets. | ||||||||
3 | Version: | v1 | ||||||||
4 | Updated: | February 4 2020 | ||||||||
5 | Download latest version: | Excel file » | ||||||||
6 | Latest version online: | http://gapm.io/dnpl | ||||||||
7 | Contributor(s) to this version: | Diane I | ||||||||
8 | Feedback | Please give feedback here | ||||||||
9 | ||||||||||
10 | # | Indicator(s) | Description | Full name | Unit | ID | type | Usage | ||
11 | 1 | National poverty line | This is the minimum level of income deemed adequate in a particular country. It is converted to international dollars using purchasing power parity (PPP) rates and is expressed in per capita terms per day. | National Poverty line, PPP (2011 US$ per day) | US$ | npl | measure | 2 | ||
12 | ||||||||||
13 | Sources | |||||||||
14 | Dataset description: | The national poverty lines have been imported from a World Bank Policy Research Working Paper, "Estimating International Poverty Lines from Comparable National Thresholds", written by Dean Jolliffe and Espen Beer Prydz in 2016. The paper aimed to address the critiques of the Ravallion, Chen and Sangraula (RCS) data, which was used to construct the International Poverty Line (IPL) of $1.90 per day, by proposing both a new data set of national poverty lines and an approach for estimating a new set of IPLs that addresses the concern that the official line is too frugal or irrelevant. Their approach was based on estimating implicit national poverty lines by combining national poverty headcounts from national sources, reported in the World Bank’s databases, with corresponding consumption and income distributions from PovcalNet used for international poverty estimates. The approach yields a significantly larger set of national poverty lines, with greater temporal and country coverage. It also yields national poverty lines that are all expressed in per-capita units and that result in poverty estimates that match the official poverty estimates. To implement this approach, they used 1,376 income and consumption distributions from 154 countries and territories available in PovcalNet. For 1,158 of these distributions, PovcalNet used microdata when estimating poverty and inequality, and reports 100 points from the corresponding Lorenz curve (percentiles and percentile shares) for each distribution in the online detailed output. For the remaining 218 distributions, grouped data were used for the estimation, and in these cases only 20 (or sometimes fewer) points of the Lorenz curve were available in the detailed output. For each publicly available Lorenz curve, they generated synthetic distributions with 1,000 points, using the ungroup command included in the DASP Stata Package (Abdelkrim and Duclos, 2007). 
They then applied the adjustment proposed by Shorrocks and Wan (2008), which ensures that the fitted distribution matches the observed shares in the grouped data. This approach and adjustment produce synthetic distributions with a high degree of precision, particularly in the cases where PovcalNet reports Lorenz curves with percentiles. The vast majority of the national poverty headcounts they use to estimate the implicit national lines come from the World Bank’s series of poverty headcount ratios at national poverty lines, available in its Poverty and Inequality Database. This data set contains 800 poverty rates at national poverty lines. Of these, they were able to match 699 observations from 107 countries with surveys available in PovcalNet. As the World Bank’s series of national poverty headcounts does not include estimates for most high-income countries, they supplemented the sample with national poverty estimates from the OECD based on relative poverty lines. For the U.S., one of the few rich countries using absolute poverty lines, they included official national poverty headcounts, and for Canada they used the nationally reported prevalence of low-income status. They end up deriving 864 ‘implicit’ national poverty lines for 129 countries, which correspond to officially reported national poverty rates when applied to the PovcalNet per capita welfare measure. An assessment of the precision of their methodology and more details can be accessed here: http://documents.worldbank.org/curated/en/837051468184454513/pdf/WPS7606.pdf | ||||||||
15 | Link to documentation: | http://gapm.io/dnpl | ||||||||
16 | Short source summary: | jp | ||||||||
17 | ||||||||||
18 | # | Source id | Name | Link | ||||||
19 | 1 | jp16 | Estimating International Poverty Lines from Comparable National Thresholds | http://documents.worldbank.org/curated/en/837051468184454513/pdf/WPS7606.pdf | ||||||
20 | ||||||||||
21 | License | |||||||||
22 | Attribution: | We believe in free knowledge and therefore we share free data. Most sheets in this file are provided under the open Creative Commons Attribution License (CC BY 4.0), except those sheets mentioned in the exceptions section below. This means you can freely use, copy, and spread the data in those sheets, as long as you mention the following: 'Free data from Gapminder.org'. You should also mention the underlying data sources listed above and include this link: http://gapm.io/dnpl | ||||||||
23 | License link: | Creative Commons License CC BY 4.0 | ||||||||
24 | Exceptions: | The sheets starting with the word "data" are covered by this license. Other sheets are included for documentation purposes, and may include data that is governed by other licenses. Check the underlying sources for the specific licenses in these cases. | ||||||||
25 | ||||||||||
26 | Versions | Link | Changes compared to previous | Date | Contributors | |||||
27 | v1 | https://docs.google.com/spreadsheets/d/1ugp11zwqm2hwNHamJpcuMPQmyvBz03E4dMjl7alHKeE/edit#gid=1962563621 | First version | 2020 February 4 | Diane I | |||||
28 | ||||||||||
29 | Technical stuff | |||||||||
30 | Dataset name: | National Poverty lines | ||||||||
31 | Dataset id: | npl | ||||||||
32 | Doc url | https://docs.google.com/spreadsheets/d/1ugp11zwqm2hwNHamJpcuMPQmyvBz03E4dMjl7alHKeE/edit#gid=1962563621 | ||||||||
33 | Doc id of work doc spreadsheet | 1ugp11zwqm2hwNHamJpcuMPQmyvBz03E4dMjl7alHKeE | ||||||||
34 | Formulas | The formulas in this workbook may be referring to other spreadsheets online, by their named ranges, and not by sheet names. Search for "named ranges" to see how to use those instead of cell ranges. | ||||||||
35 | For developers | If you would like to use our data in your own products, it's better to fetch it from our standardized GitHub repo at https://open-numbers.github.io. These spreadsheets are part of Gapminder's data compilation process and allow end users to track how we combine data. | ||||||||
36 | Read more: | gapm.io/dataworks | ||||||||
37 | CHART PREVIEWS | |||||||||
38 | [c] | data-for-countries-etc-by-year | https://www.gapminder.org/tools/#$state$time$dim=time;&entities$dim=geo;&entities_colorlegend$dim=geo;&marker$axis_x$which=income_per_person_gdppercapita_ppp_inflation_adjusted&scaleType=log&spaceRef:null;&axis_y$data=data_&which=National%20poverty%20line&spaceRef:null;&label$which=name&scaleType=ordinal;&size$which=population_total&use=indicator&scaleType=linear;&color$which=_default&use=constant&scaleType=ordinal;;;&data$reader=ddfbw&service=https:////big-waffle.gapminder.org&dataset=sg-master&translateContributionLink:///crowdin.com//project//systema-globalis;&data_$reader=google_csv&path=https:////docs.google.com//spreadsheets//d//1ugp11zwqm2hwNHamJpcuMPQmyvBz03E4dMjl7alHKeE//gviz//tq?tqx=out:csv/&sheet=data-for-countries-etc-by-year&hasNameColumn:true&nameColumnIndex:1;&chart-type=bubbles | |||||||
39 | National%20poverty%20line | |||||||||
40 | DDF mapping: | schema for indicator table | ||||||||
41 | concept_id | 6 | ||||||||
42 | name_short | 2 | ||||||||
43 | name | 4 | ||||||||
44 | description | 3 | ||||||||
45 | unit | 5 | ||||||||
46 | type | 7 | ||||||||
47 | usage | 8 | ||||||||
48 | ||||||||||
49 | Catalog status | Indicator ID | Time unit | Countries etc | Regions | In. Levels | World | |||
50 | National poverty line | npl | year | Loading... | - | - | - | |||
51 | ||||||||||
52 | # | Validation | ||||||||
53 | 1 | output-sheets | GOOD: There is at least one output sheet present (sheets starting with 'data-for-' and not ending with '-in-columns') | |||||||
54 | 2 | output-sheet:data-for-countries-etc-by-year | GOOD: The 'data-for-countries-etc-by-year' output sheet has at least 4 header columns | |||||||
55 | 3 | output-sheet:data-for-countries-etc-by-year | GOOD: The 'data-for-countries-etc-by-year' output sheet does not have filter mode turned on (since it breaks the CSV endpoint) | |||||||
56 | 4 | version | GOOD: Named range 'version' exists | |||||||
57 | 5 | version | GOOD: 'Version:' is filled in | |||||||
58 | 6 | version | GOOD: The version at 'Version:' starts with a v, followed by an integer | |||||||
59 | 7 | date | GOOD: Named range 'date' exists | |||||||
60 | 8 | date | GOOD: 'Updated:' is filled in | |||||||
61 | 9 | gapmio | GOOD: Named range 'gapmio' exists | |||||||
62 | 10 | gapmio | GOOD: 'Latest version online:' is filled in | |||||||
63 | 11 | contributors | GOOD: Named range 'contributors' exists | |||||||
64 | 12 | contributors | GOOD: 'Contributor(s) to this version:' is filled in | |||||||
65 | 13 | indicator_table | GOOD: Named range 'indicator_table' exists | |||||||
66 | 14 | indicator_table | GOOD: The named range 'indicator_table' covers the whole Indicator(s) table (the rows immediately above and below the table are empty) | |||||||
67 | 15 | indicator_table:row_1 | GOOD: This first column of row '1' in the indicator(s) table is incremental (from 1 and up) | |||||||
68 | 16 | indicator_table:row_1 | GOOD: Indicator 1 has a short indicator name (Column 2) | |||||||
69 | 17 | indicator_table:row_1:data-for-countries-etc-by-year | GOOD: The indicator name cell of indicator 1 is referenced in the 'data-for-countries-etc-by-year' output sheet in column 4 as "=ABOUT!B11" | |||||||
70 | 18 | indicator_table:row_1 | GOOD: Indicator 1 has a description (Column 3) | |||||||
71 | 19 | indicator_table:row_1 | GOOD: Indicator 1 has a full name (Column 4) | |||||||
72 | 20 | indicator_table:row_1 | GOOD: Indicator 1 has a unit (Column 5) | |||||||
73 | 21 | indicator_table:row_1 | GOOD: Indicator 1's unit does not start or end with a space | |||||||
74 | 22 | indicator_table:row_1 | GOOD: Indicator 1 has an ID (Column 6) | |||||||
75 | 23 | indicator_table:row_1 | GOOD: Indicator 1's ID contains only lowercase latin characters (a-z) or numbers, and no space, dashes or underscores. (Column 6) | |||||||
76 | 24 | indicator_table:row_1 | GOOD: Indicator 1's ID has less than or equal to 20 characters | |||||||
77 | 25 | indicator_table:row_1 | GOOD: Indicator 1 has a type set (Column 7) | |||||||
78 | 26 | indicator_table:row_1 | GOOD: Indicator 1 has a usage level set (Column 8) | |||||||
79 | 27 | dataset_description | GOOD: Named range 'dataset_description' exists | |||||||
80 | 28 | dataset_description | GOOD: 'Dataset description:' is filled in | |||||||
81 | 29 | source_url | GOOD: Named range 'source_url' exists | |||||||
82 | 30 | source_url | GOOD: 'Link to documentation:' is filled in | |||||||
83 | 31 | source_short_text | GOOD: Named range 'source_short_text' exists | |||||||
84 | 32 | source_short_text | GOOD: 'Short source summary:' is filled in | |||||||
85 | 33 | source_short_text | GOOD: 'Short source summary:' has less than or equal to 45 characters | |||||||
86 | 34 | source_table | GOOD: Named range 'source_table' exists | |||||||
87 | 35 | source_table | GOOD: The named range 'source_table' covers the whole Sources table (the rows immediately above and below the table are empty) | |||||||
88 | 36 | source_table:row_1 | GOOD: This first column of row '1' in the sources table is incremental (from 1 and up) | |||||||
89 | 37 | source_table:row_1 | GOOD: Source 1 has a source id (Column 2) | |||||||
90 | 38 | source_table:row_1 | GOOD: Source 1 has a name (Column 3) | |||||||
91 | 39 | source_table:row_1 | GOOD: Source 1 has a link (Column 4) | |||||||
92 | 40 | version_table | GOOD: Named range 'version_table' exists | |||||||
93 | 41 | version_table | GOOD: The named range 'version_table' covers the whole Versions table (the rows immediately above and below the table are empty) | |||||||
94 | 42 | version_table:row_1 | GOOD: Version-listing 1 has a version reference set (Column 1) | |||||||
95 | 43 | version_table:row_1 | GOOD: The version in column 1 starts with a v, followed by an integer | |||||||
96 | 44 | version_table:row_1 | GOOD: Version-listing 1 has a link (Column 2) | |||||||
97 | 45 | version_table:row_1 | GOOD: Version-listing 1 has "Changes compared to previous" (Column 3) | |||||||
98 | 46 | dataset_name | GOOD: Named range 'dataset_name' exists | |||||||
99 | 47 | dataset_name | GOOD: 'Dataset name:' is filled in | |||||||
100 | 48 | dataset_id | GOOD: Named range 'dataset_id' exists |