Permalink
Browse files

Add Gujarat data - first step out of UP!

  • Loading branch information...
raphael-susewind committed Sep 29, 2016
1 parent db3859c commit db36e389326d7e255add3d14bc31e42ca2b5bd2a
Showing with 423,349 additions and 1 deletion.
  1. +7 −1 README.md
  2. +5 −0 combined-a.sql
  3. +1 −0 combined-b.sql
  4. +361 −0 gujcandidates2014/Candidates.csv
  5. +52 −0 gujcandidates2014/LICENSE.md
  6. +1,667 −0 gujcandidates2014/Political_parties.csv
  7. +25 −0 gujcandidates2014/README.md
  8. +62 −0 gujcandidates2014/charmap.py
  9. +422 −0 gujcandidates2014/createnamedb.pl
  10. +762 −0 gujcandidates2014/guesscommunity.pl
  11. +27 −0 gujcandidates2014/gujcandidates2014.csv
  12. +34 −0 gujcandidates2014/gujcandidates2014.sql
  13. +86 −0 gujcandidates2014/soundex.py
  14. +169 −0 gujcandidates2014/transform.pl
  15. +52 −0 gujgis/LICENSE.md
  16. +33 −0 gujgis/README.md
  17. +14 −0 gujgis/booths-locality.vrt
  18. +105 −0 gujgis/download.pl
  19. +45,303 −0 gujgis/gujgis.csv
  20. +45,306 −0 gujgis/gujgis.sql
  21. +107 −0 gujgis/proxy.pl
  22. +9 −0 gujgis/transform.sql
  23. +52 −0 gujid/LICENSE.md
  24. +33 −0 gujid/README.md
  25. BIN gujid/VolIII_DetailsOfAssemblySegmentsOfPC.pdf
  26. +46 −0 gujid/actopc.pl
  27. +62 −0 gujid/compress.pl
  28. +556 −0 gujid/gujid-b.sql
  29. +44,562 −0 gujid/gujid.csv
  30. +395 −0 gujloksabha2014/AC001.csv
  31. +280 −0 gujloksabha2014/AC002.csv
  32. +255 −0 gujloksabha2014/AC003.csv
  33. +252 −0 gujloksabha2014/AC004.csv
  34. +278 −0 gujloksabha2014/AC005.csv
  35. +288 −0 gujloksabha2014/AC006.csv
  36. +283 −0 gujloksabha2014/AC007.csv
  37. +237 −0 gujloksabha2014/AC008.csv
  38. +246 −0 gujloksabha2014/AC009.csv
  39. +271 −0 gujloksabha2014/AC010.csv
  40. +305 −0 gujloksabha2014/AC011.csv
  41. +248 −0 gujloksabha2014/AC012.csv
  42. +254 −0 gujloksabha2014/AC013.csv
  43. +235 −0 gujloksabha2014/AC014.csv
  44. +283 −0 gujloksabha2014/AC015.csv
  45. +272 −0 gujloksabha2014/AC016.csv
  46. +282 −0 gujloksabha2014/AC017.csv
  47. +267 −0 gujloksabha2014/AC018.csv
  48. +232 −0 gujloksabha2014/AC019.csv
  49. +225 −0 gujloksabha2014/AC020.csv
  50. +217 −0 gujloksabha2014/AC021.csv
  51. +218 −0 gujloksabha2014/AC022.csv
  52. +248 −0 gujloksabha2014/AC023.csv
  53. +270 −0 gujloksabha2014/AC024.csv
  54. +232 −0 gujloksabha2014/AC025.csv
  55. +218 −0 gujloksabha2014/AC026.csv
  56. +310 −0 gujloksabha2014/AC027.csv
  57. +322 −0 gujloksabha2014/AC028.csv
  58. +280 −0 gujloksabha2014/AC029.csv
  59. +358 −0 gujloksabha2014/AC030.csv
  60. +308 −0 gujloksabha2014/AC031.csv
  61. +300 −0 gujloksabha2014/AC032.csv
  62. +276 −0 gujloksabha2014/AC033.csv
  63. +222 −0 gujloksabha2014/AC034.csv
  64. +286 −0 gujloksabha2014/AC035.csv
  65. +227 −0 gujloksabha2014/AC036.csv
  66. +231 −0 gujloksabha2014/AC037.csv
  67. +206 −0 gujloksabha2014/AC038.csv
  68. +327 −0 gujloksabha2014/AC039.csv
  69. +265 −0 gujloksabha2014/AC040.csv
  70. +321 −0 gujloksabha2014/AC041.csv
  71. +302 −0 gujloksabha2014/AC042.csv
  72. +272 −0 gujloksabha2014/AC043.csv
  73. +216 −0 gujloksabha2014/AC044.csv
  74. +236 −0 gujloksabha2014/AC045.csv
  75. +230 −0 gujloksabha2014/AC046.csv
  76. +240 −0 gujloksabha2014/AC047.csv
  77. +222 −0 gujloksabha2014/AC048.csv
  78. +185 −0 gujloksabha2014/AC049.csv
  79. +242 −0 gujloksabha2014/AC050.csv
  80. +182 −0 gujloksabha2014/AC051.csv
  81. +208 −0 gujloksabha2014/AC052.csv
  82. +230 −0 gujloksabha2014/AC053.csv
  83. +208 −0 gujloksabha2014/AC054.csv
  84. +228 −0 gujloksabha2014/AC055.csv
  85. +205 −0 gujloksabha2014/AC056.csv
  86. +302 −0 gujloksabha2014/AC057.csv
  87. +231 −0 gujloksabha2014/AC058.csv
  88. +265 −0 gujloksabha2014/AC059.csv
  89. +255 −0 gujloksabha2014/AC060.csv
  90. +282 −0 gujloksabha2014/AC061.csv
  91. +243 −0 gujloksabha2014/AC062.csv
  92. +270 −0 gujloksabha2014/AC063.csv
  93. +284 −0 gujloksabha2014/AC064.csv
  94. +270 −0 gujloksabha2014/AC065.csv
  95. +1 −0 gujloksabha2014/AC066.csv
  96. +1 −0 gujloksabha2014/AC067.csv
  97. +1 −0 gujloksabha2014/AC068.csv
  98. +1 −0 gujloksabha2014/AC069.csv
  99. +1 −0 gujloksabha2014/AC070.csv
  100. +1 −0 gujloksabha2014/AC071.csv
  101. +1 −0 gujloksabha2014/AC072.csv
  102. +223 −0 gujloksabha2014/AC073.csv
  103. +265 −0 gujloksabha2014/AC074.csv
  104. +256 −0 gujloksabha2014/AC075.csv
  105. +265 −0 gujloksabha2014/AC076.csv
  106. +237 −0 gujloksabha2014/AC077.csv
  107. +194 −0 gujloksabha2014/AC078.csv
  108. +183 −0 gujloksabha2014/AC079.csv
  109. +246 −0 gujloksabha2014/AC080.csv
  110. +290 −0 gujloksabha2014/AC081.csv
  111. +264 −0 gujloksabha2014/AC082.csv
  112. +219 −0 gujloksabha2014/AC083.csv
  113. +208 −0 gujloksabha2014/AC084.csv
  114. +271 −0 gujloksabha2014/AC085.csv
  115. +262 −0 gujloksabha2014/AC086.csv
  116. +288 −0 gujloksabha2014/AC087.csv
  117. +237 −0 gujloksabha2014/AC088.csv
  118. +220 −0 gujloksabha2014/AC089.csv
  119. +228 −0 gujloksabha2014/AC090.csv
  120. +232 −0 gujloksabha2014/AC091.csv
  121. +230 −0 gujloksabha2014/AC092.csv
  122. +254 −0 gujloksabha2014/AC093.csv
  123. +252 −0 gujloksabha2014/AC094.csv
  124. +290 −0 gujloksabha2014/AC095.csv
  125. +224 −0 gujloksabha2014/AC096.csv
  126. +260 −0 gujloksabha2014/AC097.csv
  127. +263 −0 gujloksabha2014/AC098.csv
  128. +203 −0 gujloksabha2014/AC099.csv
  129. +241 −0 gujloksabha2014/AC100.csv
  130. +216 −0 gujloksabha2014/AC101.csv
  131. +273 −0 gujloksabha2014/AC102.csv
  132. +266 −0 gujloksabha2014/AC103.csv
  133. +224 −0 gujloksabha2014/AC104.csv
  134. +214 −0 gujloksabha2014/AC105.csv
  135. +268 −0 gujloksabha2014/AC106.csv
  136. +251 −0 gujloksabha2014/AC107.csv
  137. +212 −0 gujloksabha2014/AC108.csv
  138. +251 −0 gujloksabha2014/AC109.csv
  139. +210 −0 gujloksabha2014/AC110.csv
  140. +249 −0 gujloksabha2014/AC111.csv
  141. +254 −0 gujloksabha2014/AC112.csv
  142. +216 −0 gujloksabha2014/AC113.csv
  143. +199 −0 gujloksabha2014/AC114.csv
  144. +231 −0 gujloksabha2014/AC115.csv
  145. +232 −0 gujloksabha2014/AC116.csv
  146. +250 −0 gujloksabha2014/AC117.csv
  147. +229 −0 gujloksabha2014/AC118.csv
  148. +270 −0 gujloksabha2014/AC119.csv
  149. +292 −0 gujloksabha2014/AC120.csv
  150. +296 −0 gujloksabha2014/AC121.csv
  151. +324 −0 gujloksabha2014/AC122.csv
  152. +282 −0 gujloksabha2014/AC123.csv
  153. +251 −0 gujloksabha2014/AC124.csv
  154. +214 −0 gujloksabha2014/AC125.csv
  155. +267 −0 gujloksabha2014/AC126.csv
  156. +277 −0 gujloksabha2014/AC127.csv
  157. +296 −0 gujloksabha2014/AC128.csv
  158. +270 −0 gujloksabha2014/AC129.csv
  159. +278 −0 gujloksabha2014/AC130.csv
  160. +240 −0 gujloksabha2014/AC131.csv
  161. +264 −0 gujloksabha2014/AC132.csv
  162. +241 −0 gujloksabha2014/AC133.csv
  163. +253 −0 gujloksabha2014/AC134.csv
  164. +238 −0 gujloksabha2014/AC135.csv
  165. +253 −0 gujloksabha2014/AC136.csv
  166. +288 −0 gujloksabha2014/AC137.csv
  167. +295 −0 gujloksabha2014/AC138.csv
  168. +360 −0 gujloksabha2014/AC139.csv
  169. +243 −0 gujloksabha2014/AC140.csv
  170. +223 −0 gujloksabha2014/AC141.csv
  171. +261 −0 gujloksabha2014/AC142.csv
  172. +226 −0 gujloksabha2014/AC143.csv
  173. +241 −0 gujloksabha2014/AC144.csv
  174. +195 −0 gujloksabha2014/AC145.csv
  175. +220 −0 gujloksabha2014/AC146.csv
  176. +227 −0 gujloksabha2014/AC147.csv
  177. +290 −0 gujloksabha2014/AC148.csv
  178. +275 −0 gujloksabha2014/AC149.csv
  179. +254 −0 gujloksabha2014/AC150.csv
  180. +224 −0 gujloksabha2014/AC151.csv
  181. +304 −0 gujloksabha2014/AC152.csv
  182. +233 −0 gujloksabha2014/AC153.csv
  183. +224 −0 gujloksabha2014/AC154.csv
  184. +335 −0 gujloksabha2014/AC155.csv
  185. +231 −0 gujloksabha2014/AC156.csv
  186. +274 −0 gujloksabha2014/AC157.csv
  187. +359 −0 gujloksabha2014/AC158.csv
  188. +212 −0 gujloksabha2014/AC159.csv
  189. +164 −0 gujloksabha2014/AC160.csv
  190. +190 −0 gujloksabha2014/AC161.csv
  191. +165 −0 gujloksabha2014/AC162.csv
  192. +245 −0 gujloksabha2014/AC163.csv
  193. +224 −0 gujloksabha2014/AC164.csv
  194. +234 −0 gujloksabha2014/AC165.csv
  195. +244 −0 gujloksabha2014/AC166.csv
  196. +213 −0 gujloksabha2014/AC167.csv
  197. +354 −0 gujloksabha2014/AC168.csv
  198. +248 −0 gujloksabha2014/AC169.csv
  199. +250 −0 gujloksabha2014/AC170.csv
  200. +248 −0 gujloksabha2014/AC171.csv
  201. +318 −0 gujloksabha2014/AC172.csv
  202. +334 −0 gujloksabha2014/AC173.csv
  203. +286 −0 gujloksabha2014/AC174.csv
  204. +300 −0 gujloksabha2014/AC175.csv
  205. +361 −0 gujloksabha2014/AC176.csv
  206. +297 −0 gujloksabha2014/AC177.csv
  207. +264 −0 gujloksabha2014/AC178.csv
  208. +248 −0 gujloksabha2014/AC179.csv
  209. +227 −0 gujloksabha2014/AC180.csv
  210. +273 −0 gujloksabha2014/AC181.csv
  211. +261 −0 gujloksabha2014/AC182.csv
  212. +8,796 −0 gujloksabha2014/Candidates.csv
  213. +52 −0 gujloksabha2014/LICENSE.md
  214. +1,667 −0 gujloksabha2014/Political_parties.csv
  215. +37 −0 gujloksabha2014/README.md
  216. +47 −0 gujloksabha2014/combined.pl
  217. +12 −0 gujloksabha2014/download.pl
  218. +45,360 −0 gujloksabha2014/gujloksabha2014-a.sql
  219. +44,566 −0 gujloksabha2014/gujloksabha2014-b.sql
  220. +45,353 −0 gujloksabha2014/gujloksabha2014.csv
  221. +44,563 −0 gujloksabha2014/manual-corrections.csv
  222. +1,069 −0 gujloksabha2014/transform.pl
  223. +177 −0 gujrolls2014/LICENSE.md
  224. +43 −0 gujrolls2014/README.md
  225. BIN gujrolls2014/booths.sqlite.tgz
  226. +24 −0 gujrolls2014/combine.pl
  227. +45,282 −0 gujrolls2014/gujrolls2014.sql
  228. +37 −0 gujrolls2014/run-in-osc-add-gender/control.pl
  229. +119 −0 gujrolls2014/run-in-osc-add-gender/pdf2list.pl
  230. +24 −0 gujrolls2014/run-in-osc-add-gender/run.sh
  231. +28 −0 gujrolls2014/run-in-osc-add-gender/subcontrol.pl
  232. +65 −0 gujrolls2014/run-in-osc-add-ngram/addngram.pl
  233. +29 −0 gujrolls2014/run-in-osc-add-ngram/control.pl
  234. +32 −0 gujrolls2014/run-in-osc-add-ngram/createngram.pl
  235. +208 −0 gujrolls2014/run-in-osc-add-ngram/csv2stats.pl
  236. +24 −0 gujrolls2014/run-in-osc-add-ngram/run.sh
  237. +22 −0 gujrolls2014/run-in-osc-add-ngram/subcontrol.pl
  238. +62 −0 gujrolls2014/run-in-osc/charmap.py
  239. +40 −0 gujrolls2014/run-in-osc/control.pl
  240. +208 −0 gujrolls2014/run-in-osc/csv2stats.pl
  241. +665 −0 gujrolls2014/run-in-osc/pdf2list.pl
  242. +24 −0 gujrolls2014/run-in-osc/run.sh
  243. +91 −0 gujrolls2014/run-in-osc/soundex.py
  244. +34 −0 gujrolls2014/run-in-osc/subcontrol.pl
  245. +26 −0 gujrolls2014/transform.pl
@@ -1,6 +1,6 @@
# Data on religion and politics in India
This repository provides highly localized statistics on religion and politics in India under an open license. I aim to cover Uttar Pradesh as comprehensively as possible, and the rest of India during general elections (see [roadmap](https://github.com/raphael-susewind/india-religion-politics/tree/master/ROADMAP.md)) and/or if other people contribute. An (incomplete) list of academic usecases for this data is on [Google Scholar](https://scholar.google.com/scholar?oi=bibs&hl=de&cites=11938760322875868825).
This repository provides highly localized statistics on religion and politics in India under an open license. I aim to cover Uttar Pradesh as comprehensively as possible, and the rest of India during general elections (see [roadmap](https://github.com/raphael-susewind/india-religion-politics/tree/master/ROADMAP.md)) and/or if other people contribute. A (potentially incomplete) list of academic usecases for this data is on [Google Scholar](https://scholar.google.com/scholar?oi=bibs&hl=de&cites=11938760322875868825); there is also a separate folder with [examples](https://github.com/raphael-susewind/india-religion-politics/tree/master/examples) to replicate.
Fortunately, recent transparency initiatives by the Election Commission of India in general and the Chief Electoral Officer of UP in particular now allow researchers to shift the central unit of quantitative political analyses from the constituency level to that of polling booths, stations, and villages (earlier, such data had to be interpolated or estimated). Often, this data is not very user-friendly, though (think garbled, scanned PDFs). The purpose of this repository is to curate this data in a more accessible format and to share the scraping and cleanup code for reference. This official data is then supplemented with estimates of religious demography based on the religious connotations of electors' names in the voter lists (see below).
@@ -9,6 +9,12 @@ From 2013 to 2015, the whole dataset was located on my [personal website](https:
table | description
--- | ---
[examples](https://github.com/raphael-susewind/india-religion-politics/tree/master/examples) | Example queries that would replicate published papers based on this data
[gujid](https://github.com/raphael-susewind/india-religion-politics/tree/master/gujid) | ID matching and integration table for Gujarat (see below)
[gujgis](https://github.com/raphael-susewind/india-religion-politics/tree/master/gujgis) | GIS coordinates and other spatial characteristics of polling booths in Gujarat
[gujloksabha2014](https://github.com/raphael-susewind/india-religion-politics/tree/master/gujloksabha2014) | Booth-level (form 20) results for the 2014 Lok Sabha election from Gujarat
[gujcandidates2014](https://github.com/raphael-susewind/india-religion-politics/tree/master/gujcandidates2014) | Candidates and their likely religion for the 2014 Lok Sabha election from Gujarat
[gujrolls2014](https://github.com/raphael-susewind/india-religion-politics/tree/master/gujrolls2014) | Booth-level estimates of religious demography for 2014 across Gujarat
[upid](https://github.com/raphael-susewind/india-religion-politics/tree/master/upid) | ID matching and integration table for Uttar Pradesh (see below)
[upgis](https://github.com/raphael-susewind/india-religion-politics/tree/master/upgis) | GIS coordinates and other spatial characteristics of polling booths in Uttar Pradesh
[upvidhansabha2007](https://github.com/raphael-susewind/india-religion-politics/tree/master/upvidhansabha2007) | Booth-level (form 20) results for the 2007 Vidhan Sabha election in Uttar Pradesh
@@ -24,3 +24,8 @@
.read upcandidates2012/upcandidates2012.sql
.read upcandidates2014/upcandidates2014.sql
.read upgis/upgis.sql
.read gujloksabha2014/gujloksabha2014-a.sql
.read gujloksabha2014/gujloksabha2014-b.sql
.read gujcandidates2014/gujcandidates2014.sql
.read gujrolls2014/gujrolls2014.sql
.read gujgis/gujgis.sql
@@ -1,2 +1,3 @@
.read upid/upid-a.sql
.read upid/upid-b.sql
.read gujid/gujid-b.sql

Large diffs are not rendered by default.

Oops, something went wrong.
@@ -0,0 +1,52 @@
## ODC Database Contents License
The Licensor and You agree as follows:
### 1.0 Definitions of Capitalised Words
The definitions of the Open Database License (ODbL) 1.0 are incorporated
by reference into the Database Contents License.
### 2.0 Rights granted and Conditions of Use
2.1 Rights granted. The Licensor grants to You a worldwide,
royalty-free, non-exclusive, perpetual, irrevocable copyright license to
do any act that is restricted by copyright over anything within the
Contents, whether in the original medium or any other. These rights
explicitly include commercial use, and do not exclude any field of
endeavour. These rights include, without limitation, the right to
sublicense the work.
2.2 Conditions of Use. You must comply with the ODbL.
2.3 Relationship to Databases and ODbL. This license does not cover any
Database Rights, Database copyright, or contract over the Contents as
part of the Database. Please see the ODbL covering the Database for more
details about Your rights and obligations.
2.4 Non-assertion of copyright over facts. The Licensor takes the
position that factual information is not covered by copyright. The DbCL
grants you permission for any information having copyright contained in
the Contents.
### 3.0 Warranties, disclaimer, and limitation of liability
3.1 The Contents are licensed by the Licensor "as is" and without any
warranty of any kind, either express or implied, whether of title, of
accuracy, of the presence of absence of errors, of fitness for purpose,
or otherwise. Some jurisdictions do not allow the exclusion of implied
warranties, so this exclusion may not apply to You.
3.2 Subject to any liability that may not be excluded or limited by law,
the Licensor is not liable for, and expressly excludes, all liability
for loss or damage however and whenever caused to anyone by any use
under this License, whether by You or by anyone else, and whether caused
by any fault on the part of the Licensor or not. This exclusion of
liability includes, but is not limited to, any special, incidental,
consequential, punitive, or exemplary damages. This exclusion applies
even if the Licensor has been advised of the possibility of such
damages.
3.3 If liability may not be excluded by law, it is limited to actual and
direct financial loss to the extent it is caused by proved negligence on
the part of the Licensor.
Oops, something went wrong.

0 comments on commit db36e38

Please sign in to comment.