grammarware/fodder-css

CSS Corpus

  • Composed and maintained by Nico de Groot and Vadim Zaytsev
  • Contains complete CSS style sheets collected from top websites (unlike other corpora, which rely on CSS stored on GitHub)
  • CC-BY: open content, but please credit us as the source

CSS style sheets from the Alexa Top 500, as of 30.03.2017:

  1. google.com CSS
  2. youtube.com CSS
  3. facebook.com CSS
  4. baidu.com CSS
  5. wikipedia.org CSS
  6. yahoo.com CSS
  7. google.co.in — removed (duplicate of google.com)
  8. reddit.com CSS
  9. qq.com CSS
  10. taobao.com CSS
  11. amazon.com CSS
  12. tmall.com CSS
  13. google.co.jp — removed (duplicate of google.com)
  14. sohu.com CSS
  15. twitter.com CSS
  16. live.com CSS
  17. vk.com CSS
  18. instagram.com CSS
  19. sina.com.cn CSS
  20. 360.cn CSS
  21. jd.com CSS
  22. google.de — removed (duplicate of google.com)
  23. linkedin.com CSS
  24. google.co.uk — removed (duplicate of google.com)
  25. weibo.com CSS
  26. google.fr — removed (duplicate of google.com)
  27. google.ru — removed (duplicate of google.com)
  28. google.com.br — removed (duplicate of google.com)
  29. yahoo.co.jp — removed (duplicate of yahoo.com)
  30. yandex.ru CSS
  31. hao123.com CSS
  32. google.com.hk — removed (duplicate of google.com)
  33. netflix.com CSS
  34. t.co CSS
  35. imgur.com CSS
  36. google.it — removed (duplicate of google.com)
  37. ebay.com CSS
  38. pornhub.com CSS
  39. google.es — removed (duplicate of google.com)
  40. detail.tmall.com — removed (duplicate of tmall.com)
  41. wordpress.com CSS
  42. msn.com CSS
  43. bing.com CSS
  44. aliexpress.com CSS
  45. livejasmin.com CSS
  46. tumblr.com CSS
  47. google.ca — removed (duplicate of google.com)
  48. microsoft.com CSS
  49. stackoverflow.com CSS
  50. ok.ru CSS
  51. twitch.tv CSS
  52. google.com.mx — removed (duplicate of google.com)
  53. ntd.tv CSS
  54. imdb.com CSS
  55. blogspot.com — removed (duplicate of blogger.com)
  56. office.com CSS
  57. onclkds.com CSS
  58. melyweb.vn CSS
  59. amazon.co.jp — removed (duplicate of amazon.com)
  60. github.com CSS
  61. microsoftonline.com CSS
  62. apple.com CSS
  63. popads.net CSS
  64. diply.com CSS
  65. tianya.cn CSS
  66. mail.ru CSS
  67. pinterest.com CSS
  68. csdn.net CSS
  69. wikia.com CSS
  70. google.com.tr — removed (duplicate of google.com)
  71. google.com.au — removed (duplicate of google.com)
  72. google.com.tw — removed (duplicate of google.com)
  73. alipay.com CSS
  74. paypal.com CSS
  75. duyendangvietnam.net.vn CSS
  76. service.tmall.com — removed (duplicate of tmall.com)
  77. adobe.com CSS
  78. whatsapp.com CSS
  79. xvideos.com CSS
  80. xhamster.com CSS
  81. pixnet.net CSS
  82. login.tmall.com — removed (duplicate of tmall.com)
  83. soso.com — removed (duplicate of sogou.com)
  84. coccoc.com CSS
  85. txxx.com CSS
  86. bongacams.com CSS
  87. youth.cn — removed (connection timed out)
  88. dropbox.com CSS
  89. google.pl — removed (duplicate of google.com)
  90. amazon.de — removed (duplicate of amazon.com)
  91. googleusercontent.com — removed (server not found)
  92. fc2.com CSS
  93. google.com.eg — removed (duplicate of google.com)
  94. china.com CSS
  95. google.com.sa — removed (duplicate of google.com)
  96. google.co.th — removed (duplicate of google.com)
  97. google.com.pk — removed (duplicate of google.com)
  98. bbc.co.uk CSS
  99. craigslist.org CSS
  100. gmw.cn CSS
  101. google.com.ar — removed (duplicate of google.com)
  102. soundcloud.com CSS
  103. espn.com CSS
  104. thepiratebay.org CSS
  105. amazon.co.uk — removed (duplicate of amazon.com)
  106. amazon.in — removed (duplicate of amazon.com)
  107. adf.ly CSS
  108. cnn.com CSS
  109. bbc.com — removed (duplicate of bbc.co.uk)
  110. google.nl — removed (duplicate of google.com)
  111. google.co.id — removed (duplicate of google.com)
  112. ettoday.net CSS
  113. porn555.com CSS
  114. uptodown.com CSS
  115. list.tmall.com — removed (duplicate of tmall.com)
  116. booking.com CSS
  117. dailymotion.com CSS
  118. rakuten.co.jp CSS
  119. vimeo.com CSS
  120. ask.com CSS
  121. nytimes.com CSS
  122. blastingnews.com CSS
  123. amazonaws.com CSS
  124. clicksgear.com CSS
  125. blogger.com CSS
  126. bet365.com CSS
  127. ebay.de — removed (duplicate of ebay.com)
  128. adexchangeprediction.com — removed (403 Forbidden)
  129. quora.com CSS
  130. pages.tmall.com — removed (duplicate of tmall.com)
  131. stackexchange.com CSS
  132. savefrom.net CSS
  133. salesforce.com CSS
  134. daikynguyenvn.com CSS
  135. detik.com CSS
  136. getmyads.com CSS
  137. google.co.ve — removed (duplicate of google.com)
  138. naver.com CSS
  139. google.co.za — removed (duplicate of google.com)
  140. onlinesbi.com — removed (server not found)
  141. ebay.co.uk — removed (duplicate of ebay.com)
  142. vice.com CSS
  143. slideshare.net CSS
  144. so.com CSS
  145. huaban.com CSS
  146. theguardian.com CSS
  147. spotify.com CSS
  148. buzzfeed.com CSS
  149. google.com.vn — removed (duplicate of google.com)
  150. askcom.me CSS
  151. fbcdn.net — removed (duplicate of facebook.com)
  152. alibaba.com CSS
  153. tribunnews.com CSS
  154. chase.com CSS
  155. nicovideo.jp CSS
  156. cnet.com CSS
  157. google.gr — removed (duplicate of google.com)
  158. xinhuanet.com CSS
  159. avito.ru CSS
  160. chaturbate.com CSS
  161. indeed.com CSS
  162. wikihow.com CSS
  163. dailymail.co.uk CSS
  164. google.com.co — removed (duplicate of google.com)
  165. chinadaily.com.cn CSS
  166. google.com.ph — removed (duplicate of google.com)
  167. softonic.com CSS
  168. uol.com.br CSS
  169. 9gag.com CSS
  170. google.be — removed (duplicate of google.com)
  171. nih.gov CSS
  172. deviantart.com CSS
  173. globo.com CSS
  174. mediafire.com CSS
  175. cctv.com CSS
  176. thewhizmarketing.com CSS
  177. google.se — removed (duplicate of google.com)
  178. google.com.sg — removed (duplicate of google.com)
  179. github.io CSS
  180. flipkart.com CSS
  181. w3schools.com CSS
  182. steamcommunity.com CSS
  183. wittyfeed.com CSS
  184. google.ro — removed (duplicate of google.com)
  185. washingtonpost.com CSS
  186. popcash.net CSS
  187. google.com.ua — removed (duplicate of google.com)
  188. godaddy.com CSS
  189. zhihu.com CSS
  190. force.com — removed (duplicate of salesforce.com)
  191. gfycat.com CSS
  192. theladbible.com CSS
  193. blogspot.co.id — removed (duplicate of blogger.com)
  194. google.az — removed (duplicate of google.com)
  195. rambler.ru CSS
  196. sogou.com CSS
  197. varzesh3.com CSS
  198. etsy.com CSS
  199. huffingtonpost.com CSS
  200. xnxx.com CSS
  201. mozilla.org CSS
  202. steampowered.com CSS
  203. china.com.cn CSS
  204. slack.com CSS
  205. google.at — removed (duplicate of google.com)
  206. pinimg.com — removed (duplicate of pinterest.com)
  207. upornia.com CSS
  208. google.co.kr — removed (duplicate of google.com)
  209. openload.co CSS
  210. weather.com CSS
  211. google.com.ng — removed (duplicate of google.com)
  212. walmart.com CSS
  213. google.cn — removed (duplicate of google.com)
  214. roblox.com CSS
  215. google.com.pe — removed (duplicate of google.com)
  216. indiatimes.com CSS
  217. twimg.com — removed (duplicate of twitter.com)
  218. 4chan.org CSS
  219. google.cz — removed (duplicate of google.com)
  220. yelp.com CSS
  221. hclips.com CSS
  222. reimageplus.com CSS
  223. cnblogs.com CSS
  224. livedoor.jp CSS
  225. google.ch — removed (duplicate of google.com)
  226. kinogo.club CSS
  227. bankofamerica.com CSS
  228. cnzz.com CSS
  229. youm7.com CSS
  230. redtube.com CSS
  231. myway.com CSS
  232. 123movies.is CSS
  233. trello.com CSS
  234. prestoris.com CSS
  235. google.cl — removed (duplicate of google.com)
  236. mercadolivre.com.br CSS
  237. k618.cn CSS
  238. rolloid.net CSS
  239. abs-cbn.com CSS
  240. tokopedia.com CSS
  241. iqiyi.com CSS
  242. amazon.it — removed (duplicate of amazon.com)
  243. amazon.fr — removed (duplicate of amazon.com)
  244. bp.blogspot.com — removed (duplicate of blogger.com)
  245. doublepimp.com CSS
  246. thesaurus.com CSS
  247. tripadvisor.com CSS
  248. google.no — removed (duplicate of google.com)
  249. outbrain.com CSS
  250. messenger.com CSS
  251. google.ae — removed (duplicate of google.com)
  252. wellsfargo.com CSS
  253. wordreference.com CSS
  254. weebly.com CSS
  255. google.dz — removed (duplicate of google.com)
  256. 1688.com CSS
  257. douyu.com CSS
  258. doubleclick.net CSS
  259. mama.cn CSS
  260. gamepedia.com CSS
  261. rarbg.to CSS
  262. google.pt — removed (duplicate of google.com)
  263. breitbart.com CSS
  264. spankbang.com CSS
  265. ameblo.jp CSS
  266. zillow.com CSS
  267. skype.com CSS
  268. livejournal.com CSS
  269. forbes.com CSS
  270. redd.it — removed (duplicate of reddit.com)
  271. lifebuzz.com CSS
  272. archive.org CSS
  273. liputan6.com CSS
  274. battle.net CSS
  275. yts.ag CSS
  276. sharepoint.com CSS
  277. files.wordpress.com — removed (duplicate of wordpress.com)
  278. youporn.com CSS
  279. 39.net CSS
  280. ltn.com.tw CSS
  281. ign.com CSS
  282. ikea.com CSS
  283. giphy.com CSS
  284. allegro.pl CSS
  285. clarins.tmall.com — removed (duplicate of tmall.com)
  286. babytree.com CSS
  287. discordapp.com CSS
  288. google.hu — removed (duplicate of google.com)
  289. foxnews.com CSS
  290. feedly.com CSS
  291. kinopoisk.ru CSS
  292. sourceforge.net CSS
  293. intuit.com CSS
  294. wetransfer.com CSS
  295. shutterstock.com CSS
  296. bilibili.com CSS
  297. google.ie — removed (duplicate of google.com)
  298. ebay-kleinanzeigen.de CSS
  299. aol.com CSS
  300. iwanttodeliver.com — removed (parked)
  301. wordpress.org CSS
  302. kakaku.com CSS
  303. wikimedia.org CSS
  304. rutracker.org CSS
  305. dingit.tv CSS
  306. yesky.com CSS
  307. webtretho.com CSS
  308. eastday.com CSS
  309. google.co.il — removed (duplicate of google.com)
  310. google.dk — removed (duplicate of google.com)
  311. freepik.com CSS
  312. blackboard.com CSS
  313. leboncoin.fr CSS
  314. trackmedia101.com — removed (error "Route with parameter not found")
  315. scribd.com CSS
  316. youtube-mp3.org CSS
  317. go2cloud.org CSS
  318. blogspot.in — removed (duplicate of blogger.com)
  319. speedtest.net CSS
  320. espncricinfo.com CSS
  321. irctc.co.in CSS
  322. taringa.net CSS
  323. bestbuy.com CSS
  324. 163.com CSS
  325. businessinsider.com CSS
  326. bukalapak.com CSS
  327. sberbank.ru CSS
  328. digikala.com CSS
  329. google.fi — removed (duplicate of google.com)
  330. kompas.com CSS
  331. onclickpredictiv.com — removed (403 Forbidden)
  332. sabah.com.tr CSS
  333. web.de CSS
  334. youku.com CSS
  335. oracle.com CSS
  336. google.co.ao — removed (duplicate of google.com)
  337. ouo.io CSS
  338. asos.com CSS
  339. flickr.com CSS
  340. gmx.net CSS
  341. gearbest.com CSS
  342. mailchimp.com CSS
  343. hotstar.com CSS
  344. extratorrent.cc CSS
  345. huanqiu.com CSS
  346. redirectvoluum.com — removed (server not found)
  347. kissanime.ru CSS
  348. goodreads.com CSS
  349. genius.com CSS
  350. researchgate.net CSS
  351. naver.jp CSS
  352. amazon.es — removed (duplicate of amazon.com)
  353. gamefaqs.com CSS
  354. zippyshare.com CSS
  355. dictionary.com CSS
  356. caijing.com.cn CSS
  357. exoclick.com CSS
  358. google.sk — removed (duplicate of google.com)
  359. providr.com CSS
  360. kaskus.co.id CSS
  361. mega.nz CSS
  362. airbnb.com CSS
  363. enet.com.cn CSS
  364. onedio.com CSS
  365. hp.com CSS
  366. baike.com CSS
  367. thewhizproducts.com — removed (parked)
  368. pandora.com CSS
  369. goo.ne.jp CSS
  370. daum.net CSS
  371. behance.net CSS
  372. spotscenered.info CSS
  373. hotmovs.com CSS
  374. medium.com CSS
  375. orange.fr CSS
  376. nametests.com CSS
  377. appspot.com CSS
  378. nownews.com CSS
  379. xfinity.com CSS
  380. instructure.com CSS
  381. 2ch.net CSS
  382. hatenablog.com CSS
  383. hola.com CSS
  384. subscene.com CSS
  385. sciencedirect.com CSS
  386. evernote.com CSS
  387. accuweather.com CSS
  388. independent.co.uk CSS
  389. ndtv.com CSS
  390. hatena.ne.jp CSS
  391. conservativetribune.com CSS
  392. telegraph.co.uk CSS
  393. rt.com CSS
  394. zendesk.com CSS
  395. themeforest.net CSS
  396. aliyun.com CSS
  397. icloud.com CSS
  398. amazon.cn — removed (duplicate of amazon.com)
  399. media.tumblr.com — removed (duplicate of tumblr.com)
  400. amazon.ca — removed (duplicate of amazon.com)
  401. onet.pl CSS
  402. gyazo.com CSS
  403. telegram.org CSS
  404. youdao.com CSS
  405. onlinevideoconverter.com CSS
  406. rumble.com CSS
  407. cricbuzz.com CSS
  408. go.com CSS
  409. ibm.com CSS
  410. atlassian.net CSS
  411. wix.com CSS
  412. streamable.com CSS
  413. taleo.net — removed (duplicate of oracle.com)
  414. usps.com CSS
  415. ensonhaber.com CSS
  416. samsung.com CSS
  417. ci123.com CSS
  418. beeg.com CSS
  419. mit.edu CSS
  420. hdfcbank.com CSS
  421. taboola.com CSS
  422. seasonvar.ru CSS
  423. spiegel.de CSS
  424. codeonclick.com — removed (parked)
  425. shopify.com CSS
  426. aparat.com CSS
  427. target.com CSS
  428. google.kz — removed (duplicate of google.com)
  429. rottentomatoes.com CSS
  430. tutorialspoint.com CSS
  431. ebay.it — removed (duplicate of ebay.com)
  432. fiverr.com CSS
  433. repubblica.it CSS
  434. bloomberg.com CSS
  435. oeeee.com CSS
  436. prezi.com CSS
  437. free.fr CSS
  438. ups.com CSS
  439. gizmodo.com CSS
  440. hespress.com CSS
  441. box.com CSS
  442. hulu.com CSS
  443. americanexpress.com CSS
  444. leagueoflegends.com CSS
  445. reverso.net CSS
  446. huawei.tmall.com — removed (duplicate of tmall.com)
  447. 4dsply.com CSS
  448. mercadolibre.com.ar CSS
  449. homedepot.com CSS
  450. cloudfront.net — removed (server not found)
  451. teepr.com CSS
  452. haber7.com CSS
  453. utorrent.com CSS
  454. kapanlagi.com CSS
  455. wp.pl CSS
  456. tfetimes.com CSS
  457. google.com.kw — removed (duplicate of google.com)
  458. dell.com CSS
  459. canva.com CSS
  460. patch.com CSS
  461. cbssports.com CSS
  462. azlyrics.com CSS
  463. qiita.com CSS
  464. paytm.com CSS
  465. ultimate-guitar.com CSS
  466. capitalone.com CSS
  467. urbandictionary.com CSS
  468. pwwysydh.com — removed (404 Not Found)
  469. seznam.cz CSS
  470. elpais.com CSS
  471. billdesk.com — removed (table layout without CSS)
  472. udemy.com CSS
  473. videodownloadconverter.com CSS
  474. kooora.com CSS
  475. line.me CSS
  476. marca.com CSS
  477. momoshop.com.tw CSS
  478. webmd.com CSS
  479. rednet.cn CSS
  480. kickstarter.com CSS
  481. playstation.com CSS
  482. nur.kz CSS
  483. upwork.com CSS
  484. perfecttoolmedia.com CSS
  485. icicibank.com CSS
  486. biobiochile.cl CSS
  487. pairade.com CSS
  488. gsmarena.com CSS
  489. setn.com CSS
  490. hilltopads.net CSS
  491. shink.in CSS
  492. ablogica.com — removed (parked)
  493. livedoor.com — removed (duplicate of livedoor.jp)
  494. rediff.com CSS
  495. nike.com CSS
  496. usatoday.com CSS
  497. gismeteo.ru CSS
  498. google.bg — removed (duplicate of google.com)
  499. ck101.com CSS
  500. vporn.com CSS
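
The list above mixes sites that are in the corpus with sites that were dropped for a stated reason. The following Python sketch shows one way to tally the two groups from the list text. It is not part of the corpus tooling: the names `ENTRY`, `parse_entry` and `summarize` are hypothetical, and the line format (rank, domain, then either a "CSS" link label or a "removed" note in parentheses) is an assumption read off the entries above.

```python
import re
from collections import Counter

# Assumed entry format: "<rank>. <domain>" followed either by a "CSS" link
# label or by " — removed (<reason>)". Spacing before "CSS" is optional.
ENTRY = re.compile(
    r"^\s*(?P<rank>\d+)\.\s+(?P<domain>\S+?)"
    r"(?:\s*CSS|\s+—\s+removed\s+\((?P<reason>.*)\))\s*$"
)

def parse_entry(line):
    """Return (rank, domain, reason); reason is None for sites kept in the corpus."""
    m = ENTRY.match(line)
    if m is None:
        return None
    return int(m["rank"]), m["domain"], m["reason"]

def summarize(lines):
    """Collect kept domains and count removed sites per stated reason."""
    kept, removed = [], Counter()
    for line in lines:
        entry = parse_entry(line)
        if entry is None:
            continue  # not a list entry
        _, domain, reason = entry
        if reason is None:
            kept.append(domain)
        elif reason.startswith("duplicate of"):
            removed["duplicate"] += 1  # collapse all duplicates into one bucket
        else:
            removed[reason] += 1
    return kept, removed

sample = [
    "  1. google.com CSS",
    "  7. google.co.in — removed (duplicate of google.com)",
    "  87. youth.cn — removed (connection timed out)",
]
kept, removed = summarize(sample)
print(kept, dict(removed))
```

Feeding the full list through `summarize` would give the number of sites actually represented in the corpus and a breakdown of why the rest were dropped.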
