Skip to content

Commit

Permalink
Write about undetected external domains, add table and graph. Add "th…
Browse files Browse the repository at this point in the history
…e end" page.
  • Loading branch information
joelpurra committed Feb 3, 2015
1 parent e163e3f commit 8d4ea3a
Show file tree
Hide file tree
Showing 4 changed files with 655 additions and 81 deletions.
73 changes: 73 additions & 0 deletions report/datasets.non-failed.domains.ratios.sorted.tsv
@@ -0,0 +1,73 @@
Dataset Domains Ext dom. Prim. D dom. D diff ext. D diff prim. D/ext. Prim. det. Undet.
alexa.2014-09-01.random.10000-http 8216 14257 7312 704 13553 6608 0.049379252297117204 0.0962800875273523 0.9037199124726477
alexa.2014-09-01.random.10000-http-www 8493 14478 7501 704 13774 6797 0.04862550075977345 0.09385415277962939 0.9061458472203706
alexa.2014-09-01.random.10000-https 1135 3071 1454 370 2701 1084 0.12048192771084337 0.2544704264099037 0.7455295735900963
alexa.2014-09-01.random.10000-https-www 1224 2406 1233 368 2038 865 0.15295095594347466 0.2984590429845904 0.7015409570154096
alexa.2014-09-01.top.10000-http 8545 22212 8335 755 21457 7580 0.03399063569241851 0.09058188362327535 0.9094181163767247
alexa.2014-09-01.top.10000-http-www 8682 22661 8544 760 21901 7784 0.033537796213759324 0.08895131086142322 0.9110486891385767
alexa.2014-09-01.top.10000-https 2507 7217 2909 542 6675 2367 0.07510045725370652 0.18631832244757648 0.8136816775524235
alexa.2014-09-01.top.10000-https-www 2957 8017 3120 569 7448 2551 0.07097417986778097 0.18237179487179486 0.8176282051282051
alexa.2014-09-01.top.dk.10000-http 2263 2768 1407 282 2486 1125 0.101878612716763 0.20042643923240938 0.7995735607675907
alexa.2014-09-01.top.dk.10000-http-www 2310 2850 1483 284 2566 1199 0.09964912280701754 0.19150370869858396 0.808496291301416
alexa.2014-09-01.top.dk.10000-https 339 816 420 151 665 269 0.18504901960784315 0.3595238095238095 0.6404761904761904
alexa.2014-09-01.top.dk.10000-https-www 441 997 516 176 821 340 0.1765295887662989 0.34108527131782945 0.6589147286821706
alexa.2014-09-01.top.se.10000-http 2797 4681 2199 342 4339 1857 0.07306131168553728 0.15552523874488403 0.844474761255116
alexa.2014-09-01.top.se.10000-http-www 2895 4751 2207 351 4400 1856 0.0738791833298253 0.1590394200271862 0.8409605799728138
alexa.2014-09-01.top.se.10000-https 438 990 524 167 823 357 0.1686868686868687 0.3187022900763359 0.6812977099236641
alexa.2014-09-01.top.se.10000-https-www 650 1237 651 199 1038 452 0.1608730800323363 0.30568356374807987 0.6943164362519201
com.2014-08-29.random.10000-http 7775 6329 3713 404 5925 3309 0.06383314899668194 0.10880689469431726 0.8911931053056827
com.2014-08-29.random.10000-http-www 7811 6339 3717 405 5934 3312 0.06389020350212968 0.1089588377723971 0.8910411622276029
com.2014-08-29.random.10000-https 50 127 84 47 80 37 0.3700787401574803 0.5595238095238095 0.44047619047619047
com.2014-08-29.random.10000-https-www 55 163 99 49 114 50 0.3006134969325153 0.494949494949495 0.505050505050505
dk.2014-07-23.random.10000-http 7180 4272 2834 278 3994 2556 0.0650749063670412 0.09809456598447425 0.9019054340155257
dk.2014-07-23.random.10000-http-www 7378 4378 2894 275 4103 2619 0.06281407035175879 0.09502418797512094 0.9049758120248791
dk.2014-07-23.random.10000-https 23 52 33 26 26 7 0.5 0.7878787878787878 0.21212121212121215
dk.2014-07-23.random.10000-https-www 32 81 54 32 49 22 0.3950617283950617 0.5925925925925926 0.40740740740740744
net.2014-08-29.random.10000-http 7270 6206 3806 412 5794 3394 0.06638736706413148 0.10825013137151865 0.8917498686284814
net.2014-08-29.random.10000-http-www 7378 6311 3889 411 5900 3478 0.06512438599271114 0.10568269478014913 0.8943173052198509
net.2014-08-29.random.10000-https 26 49 26 21 28 5 0.42857142857142855 0.8076923076923077 0.1923076923076923
net.2014-08-29.random.10000-https-www 28 62 34 27 35 7 0.43548387096774194 0.7941176470588235 0.20588235294117652
reach50.2014w35.se-http 43 339 195 92 247 103 0.2713864306784661 0.4717948717948718 0.5282051282051282
reach50.2014w35.se-http-www 42 342 194 92 250 102 0.26900584795321636 0.4742268041237113 0.5257731958762887
reach50.2014w35.se-https 18 117 66 41 76 25 0.3504273504273504 0.6212121212121212 0.3787878787878788
reach50.2014w35.se-https-www 26 139 83 40 99 43 0.28776978417266186 0.4819277108433735 0.5180722891566265
se.2014-07-10.random.100000-http 73605 24289 15746 496 23793 15250 0.020420766602165588 0.03150006350819256 0.9684999364918074
se.2014-07-10.random.100000-http-www 77261 25366 16546 502 24864 16044 0.019790270440747458 0.030339659132116524 0.9696603408678834
se.2014-07-10.random.100000-https 282 393 235 94 299 141 0.23918575063613232 0.4 0.6
se.2014-07-10.random.100000-https-www 328 546 340 124 422 216 0.2271062271062271 0.36470588235294116 0.6352941176470588
se.healthstatus.2013.counties-http 18 34 23 10 24 13 0.29411764705882354 0.43478260869565216 0.5652173913043479
se.healthstatus.2013.counties-http-www 21 39 27 11 28 16 0.28205128205128205 0.4074074074074074 0.5925925925925926
se.healthstatus.2013.counties-https 3 6 5 2 4 3 0.3333333333333333 0.4 0.6
se.healthstatus.2013.counties-https-www 6 15 11 4 11 7 0.26666666666666666 0.36363636363636365 0.6363636363636364
se.healthstatus.2013.domain-registrars-http 127 216 148 66 150 82 0.3055555555555556 0.44594594594594594 0.5540540540540541
se.healthstatus.2013.domain-registrars-http-www 134 214 144 62 152 82 0.2897196261682243 0.4305555555555556 0.5694444444444444
se.healthstatus.2013.domain-registrars-https 40 124 86 46 78 40 0.3709677419354839 0.5348837209302325 0.4651162790697675
se.healthstatus.2013.domain-registrars-https-www 42 116 79 40 76 39 0.3448275862068966 0.5063291139240507 0.49367088607594933
se.healthstatus.2013.financial-services-http 67 137 97 49 88 48 0.35766423357664234 0.5051546391752577 0.4948453608247423
se.healthstatus.2013.financial-services-http-www 72 144 97 50 94 47 0.3472222222222222 0.5154639175257731 0.48453608247422686
se.healthstatus.2013.financial-services-https 16 47 37 24 23 13 0.5106382978723404 0.6486486486486487 0.3513513513513513
se.healthstatus.2013.financial-services-https-www 31 71 50 32 39 18 0.4507042253521127 0.64 0.36
se.healthstatus.2013.gocs-http 49 130 83 45 85 38 0.34615384615384615 0.5421686746987951 0.45783132530120485
se.healthstatus.2013.gocs-http-www 57 150 95 47 103 48 0.31333333333333335 0.49473684210526314 0.5052631578947369
se.healthstatus.2013.gocs-https 4 44 28 21 23 7 0.4772727272727273 0.75 0.25
se.healthstatus.2013.gocs-https-www 9 65 44 27 38 17 0.4153846153846154 0.6136363636363636 0.38636363636363635
se.healthstatus.2013.higher-education-http 40 73 53 24 49 29 0.3287671232876712 0.4528301886792453 0.5471698113207547
se.healthstatus.2013.higher-education-http-www 47 74 52 26 48 26 0.35135135135135137 0.5 0.5
se.healthstatus.2013.higher-education-https 9 38 25 16 22 9 0.42105263157894735 0.64 0.36
se.healthstatus.2013.higher-education-https-www 24 63 45 22 41 23 0.3492063492063492 0.4888888888888889 0.5111111111111111
se.healthstatus.2013.isps-http 18 111 76 47 64 29 0.42342342342342343 0.618421052631579 0.381578947368421
se.healthstatus.2013.isps-http-www 19 135 92 55 80 37 0.4074074074074074 0.5978260869565217 0.40217391304347827
se.healthstatus.2013.isps-https 6 84 63 41 43 22 0.4880952380952381 0.6507936507936508 0.3492063492063492
se.healthstatus.2013.isps-https-www 10 89 66 43 46 23 0.48314606741573035 0.6515151515151515 0.3484848484848485
se.healthstatus.2013.media-http 26 346 190 81 265 109 0.23410404624277456 0.4263157894736842 0.5736842105263158
se.healthstatus.2013.media-http-www 28 378 207 79 299 128 0.20899470899470898 0.38164251207729466 0.6183574879227054
se.healthstatus.2013.media-https 4 102 58 24 78 34 0.23529411764705882 0.41379310344827586 0.5862068965517242
se.healthstatus.2013.media-https-www 5 95 59 28 67 31 0.29473684210526313 0.4745762711864407 0.5254237288135593
se.healthstatus.2013.municipalities-http 249 207 113 39 168 74 0.18840579710144928 0.34513274336283184 0.6548672566371682
se.healthstatus.2013.municipalities-http-www 271 203 113 39 164 74 0.1921182266009852 0.34513274336283184 0.6548672566371682
se.healthstatus.2013.municipalities-https 44 67 41 18 49 23 0.26865671641791045 0.43902439024390244 0.5609756097560976
se.healthstatus.2013.municipalities-https-www 54 73 42 18 55 24 0.2465753424657534 0.42857142857142855 0.5714285714285714
se.healthstatus.2013.public-authorities-http 170 172 110 48 124 62 0.27906976744186046 0.43636363636363634 0.5636363636363637
se.healthstatus.2013.public-authorities-http-www 203 170 111 48 122 63 0.2823529411764706 0.43243243243243246 0.5675675675675675
se.healthstatus.2013.public-authorities-https 18 32 21 9 23 12 0.28125 0.42857142857142855 0.5714285714285714
se.healthstatus.2013.public-authorities-https-www 37 63 41 23 40 18 0.36507936507936506 0.5609756097560976 0.4390243902439024
146 changes: 73 additions & 73 deletions report/datasets.non-failed.requests.counts.sorted.tsv
@@ -1,73 +1,73 @@
Dataset Domains w/ int w/ ext All requests Int Ext Disco.
alexa.2014-09-01.random.10000-http 8216 7829 7591 610150 343214 266936 166702
alexa.2014-09-01.random.10000-http-www 8493 8009 7825 627185 355413 271772 169685
alexa.2014-09-01.random.10000-https 1135 1084 1072 86124 49816 36308 23599
alexa.2014-09-01.random.10000-https-www 1224 1182 1139 87423 60773 26650 16764
alexa.2014-09-01.top.10000-http 8545 8156 8176 899404 408553 490851 274782
alexa.2014-09-01.top.10000-http-www 8682 8190 8289 912709 415958 496751 276636
alexa.2014-09-01.top.10000-https 2507 2398 2369 207090 93986 113104 67788
alexa.2014-09-01.top.10000-https-www 2957 2849 2801 243323 117947 125376 73239
alexa.2014-09-01.top.dk.10000-http 2263 2182 2136 162059 99234 62825 37832
alexa.2014-09-01.top.dk.10000-http-www 2310 2212 2182 165023 100543 64480 38373
alexa.2014-09-01.top.dk.10000-https 339 325 316 25034 15575 9459 5942
alexa.2014-09-01.top.dk.10000-https-www 441 424 406 29901 18596 11305 6901
alexa.2014-09-01.top.se.10000-http 2797 2687 2684 209012 117914 91098 52345
alexa.2014-09-01.top.se.10000-http-www 2895 2756 2779 212700 121479 91221 52398
alexa.2014-09-01.top.se.10000-https 438 427 422 32271 19899 12372 7104
alexa.2014-09-01.top.se.10000-https-www 650 636 630 45199 28286 16913 9510
com.2014-08-29.random.10000-http 7775 5575 6222 226636 76167 150469 55666
com.2014-08-29.random.10000-http-www 7811 5546 6241 230039 78086 151953 55955
com.2014-08-29.random.10000-https 50 45 41 2251 1654 597 446
com.2014-08-29.random.10000-https-www 55 54 43 2650 1930 720 477
dk.2014-07-23.random.10000-http 7180 4648 4626 187706 80787 106919 36822
dk.2014-07-23.random.10000-http-www 7378 4763 4773 190186 82052 108134 35960
dk.2014-07-23.random.10000-https 23 22 16 902 725 177 150
dk.2014-07-23.random.10000-https-www 32 29 22 1337 921 416 257
net.2014-08-29.random.10000-http 7270 4871 5757 192646 56364 136282 48379
net.2014-08-29.random.10000-http-www 7378 4867 5839 196301 58205 138096 49471
net.2014-08-29.random.10000-https 26 26 16 1299 1071 228 203
net.2014-08-29.random.10000-https-www 28 25 20 1568 1210 358 291
reach50.2014w35.se-http 43 41 43 3898 1313 2585 843
reach50.2014w35.se-http-www 42 39 42 3645 1135 2510 801
reach50.2014w35.se-https 18 16 17 1092 264 828 265
reach50.2014w35.se-https-www 26 23 26 1436 455 981 303
se.2014-07-10.random.100000-http 73605 43216 54882 1931501 782998 1148503 395347
se.2014-07-10.random.100000-http-www 77261 45312 57547 2006337 807160 1199177 406990
se.2014-07-10.random.100000-https 282 263 226 14140 10726 3414 1962
se.2014-07-10.random.100000-https-www 328 311 285 17686 13057 4629 2451
se.healthstatus.2013.counties-http 18 18 18 921 726 195 105
se.healthstatus.2013.counties-http-www 21 20 21 1066 809 257 133
se.healthstatus.2013.counties-https 3 3 3 156 137 19 7
se.healthstatus.2013.counties-https-www 6 6 5 240 191 49 20
se.healthstatus.2013.domain-registrars-http 127 108 108 6418 4459 1959 886
se.healthstatus.2013.domain-registrars-http-www 134 114 113 6627 4565 2062 872
se.healthstatus.2013.domain-registrars-https 40 39 34 2342 1620 722 430
se.healthstatus.2013.domain-registrars-https-www 42 40 36 2439 1833 606 327
se.healthstatus.2013.financial-services-http 67 61 67 3260 2319 941 378
se.healthstatus.2013.financial-services-http-www 72 64 71 3518 2491 1027 415
se.healthstatus.2013.financial-services-https 16 15 16 881 696 185 95
se.healthstatus.2013.financial-services-https-www 31 30 31 1504 1148 356 228
se.healthstatus.2013.gocs-http 49 44 48 2585 1746 839 501
se.healthstatus.2013.gocs-http-www 57 50 55 2925 1894 1031 577
se.healthstatus.2013.gocs-https 4 4 4 321 195 126 64
se.healthstatus.2013.gocs-https-www 9 9 9 567 377 190 91
se.healthstatus.2013.higher-education-http 40 39 38 2064 1685 379 270
se.healthstatus.2013.higher-education-http-www 47 46 44 2305 1886 419 308
se.healthstatus.2013.higher-education-https 9 9 9 571 442 129 104
se.healthstatus.2013.higher-education-https-www 24 24 24 1291 1038 253 182
se.healthstatus.2013.isps-http 18 17 18 1150 757 393 271
se.healthstatus.2013.isps-http-www 19 19 19 1209 735 474 317
se.healthstatus.2013.isps-https 6 6 6 523 323 200 152
se.healthstatus.2013.isps-https-www 10 10 10 669 448 221 163
se.healthstatus.2013.media-http 26 24 25 4812 1596 3216 1101
se.healthstatus.2013.media-http-www 28 25 27 5507 1676 3831 1234
se.healthstatus.2013.media-https 4 4 4 977 202 775 186
se.healthstatus.2013.media-https-www 5 5 5 868 316 552 204
se.healthstatus.2013.municipalities-http 249 249 239 14028 10162 3866 2367
se.healthstatus.2013.municipalities-http-www 271 270 258 14749 10827 3922 2447
se.healthstatus.2013.municipalities-https 44 44 41 2603 2047 556 305
se.healthstatus.2013.municipalities-https-www 54 54 50 3001 2347 654 394
se.healthstatus.2013.public-authorities-http 170 153 162 7423 5403 2020 935
se.healthstatus.2013.public-authorities-http-www 203 182 188 8297 6123 2174 945
se.healthstatus.2013.public-authorities-https 18 18 15 664 575 89 64
se.healthstatus.2013.public-authorities-https-www 37 37 36 1596 1315 281 200
Dataset Domains w/ int w/ ext Ext dom. Ext prim. Ext D dom. All requests Int Ext Disco.
alexa.2014-09-01.random.10000-http 8216 7829 7591 14257 7312 704 610150 343214 266936 166702
alexa.2014-09-01.random.10000-http-www 8493 8009 7825 14478 7501 704 627185 355413 271772 169685
alexa.2014-09-01.random.10000-https 1135 1084 1072 3071 1454 370 86124 49816 36308 23599
alexa.2014-09-01.random.10000-https-www 1224 1182 1139 2406 1233 368 87423 60773 26650 16764
alexa.2014-09-01.top.10000-http 8545 8156 8176 22212 8335 755 899404 408553 490851 274782
alexa.2014-09-01.top.10000-http-www 8682 8190 8289 22661 8544 760 912709 415958 496751 276636
alexa.2014-09-01.top.10000-https 2507 2398 2369 7217 2909 542 207090 93986 113104 67788
alexa.2014-09-01.top.10000-https-www 2957 2849 2801 8017 3120 569 243323 117947 125376 73239
alexa.2014-09-01.top.dk.10000-http 2263 2182 2136 2768 1407 282 162059 99234 62825 37832
alexa.2014-09-01.top.dk.10000-http-www 2310 2212 2182 2850 1483 284 165023 100543 64480 38373
alexa.2014-09-01.top.dk.10000-https 339 325 316 816 420 151 25034 15575 9459 5942
alexa.2014-09-01.top.dk.10000-https-www 441 424 406 997 516 176 29901 18596 11305 6901
alexa.2014-09-01.top.se.10000-http 2797 2687 2684 4681 2199 342 209012 117914 91098 52345
alexa.2014-09-01.top.se.10000-http-www 2895 2756 2779 4751 2207 351 212700 121479 91221 52398
alexa.2014-09-01.top.se.10000-https 438 427 422 990 524 167 32271 19899 12372 7104
alexa.2014-09-01.top.se.10000-https-www 650 636 630 1237 651 199 45199 28286 16913 9510
com.2014-08-29.random.10000-http 7775 5575 6222 6329 3713 404 226636 76167 150469 55666
com.2014-08-29.random.10000-http-www 7811 5546 6241 6339 3717 405 230039 78086 151953 55955
com.2014-08-29.random.10000-https 50 45 41 127 84 47 2251 1654 597 446
com.2014-08-29.random.10000-https-www 55 54 43 163 99 49 2650 1930 720 477
dk.2014-07-23.random.10000-http 7180 4648 4626 4272 2834 278 187706 80787 106919 36822
dk.2014-07-23.random.10000-http-www 7378 4763 4773 4378 2894 275 190186 82052 108134 35960
dk.2014-07-23.random.10000-https 23 22 16 52 33 26 902 725 177 150
dk.2014-07-23.random.10000-https-www 32 29 22 81 54 32 1337 921 416 257
net.2014-08-29.random.10000-http 7270 4871 5757 6206 3806 412 192646 56364 136282 48379
net.2014-08-29.random.10000-http-www 7378 4867 5839 6311 3889 411 196301 58205 138096 49471
net.2014-08-29.random.10000-https 26 26 16 49 26 21 1299 1071 228 203
net.2014-08-29.random.10000-https-www 28 25 20 62 34 27 1568 1210 358 291
reach50.2014w35.se-http 43 41 43 339 195 92 3898 1313 2585 843
reach50.2014w35.se-http-www 42 39 42 342 194 92 3645 1135 2510 801
reach50.2014w35.se-https 18 16 17 117 66 41 1092 264 828 265
reach50.2014w35.se-https-www 26 23 26 139 83 40 1436 455 981 303
se.2014-07-10.random.100000-http 73605 43216 54882 24289 15746 496 1931501 782998 1148503 395347
se.2014-07-10.random.100000-http-www 77261 45312 57547 25366 16546 502 2006337 807160 1199177 406990
se.2014-07-10.random.100000-https 282 263 226 393 235 94 14140 10726 3414 1962
se.2014-07-10.random.100000-https-www 328 311 285 546 340 124 17686 13057 4629 2451
se.healthstatus.2013.counties-http 18 18 18 34 23 10 921 726 195 105
se.healthstatus.2013.counties-http-www 21 20 21 39 27 11 1066 809 257 133
se.healthstatus.2013.counties-https 3 3 3 6 5 2 156 137 19 7
se.healthstatus.2013.counties-https-www 6 6 5 15 11 4 240 191 49 20
se.healthstatus.2013.domain-registrars-http 127 108 108 216 148 66 6418 4459 1959 886
se.healthstatus.2013.domain-registrars-http-www 134 114 113 214 144 62 6627 4565 2062 872
se.healthstatus.2013.domain-registrars-https 40 39 34 124 86 46 2342 1620 722 430
se.healthstatus.2013.domain-registrars-https-www 42 40 36 116 79 40 2439 1833 606 327
se.healthstatus.2013.financial-services-http 67 61 67 137 97 49 3260 2319 941 378
se.healthstatus.2013.financial-services-http-www 72 64 71 144 97 50 3518 2491 1027 415
se.healthstatus.2013.financial-services-https 16 15 16 47 37 24 881 696 185 95
se.healthstatus.2013.financial-services-https-www 31 30 31 71 50 32 1504 1148 356 228
se.healthstatus.2013.gocs-http 49 44 48 130 83 45 2585 1746 839 501
se.healthstatus.2013.gocs-http-www 57 50 55 150 95 47 2925 1894 1031 577
se.healthstatus.2013.gocs-https 4 4 4 44 28 21 321 195 126 64
se.healthstatus.2013.gocs-https-www 9 9 9 65 44 27 567 377 190 91
se.healthstatus.2013.higher-education-http 40 39 38 73 53 24 2064 1685 379 270
se.healthstatus.2013.higher-education-http-www 47 46 44 74 52 26 2305 1886 419 308
se.healthstatus.2013.higher-education-https 9 9 9 38 25 16 571 442 129 104
se.healthstatus.2013.higher-education-https-www 24 24 24 63 45 22 1291 1038 253 182
se.healthstatus.2013.isps-http 18 17 18 111 76 47 1150 757 393 271
se.healthstatus.2013.isps-http-www 19 19 19 135 92 55 1209 735 474 317
se.healthstatus.2013.isps-https 6 6 6 84 63 41 523 323 200 152
se.healthstatus.2013.isps-https-www 10 10 10 89 66 43 669 448 221 163
se.healthstatus.2013.media-http 26 24 25 346 190 81 4812 1596 3216 1101
se.healthstatus.2013.media-http-www 28 25 27 378 207 79 5507 1676 3831 1234
se.healthstatus.2013.media-https 4 4 4 102 58 24 977 202 775 186
se.healthstatus.2013.media-https-www 5 5 5 95 59 28 868 316 552 204
se.healthstatus.2013.municipalities-http 249 249 239 207 113 39 14028 10162 3866 2367
se.healthstatus.2013.municipalities-http-www 271 270 258 203 113 39 14749 10827 3922 2447
se.healthstatus.2013.municipalities-https 44 44 41 67 41 18 2603 2047 556 305
se.healthstatus.2013.municipalities-https-www 54 54 50 73 42 18 3001 2347 654 394
se.healthstatus.2013.public-authorities-http 170 153 162 172 110 48 7423 5403 2020 935
se.healthstatus.2013.public-authorities-http-www 203 182 188 170 111 48 8297 6123 2174 945
se.healthstatus.2013.public-authorities-https 18 18 15 32 21 9 664 575 89 64
se.healthstatus.2013.public-authorities-https-www 37 37 36 63 41 23 1596 1315 281 200

0 comments on commit 8d4ea3a

Please sign in to comment.