[191305] in North American Network Operators' Group

home help back first fref pref prev next nref lref last post

measuring web similarity from dual-stacked hosts

daemon@ATHENA.MIT.EDU (Bajpai, Vaibhav)
Mon Sep 5 08:36:47 2016

X-Original-To: nanog@nanog.org
From: "Bajpai, Vaibhav" <v.bajpai@jacobs-university.de>
To: "nanog@nanog.org" <nanog@nanog.org>
Date: Mon, 5 Sep 2016 12:36:35 +0000
Errors-To: nanog-bounces@nanog.org

--Apple-Mail=_207B06BC-14E3-49FA-925F-27EFEAB1A3BE
X-Clacks-Overhead: GNU Terry Pratchett
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8

Dear NANOG,

Measuring Web Similarity from Dual-stacked Hosts
------------------------------------------------

How similar are the webpages accessed over IPv6 to their IPv4 =
counterparts? =E2=80=93
In situations where the content is dissimilar over IPv4 and IPv6, what =
factors
contribute to the dissimilarity?

To answer ^ we developed a tool (simweb) and deployed it on 80 =
geographically
distributed dual-stacked SamKnows probes. A paper presenting results =
from the
collected dataset got accepted recently. We just released the tool and =
the
paper [a]. Thought to share it along.

[a] http://goo.gl/sAsDcG

Feedback most welcome!
You may recall a presentation of this work at RIPE 72 [b].

[b] https://ripe72.ripe.net/archives/video/126

Abstract
--------

We compare the similarity of webpages delivered over IPv4 and IPv6. =
Using the
SamKnows web performance (webget) test, we implemented an extension =
(simweb)
that allows us to measure the similarity of webpages. The simweb test =
measures
against ALEXA top 100 dual-stacked websites from 80 SamKnows probes =
connected
to dual-stacked networks representing 58 different ASes. Using a two
months-long dataset we show that 14% of these dual-stacked websites =
exhibit a
dissimilarity in the number of fetched webpage elements, with 94% of =
them
exhibiting a dissimilarity in their size. We show that 6% of these =
websites
announce AAAA entries in the DNS but no content is delivered over IPv6 =
when an
HTTP request is made. We also noticed several cases where not all =
webpage
elements (such as images, javascript and CSS) of a dual-stacked website =
are
available over IPv6. We show that 27% of the dual-stacked websites have =
some
fraction of webpage elements that fail over IPv6, with 9% of the =
websites
having more than 50% webpage elements that fail over IPv6. We perform a
causality analysis and also identify sources for these failing elements. =
We
show that 12% of these websites have more than 50% webpage elements that
belong to the same origin source and fail over IPv6. Failure rates are =
largely
affected by DNS resolution error on images, javascript and CSS content
delivered from both same-origin and cross-origin sources. These failures =
tend
to cripple experience for users behind an IPv6-only network and a
quantification of failure cases may help improve IPv6 adoption on the =
Internet.

-- Vaibhav

=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D
Vaibhav Bajpai
www.vaibhavbajpai.com

Postdoctoral Researcher
Jacobs University Bremen, Germany
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D


--Apple-Mail=_207B06BC-14E3-49FA-925F-27EFEAB1A3BE
X-Clacks-Overhead: GNU Terry Pratchett
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment; filename="signature.asc"
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: Message signed with OpenPGP using GPGMail

-----BEGIN PGP SIGNATURE-----
Comment: GPGTools - https://gpgtools.org

iQEcBAEBCgAGBQJXzWbTAAoJEHR3XKwTWKOZRY8H+gINjCbcGeDwKugIIEg6WIFx
tfqGXu5mTpS7QLgojeBH5LuzQ/R/V+12jWyUqZSQ0+ExASXH1NSBT56jfYnmV32B
GZv4OmDd90CSveR9TBKT0zOfsAkBZqEi9mZLNBsrAPMTAFaamUtvJ6i6hXptEgRv
l6USHnFbUm4VIj+Kt86FcUmghT2WnmMMzTFtJOJ51ySZSkHCgXqCFPh7uTEdZvV6
zcb4mveKSYSXyTW2oGxM/g+YQqFcXc24KdUJCwDzQ/0SaaGgEPk0993TS2Y1sh44
G5sKoJEuOIrPe7FovXojCnEp2oUxTN3sA2SqwPe78X3FQsZSho58k2CNxIkPX+8=
=JOZ7
-----END PGP SIGNATURE-----

--Apple-Mail=_207B06BC-14E3-49FA-925F-27EFEAB1A3BE--

home help back first fref pref prev next nref lref last post