Parsing Millions of URLs per Second
  • Yagiz Nizipli,
  • Daniel Lemire
Yagiz Nizipli
Universite TELUQ
Daniel Lemire
Universite TELUQ

Corresponding Author: [email protected]

Abstract

URLs are fundamental elements of web applications. By applying vector algorithms, we built a fast, standard-compliant C++ URL parser. Our parser uses three times fewer instructions than competing parsers that follow the WHATWG URL standard (e.g., Servo’s rust-url) and up to eight times fewer instructions than the popular curl parser. The Node.js environment adopted our C++ library. In our tests on realistic data, a recent Node.js version (20.0) with our parser is four to five times faster than the last version with the legacy URL parser.
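
To make concrete what standard-compliant parsing involves, here is a minimal TypeScript sketch that uses the WHATWG `URL` class available as a global in Node.js, the class that, per the abstract, is backed by the authors' parser in Node.js 20. The helper `parseUrl`, the `ParsedUrl` interface, and the sample input are hypothetical names chosen for illustration; the sketch shows only the observable behavior of the standard, not the vectorized C++ implementation.

```typescript
// Components exposed by a WHATWG-compliant parser for one input string.
// (ParsedUrl is a hypothetical type for this illustration.)
interface ParsedUrl {
  href: string;
  protocol: string;
  hostname: string;
  port: string;
  pathname: string;
  search: string;
  hash: string;
}

// Hypothetical helper: returns the parsed components,
// or null when the input is not a valid absolute URL.
function parseUrl(input: string): ParsedUrl | null {
  let url: URL;
  try {
    // The URL constructor throws a TypeError on invalid input.
    url = new URL(input);
  } catch (e) {
    return null;
  }
  const { href, protocol, hostname, port, pathname, search, hash } = url;
  return { href, protocol, hostname, port, pathname, search, hash };
}

// The standard lowercases the scheme and host, drops the default
// port for the scheme, and resolves "." and ".." path segments.
const parsed = parseUrl("HTTPS://Example.COM:443/a/./b/../c?q=1#top");
console.log(parsed?.href); // https://example.com/a/c?q=1#top

// A string without a scheme is not a valid absolute URL.
const invalid = parseUrl("not a url");
console.log(invalid); // null
```

Because normalization of this kind happens on every parse, each input requires several passes of character classification and rewriting, which is the sort of work the abstract reports reducing in instruction count.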
02 Jun 2023 Submitted to Software: Practice and Experience
02 Jun 2023 Submission Checks Completed
02 Jun 2023 Assigned to Editor
02 Jun 2023 Review(s) Completed, Editorial Evaluation Pending
20 Jun 2023 Reviewer(s) Assigned
21 Aug 2023 Editorial Decision: Revise Major
13 Sep 2023 1st Revision Received
13 Sep 2023 Submission Checks Completed
13 Sep 2023 Assigned to Editor
13 Sep 2023 Review(s) Completed, Editorial Evaluation Pending
16 Sep 2023 Reviewer(s) Assigned
17 Oct 2023 Editorial Decision: Revise Minor
31 Oct 2023 2nd Revision Received
01 Nov 2023 Submission Checks Completed
01 Nov 2023 Assigned to Editor
01 Nov 2023 Review(s) Completed, Editorial Evaluation Pending
03 Nov 2023 Reviewer(s) Assigned
17 Nov 2023 Editorial Decision: Accept