Abstract
We consider the problem of running the regex pattern matching in a space-efficient manner. Given a regex, we suggest a bit-packing scheme for representing a compiled regex in a compressed way, which is its position automaton. Our scheme reduces its representation size further by relying on the homogeneous property of position automata and practical features of regexes. We implement the proposed scheme and evaluate the memory consumption using a practical regex benchmark dataset. Our approach produces a much smaller representation compared to two common FA representations. In addition, experimental results show that our bit-packing regex engine is effective for matching regexes that have large compiled forms, by showing less memory consumption compared to the current state-of-the-art regex engine (RE2).
Original language | English |
---|---|
Title of host publication | Implementation and Application of Automata - 27th International Conference, CIAA 2023, Proceedings |
Editors | Benedek Nagy, Benedek Nagy |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 290-301 |
Number of pages | 12 |
ISBN (Print) | 9783031402463 |
DOIs | |
Publication status | Published - 2023 |
Event | Implementation and Application of Automata - 27th International Conference, CIAA 2023, Proceedings - Famagusta, Cyprus Duration: 2023 Sept 19 → 2023 Sept 22 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 14151 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | Implementation and Application of Automata - 27th International Conference, CIAA 2023, Proceedings |
---|---|
Country/Territory | Cyprus |
City | Famagusta |
Period | 23/9/19 → 23/9/22 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- General Computer Science