I wondered about this for some time. Simple regex (as in formal language theory)... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

tibiapejagala on July 20, 2016 | parent | context | favorite | on: Stack Overflow Outage Postmortem

I wondered about this for some time.

Simple regex (as in formal language theory) are matched in O(n) time by finite automaton.

Extended regex like PCRE are more powerful, but most of the time are implemented by backtracking engines, where really bad regex pattern might go exponential, but even simple pattern as in postmortem can go O(n^2).

Do implementations optimize simple regex patterns to O(n) matching? Even I wrote x86 JIT regex compiler for fun some time ago. Compilation time was really bad, but matching was O(n).

alexchamberlain on July 21, 2016 [–]

There are a few implementations that are linear, but compilation time is then exponential instead.

tibiapejagala on July 21, 2016 | [–]

Which is still a big win because

* regex pattern is controlled by site, while regex input is external

* regex pattern is compiled once, while it is being run for every input

Consider applying for YC's Summer 2025 batch! Applications are open till May 13
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact