值得使用 Python 的 re.compile 吗？

小编典典

值得使用 Python 的 re.compile 吗？

all

在 Python 中对正则表达式使用 compile 有什么好处吗？

h = re.compile('hello')
h.match('hello world')

对比

re.match('hello', 'hello world')

阅读 157

2022-03-08

共1个答案

小编典典

我有很多运行编译正则表达式 1000 次而不是即时编译的经验，并且没有注意到任何可察觉的差异。显然，这是轶事，当然不是反对
编译的好论据，但我发现差异可以忽略不计。

编辑：在快速浏览了实际的 Python 2.5 库代码后，我发现无论何时使用 Python
都会在内部编译并缓存正则表达式（包括re.match()调用根本不会节省很多时间 - 只是检查缓存所需的时间（内部dict类型的键查找）。

从模块 re.py （评论是我的）：

def match(pattern, string, flags=0):
    return _compile(pattern, flags).match(string)

def _compile(*key):

    # Does cache check at top of function
    cachekey = (type(key[0]),) + key
    p = _cache.get(cachekey)
    if p is not None: return p

    # ...
    # Does actual compilation on cache miss
    # ...

    # Caches compiled regex
    if len(_cache) >= _MAXCACHE:
        _cache.clear()
    _cache[cachekey] = p
    return p

我仍然经常预编译正则表达式，但只是为了将它们绑定到一个好的、可重用的名称，而不是为了任何预期的性能提升。

2022-03-08