A Python regex question: extracting from multiple positions across multiple lines - V2EX

A Python regex question: extracting from multiple positions across multiple lines
  lapertem4 2014-07-21 19:06:53 +08:00 3252 views
    I'm a beginner who has just started learning Python...

    Given this text:

    AC1DE2FB
    AC3DE4FB
    AC5DE6FB

    I want to extract:

    [1,2]
    [3,4]
    [5,6]

    After several attempts I can only extract one value at a time. Could someone point me in the right direction?
    6 replies    2015-01-26 14:58:14 +08:00
    #1  messense  2014-07-21 19:22:43 +08:00
    >>> s = """AC1DE2FB
    ... AC3DE4FB
    ... AC5DE6FB"""
    >>> import re
    >>> s
    'AC1DE2FB\nAC3DE4FB\nAC5DE6FB'
    >>> pattern = re.compile(r'\w+?(\d+?)\w+?(\d+?)\w*', re.S | re.M)
    >>> dir(pattern)
    ['__copy__', '__deepcopy__', 'findall', 'finditer', 'match', 'scanner', 'search', 'split', 'sub', 'subn']
    >>> pattern.findall(s)
    [('1', '2'), ('3', '4'), ('5', '6')]
    #2  messense  2014-07-21 19:25:07 +08:00
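    The above was tried in an interactive terminal session. The complete code:

    import re

    # Two capture groups, one per number on each line. The re.S and re.M flags
    # have no effect here because the pattern uses neither '.' nor '^'/'$'.
    pattern = re.compile(r'\w+?(\d+?)\w+?(\d+?)\w*', re.S | re.M)
    s = """AC1DE2FB
    AC3DE4FB
    AC5DE6FB"""

    # findall returns one tuple of captured strings per match.
    result = pattern.findall(s)

    You can then process result further.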
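    The "further processing" step is not spelled out in the thread. A minimal sketch of one way to get the [1,2] / [3,4] / [5,6] shape asked for in the question, assuming the captured values are always integers (the names text and pairs are illustrative, not from the thread):

```python
import re

text = "AC1DE2FB\nAC3DE4FB\nAC5DE6FB"

# Same pattern as in the reply above, without the flags (they do not
# change the behaviour of this particular pattern).
pattern = re.compile(r'\w+?(\d+?)\w+?(\d+?)\w*')

# findall yields tuples of strings: [('1', '2'), ('3', '4'), ('5', '6')]
result = pattern.findall(text)

# Convert each tuple of strings into a list of ints.
pairs = [[int(a), int(b)] for a, b in result]

for pair in pairs:
    print(pair)
```

    Note that the lazy \d+? groups capture single digits, so this sketch fits the sample data; input with multi-digit numbers would likely need a different pattern.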
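    #3  lapertem4 (OP)  2014-07-21 19:30:30 +08:00
    @messense Thanks for the reply. I understand your approach, but (hehe) my real data is not just simple letters and digits; I simplified it for this example.

    Can re.findall match at multiple positions?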
    #4  messense  2014-07-21 19:46:02 +08:00
    @lapertem4 Once you understand the idea, can't you just adapt the regular expression to your actual data?
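    The real data is never shown in the thread, so the following is only a sketch of what "adapting the regular expression" could look like. It assumes, hypothetically, that each line holds two values wrapped in fixed markers (<a>...</a> and <b>...</b> are invented for illustration) and that the values may contain arbitrary characters:

```python
import re

# Hypothetical input: the markers and values are made up for this sketch.
text = """<a>x-1</a> noise <b>y 2</b>
<a>x-3</a> noise <b>y 4</b>"""

# Anchor each capture group on the literal text around it and capture
# lazily with (.*?). Without re.S, '.' does not match a newline, so a
# single match cannot spill over into the next line.
pattern = re.compile(r'<a>(.*?)</a>.*?<b>(.*?)</b>')

rows = [list(groups) for groups in pattern.findall(text)]
print(rows)
```

    The idea is the same as in the earlier reply: one capture group per value you want, with the surrounding pattern describing whatever separates them.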
    #5  yiding  2014-07-21 19:46:48 +08:00
    @lapertem4 It is called findall, after all: it finds every position that matches.
    Call it on your compiled pattern object: variable_name.findall
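    To illustrate the point about findall, here is a small sketch comparing the module-level re.findall with the findall method of a compiled pattern, plus re.finditer for when the match positions are needed as well. It reuses the sample text and pattern from the thread; the variable names are illustrative:

```python
import re

text = "AC1DE2FB\nAC3DE4FB\nAC5DE6FB"
regex = r'\w+?(\d+?)\w+?(\d+?)\w*'

# Module-level function: takes the pattern string and the text.
via_module = re.findall(regex, text)

# Method on a compiled pattern object: takes only the text.
via_pattern = re.compile(regex).findall(text)

# Both return every non-overlapping match, scanning left to right.
print(via_module == via_pattern)

# finditer yields match objects, which also expose where each match starts.
positions = [(m.start(), m.groups()) for m in re.finditer(regex, text)]
print(positions)
```

    With two capture groups in the pattern, each findall result is a tuple with one string per group; with a single group it would be a plain list of strings.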
    #6  lapertem4 (OP)  2015-01-26 14:58:14 +08:00
    test post data