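I have a text file containing a list of URLs, each followed by some text I don't need. I wrote a regex that extracts the part I need, and it works well, but the output includes unwanted characters (the [' and '] around each URL), as in the example below:
File contents (the list of URLs):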
http://www.example.com/52 (Status: 403) [Size: 919]
http://www.example.com/details (Status: 403) [Size: 919]
http://www.example.com/h (Status: 403) [Size: 919]
http://www.example.com/affiliate (Status: 403) [Size: 919]
http://www.example.com/56 (Status: 403) [Size: 919]
The regex I'm using is: "^[://.a-zA-Z0-9-_]*"
The output is:
['http://www.example.com/52']
['http://www.example.com/details']
['http://www.example.com/h']
['http://www.example.com/affiliate']
['http://www.example.com/56']
The output I need is:
http://www.example.com/52
http://www.example.com/details
http://www.example.com/h
http://www.example.com/affiliate
http://www.example.com/56
The code I'm using is:
import re
with open("test.txt","r") as test:
    for i in test:
        x = re.findall("^[://.a-zA-Z0-9-_]*",i)
        print(x)
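For reference, here is a minimal sketch of one way the plain strings might be obtained. The brackets and quotes appear because re.findall returns a list, and print shows the list's representation. The sketch assumes that only the leading match on each line is wanted and uses re.match with group(0) instead. The names URL_PREFIX, extract_url, and sample_lines are hypothetical, and the sample lines stand in for the contents of test.txt.

```python
import re

# Same character class as in the question, with the duplicate "/" dropped and
# the hyphen moved to the end so it is unambiguously a literal character.
URL_PREFIX = re.compile(r"^[:/.a-zA-Z0-9_-]*")


def extract_url(line):
    """Return the leading run of URL characters in a line as a plain string."""
    # The pattern uses "*", so it can match the empty string and match()
    # never returns None here.
    match = URL_PREFIX.match(line)
    return match.group(0)


# Stand-in for the lines read from test.txt (assumed sample data).
sample_lines = [
    "http://www.example.com/52 (Status: 403) [Size: 919]",
    "http://www.example.com/details (Status: 403) [Size: 919]",
]

for line in sample_lines:
    # Printing the string itself, not a list, avoids the [' and '] wrapper.
    print(extract_url(line))
```

Keeping re.findall and printing x[0] should give the same result for these lines, since findall with this anchored pattern returns a one-element list per line.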