Using Python to Check Whether Tracking Code Has Been Deployed
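After tracking code has been deployed to a website, you usually need to verify that the developers actually placed it on every page. Checking by hand is time-consuming, so I put together a small Python script that does the check automatically, and I am sharing it here. The script targets Python 2 (it uses urllib2): it reads URLs from urllist.txt, one per line, fetches each page, and counts how many times the expected snippet appears.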
# -*- coding: utf-8 -*-
__author__ = 'www.biaodianfu.com'
import urllib2, gzip, StringIO

# The exact tracking snippet every page is expected to contain.
code = "_gaq.push(['_setAccount', 'UA-123456789-1']);"
filename = 'urllist.txt'
checklist = open(filename, 'r')
# One URL per line; blank lines are skipped.
urls = [l.strip() for l in checklist.readlines() if l.strip()]
for url in urls:
    page_encode = "utf-8"
    try:
        request = urllib2.Request(url)
        request.add_header("Accept-encoding", "gzip")
        sock = urllib2.urlopen(request)
        page = sock.read()
        # Decompress the body if the server sent it gzip-encoded.
        if sock.headers.get('content-encoding', None) == 'gzip':
            page = gzip.GzipFile(fileobj=StringIO.StringIO(page)).read()
        # 'replace' keeps a page that is not valid UTF-8 from aborting the run.
        if not isinstance(page, unicode):
            page = unicode(page, page_encode, 'replace')
        times = page.count(code)
        print "%s: found %d time(s)" % (url, times)
    except urllib2.HTTPError, e:
        # Report the HTTP status code for pages that could not be fetched.
        print url, e.code
checklist.close()
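The decompress-decode-count step can also be tried without touching the network. The following is only a minimal sketch in Python 3 (not the original Python 2 script); the helper names `decode_body` and `count_snippet` and the sample page are made up for illustration.

```python
import gzip


def decode_body(raw, content_encoding=None, charset="utf-8"):
    """Return the page text, gunzipping first when the header says gzip."""
    if content_encoding == "gzip":
        raw = gzip.decompress(raw)
    # errors="replace" keeps a mislabelled page from aborting the whole run.
    return raw.decode(charset, errors="replace")


def count_snippet(page, snippet):
    """Count non-overlapping occurrences of the exact snippet text."""
    return page.count(snippet)


# Hypothetical sample page, used only to exercise the helpers offline.
SNIPPET = "_gaq.push(['_setAccount', 'UA-123456789-1']);"
sample_html = "<script>var _gaq = _gaq || []; " + SNIPPET + "</script>"
compressed = gzip.compress(sample_html.encode("utf-8"))

text = decode_body(compressed, content_encoding="gzip")
print(count_snippet(text, SNIPPET))
```

A count of 0 means the snippet is missing, and a count above 1 usually means it was pasted more than once. Because the match is an exact string comparison, a snippet that differs only in spacing or quote style is counted as absent.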
To check whether _setAccount, or any other item pushed onto _gaq, has been set correctly, the following code can be used.
# -*- coding: utf-8 -*-
__author__ = 'www.biaodianfu.com'
import urllib2, gzip, StringIO, re

filename = 'urllist.txt'
# Captures the account ID that follows '_setAccount'.
pattern = re.compile("'_setAccount', '(.*?)'")
checklist = open(filename, 'r')
# One URL per line; blank lines are skipped.
urls = [l.strip() for l in checklist.readlines() if l.strip()]
for url in urls:
    page_encode = "utf-8"
    try:
        request = urllib2.Request(url)
        request.add_header("Accept-encoding", "gzip")
        sock = urllib2.urlopen(request)
        page = sock.read()
        # Decompress the body if the server sent it gzip-encoded.
        if sock.headers.get('content-encoding', None) == 'gzip':
            page = gzip.GzipFile(fileobj=StringIO.StringIO(page)).read()
        if not isinstance(page, unicode):
            page = unicode(page, page_encode, 'replace')
        codes = pattern.findall(page)
        # Guard against pages with no _setAccount call (codes[0] would fail).
        if codes:
            account = codes[0].encode('utf-8')
            print "%s Account:%s" % (url, account)
        else:
            print "%s Account: not found" % url
    except urllib2.HTTPError, e:
        print url, e.code
checklist.close()
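The pattern above only matches single quotes with exactly one space after the comma. As a hedged sketch in Python 3 (again, not the original script), a slightly more tolerant check might look like the following; `ACCOUNT_RE`, `extract_accounts`, and `check_account` are hypothetical names introduced here, and the sample strings are invented.

```python
import re

# Accepts single or double quotes and optional whitespace around the comma.
# It does not insist that opening and closing quotes are the same kind.
ACCOUNT_RE = re.compile(r"""['"]_setAccount['"]\s*,\s*['"]([^'"]*)['"]""")


def extract_accounts(page):
    """Return every account ID passed to _setAccount, in page order."""
    return ACCOUNT_RE.findall(page)


def check_account(page, expected):
    """Classify a page as 'ok', 'missing', or 'mismatch' for one account ID."""
    accounts = extract_accounts(page)
    if not accounts:
        return "missing"
    if all(account == expected for account in accounts):
        return "ok"
    return "mismatch"


# Hypothetical page fragments used only for illustration.
page_a = "_gaq.push(['_setAccount', 'UA-123456789-1']);"
page_b = '_gaq.push(["_setAccount" , "UA-123456789-1"]);'

print(extract_accounts(page_a))
print(check_account(page_b, "UA-123456789-1"))
print(check_account("<html></html>", "UA-123456789-1"))
```

Returning a status rather than printing directly makes it easy to collect the results per URL and list only the pages that need attention.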
- Author: 标点符 Source: 标点符
- Tags: tracking code
- Published: 2012-01-27 18:52:22