python - KeyError: ’page’這個錯誤要怎么搞?
問題描述
def parse2csv(filepath): logfile = open(filepath,’r’) source_ip_dict={} res_url_dict={} from_url_dict={} category_dict={} print(’start.....’) for line in logfile:line=line.strip()if line!='': source_ip = line.split(’- -’)[0].strip() if re.match(’[0-9]{1,3}.[0-9]{1,3}.[0-9]{1,3}’,source_ip):if source_ip_dict.get(source_ip,’-’)==’-’: source_ip_dict[source_ip]=1else: source_ip_dict[source_ip]=source_ip_dict[source_ip]+1 reg=’'[GETPUOSHADINS]{3,8} /’ url_start = re.compile(reg) re_result = url_start.findall(line) if len(re_result)>=1:res_url = ’/’+line.split(re_result[0])[1].split(’ ’)[0]category = strip_detail(res_url.split(’/’))if len(category)>=1: if category[0] in [’article’,’item’,’question’,’topic’,’people’,’celebrity’,’brand’,’page’,’category’]:if category_dict.get(category[0],’-’)==’-’: category_dict[category[0]]=1else: category_dict[category[0]]=category_dict[category[0]]+1if res_url.endswith(’.jpg’) or res_url.endswith(’.css’) or res_url.endswith(’.js’) or res_url.endswith(’.png’) or res_url.endswith(’.gif’): passelse: if res_url.find(r’.css?’)!=-1 or res_url.find(r’.js?’)!=-1:pass else:if res_url_dict.get(res_url,’-’)==’-’: res_url_dict[res_url]=1else: res_url_dict[res_url]=res_url_dict[res_url]+1 from_url_start =’ 'http://’ if line.find(from_url_start)!=-1:from_url = ’http://’+line.split(from_url_start)[1].split(’' ’)[0]if from_url_dict.get(from_url,’-’)==’-’: from_url_dict[from_url]=1else: from_url_dict[from_url]=from_url_dict[from_url]+1 logfile.close() yesterday = datetime.date.today()-datetime.timedelta(days=1) yesterday_str = str(yesterday).replace(’-’,’’) conn = pymysql.connect(host=’127.0.0.1’, user=’fenxi’,passwd=’’, db=’rizhifenxi’, port = 3306, charset = ’utf8’) cur = conn.cursor() cur.execute(’insert into categort(yesterday_str, item, question, topic, people, celebrity, brand, page, category, article) values(%s,%s,%s,%s,%s,%s,%s,%s,%s,%s)’, (yesterday_str, category[’item’], category[’question’], category[’topic’], category[’people’], category[’celebrity’], category[’brand’], category[’page’], category[’category’], category[’article’]))#temp = 'insert into ipList(ipList) values ('+ ip_list[j] + ')'#print temp#print (’success connect’) conn.commit() cur.close() conn.close()
代碼如上
報錯如下:
root@iZbp1iqn00z9x3jov6bas1Z:/data/wwwlogs/www# python parselog2.py/data/wwwlogs/www/m.xxx.com-access_20170419.logstart.....Traceback (most recent call last): File 'parselog2.py', line 145, in <module> parse2csv(filename) File 'parselog2.py', line 92, in parse2csv cur.execute(’insert into categort(yesterday_str, item, question, topic, people, celebrity, brand, page, category, article) values(%s,%s,%s,%s,%s,%s,%s,%s,%s,%s)’, (yesterday_str, category[’item’], category[’question’], category[’topic’], category[’people’], category[’celebrity’], category[’brand’], category[’page’], category[’category’], category[’article’]))TypeError: list indices must be integers or slices, not strroot@iZbp1iqn00z9x3jov6bas1Z:/data/wwwlogs/www#
這是哪里錯了呢?我想把category的統(tǒng)計結(jié)果這個結(jié)果導入到數(shù)據(jù)庫,要怎么修改呢?
category修改成category_dict之后其他取值正常,就是這個取值失敗,是什么原因呢?
cur.execute(’insert into categort(yesterday_str, item, question, topic, people, celebrity, brand, page, category, article) values(%s,%s,%s,%s,%s,%s,%s,%s,%s,%s)’, (yesterday_str, category_dict[’item’], category_dict[’question’], category_dict[’topic’], category_dict[’people’], category_dict[’celebrity’], category_dict[’brand’], 0, category_dict[’category’], category_dict[’article’]))KeyError: ’category’
問題解答
回答1:看錯誤提示,猜想你這里的category類型是一個list,而不是一個dict,不能category[’item’]這樣來取值
回答2:懷疑你把category_dict寫成category了
相關(guān)文章:
1. java - 安卓電視盒子取得了root權(quán)限但是不能安裝第三方應(yīng)用,請問該怎么辦?2. 在MySQL中新增字段時,報錯??3. 老哥們求助啊4. css3 - 請問一下在移動端CSS布局布局中通常需要用到哪些元素,屬性?5. javascript - js 寫一個正則 提取文本中的數(shù)據(jù)6. npm鏡像站全新上線7. javascript - vue-router怎么不能實現(xiàn)跳轉(zhuǎn)呢8. javascript - [WDS] Disconnected! 一直重復出現(xiàn)。9. python - 模擬滑動驗證碼,有源碼,求解10. html5 - angularjs中外部模版加載無法使用
