本文的文字及圖片來源于網路,僅供學習、交流使用,不具有任何商業用途,著作權歸原作者所有,如有問題請及時聯系我們以作處理
以下文章來源于小蚊子資料分析 ,作者小蚊子資料分析
代碼
1import string
2import pandas as pd
3import csv
4
5
6blocks = []
7def str_count(str):
8 '''找出字串中的中英文、空格、數字、標點符號個數'''
9 count_en = count_dg = count_sp = count_zh = \
10 count_pu = count_at = count_tan = count_wen = count_mao = count_jing = count_sheng = 0
11
12 for s in str:
13 # 英文
14 if s in string.ascii_letters:
15 count_en += 1
16 # 數字
17 elif s.isdigit():
18 count_dg += 1
19 # 空格
20 elif s.isspace():
21 count_sp += 1
22 # 中文
23 elif s.isalpha():
24 count_zh += 1
25 # 特殊字符
26 else:
27 for ss in s[:]:
28 if ss == '@':
29 count_at += 1
30 if ss == '!':
31 count_tan += 1
32 if ss == '?':
33 count_wen += 1
34 if ss == ':':
35 count_mao += 1
36 if ss == '#':
37 count_jing += 1
38 if '......' in s:
39 count_sheng += 1
40 count_pu += 1
41 print('英文字符:', count_en)
42 print('數字:', count_dg)
43 print('空格:', count_sp)
44 print('中文:', count_zh)
45 print('特殊字符:', count_pu)
46 print('@:', count_at)
47 print('!:', count_tan)
48 print('?:', count_wen)
49 print(':', count_mao)
50 print('#:', count_jing)
51 print('......', count_sheng)
52 block = [count_en, count_dg, count_sp, count_zh, count_pu, count_at, count_tan, count_wen, \
53 count_sheng, count_mao, count_jing]
54 blocks.append(block)
55
56
57f = pd.read_csv('C://Users//lecce//Desktop//1015.csv')
58for i in range(0, 569):
59 str_count(f.content[i])
60
61with open('content.csv', 'w', newline='') as csvfile:
62 writer = csv.writer(csvfile)
63 for x in range(0, len(blocks)):
64 writer.writerow(blocks[x])
如何統計文本中的中英文字符數,在Excel中用LEN函式無果后,只好拿出python,從網上開始抄代碼,經過一系列復制粘貼和拼湊,最終達到可用的狀態,可能還有些許bug,
轉載請註明出處,本文鏈接:https://www.uj5u.com/houduan/174408.html
標籤:其他
