將idx檔案或字串決議為PandasDataFrame-有解無憂

我想將以下 idx 檔案：將 idx 檔案或字串決議為 Pandas DataFrame

接下來我想構建一個容錯的決議器，因為它應該每天將一個新的 idx 檔案決議到 pd.DataFrame 中。

我的想法是使用字串操作，但它會非常復雜并且不能容錯。

如果有人可以展示決議和提供樣板代碼的最佳實踐，我將不勝感激。

uj5u.com熱心網友回復：

由于這主要是一個固定寬度的檔案，您可以使用 pandasread_fwf來讀取此檔案。您可以跳過主要資訊（通過skiprows=）并直接獲取資料。列名是預定義的并在讀取時分配：

idx_path = 'company.20220112.idx'
names = ['Company Name','Form Type','CIK','Date Filed','File Name']
df = pd.read_fwf(idx_path, colspecs=[(0,61),(62,74),(74,84),(86,94),(98,146)], names=names, skiprows=11)
df.head(10)

                                         Company Name Form Type      CIK Date Filed                                    File Name  
0   005 - Series of IPOSharks Venture Master Fund,...         D  1888451   20220112  edgar/data/1888451/0001888451-22-000002.txt  
1           10X Capital Venture Acquisition Corp. III    EFFECT  1848948   20220111  edgar/data/1848948/9999999995-22-000102.txt  
2                              110 White Partners LLC         D  1903845   20220112  edgar/data/1903845/0001884293-22-000001.txt  
3                                       15 Beach, MHC         3  1903509   20220112  edgar/data/1903509/0001567619-22-001073.txt  
4                                       15 Beach, MHC    SC 13D  1903509   20220112  edgar/data/1903509/0000943374-22-000014.txt  
5                                      170 Valley LLC         D  1903913   20220112  edgar/data/1903913/0001903913-22-000001.txt  
6                         1st FRANKLIN FINANCIAL CORP     424B3    38723   20220112    edgar/data/38723/0000038723-22-000003.txt  
7                         1st FRANKLIN FINANCIAL CORP     424B3    38723   20220112    edgar/data/38723/0000038723-22-000004.txt  
8                               215 BF Associates LLC         D  1904145   20220112  edgar/data/1904145/0001904145-22-000001.txt  
9                       2401 Midpoint Drive REIT, LLC         D  1903337   20220112  edgar/data/1903337/0001903337-22-000001.txt

轉載請註明出處，本文鏈接：https://www.uj5u.com/caozuo/485608.html

標籤：Python 熊猫解析

上一篇：我如何決議JSON檔案中的資料并將其保存到shell腳本中的另一個文本檔案

下一篇：如何將字串“5e-04”決議為數字0.0005？[復制]