這是我之前的問題的延續,只是檢查我是否能夠同時處理這個模型
減少“同時讀取”回圈的處理時間
我有一個巨大的 csv 檔案,具有不同長度的欄位 11,例如
"xx","x",x,x,x,xx,xx,"x",x,11,"00000aaaaD00000bbbbD00000abcdD00000dwasD00000dedsD00000ddfgD00000dsdfD00000snfjD00000djffD00000wedfD00000asdfZ"
"xx","x",x,x,x,xx,xx,"x",x,5,"00000aaaaD00000bbbbD00000abcdD00000dwasD00000dedsD"
將欄位 11 拆分為 10 大小后,我需要 6-9 個字符。然后我必須將它作為新行插入我需要如下輸出,
"xx","x",x,x,x,xx,xx,"x",x,11,"aaaa"
"xx","x",x,x,x,xx,xx,"x",x,11,"bbbb"
"xx","x",x,x,x,xx,xx,"x",x,11,"abcd"
.
.
.
"xx","x",x,x,x,xx,xx,"x",x,11,"asdf"
"xx","x",x,x,x,xx,xx,"x",x,5,"djff"
.
.
"xx","x",x,x,x,xx,xx,"x",x,5,"deds"
while read -r line1; do
icount=$[icount 1]
col_11=$( echo $line1 | cut -d',' -f11 )
col_10=$( echo $line1 | cut -d',' -f1,2,3,4,5,7,10)
#echo $col_11
col_11_trim=$(echo "$col_11" | tr -d '"')
#echo $col_11_trim
echo $col_11_trim | fold -w10 > $path/col_11_extract
while read -r line2; do
ocount=$[ocount 1]
strng_cut=$(echo $line2 | cut -c6-9)
echo "$col_10",\""$strng_cut"\" >> $path/final_out
done < $path/col_11_extract
done < $input
uj5u.com熱心網友回復:
與awk:
awk 'BEGIN{FS=OFS=","}
{
eleven=$11;
len=length(eleven);
for(i=2; i<len-1; i=i 10){
$11="\"" substr(eleven, i 5, 4) "\"";
print;
}
}' file
for回圈從位置開始2并len-1以欄位 11 中的引號結束。
輸出:
"xx","x",x,x,x,xx,xx,"x",x,11,"aaaa" "xx","x",x,x,x,xx,xx,"x",x,11,"bbbb" "xx","x",x,x,x,xx,xx,"x",x,11,"abcd" "xx","x",x,x,x,xx,xx,"x",x,11,"dwas" "xx","x",x,x,x,xx,xx,"x",x,11,"deds" "xx","x",x,x,x,xx,xx,"x",x,11,"ddfg" "xx","x",x,x,x,xx,xx,"x",x,11,"dsdf" "xx","x",x,x,x,xx,xx,"x",x,11,"snfj" "xx","x",x,x,x,xx,xx,"x",x,11,"djff" "xx","x",x,x,x,xx,xx,"x",x,11,"wedf" "xx","x",x,x,x,xx,xx,"x",x,11,"asdf" "xx","x",x,x,x,xx,xx,"x",x,5,"aaaa" "xx","x",x,x,x,xx,xx,"x",x,5,"bbbb" "xx","x",x,x,x,xx,xx,"x",x,5,"abcd" "xx","x",x,x,x,xx,xx,"x",x,5,"dwas" "xx","x",x,x,x,xx,xx,"x",x,5,"deds"
轉載請註明出處,本文鏈接:https://www.uj5u.com/ruanti/341205.html
