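Raw HBase text: in the second column, the number x in cx is the column position in the expected result, and the text after value=https://bbs.csdn.net/topics/ is the value for that column. For each row key, write the values after value= onto one line in ascending column order, followed by the row key.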
02120aa_MERGE_20190711_551_1000020250198 column=aa:c4, timestamp=1562935948187, value=https://bbs.csdn.net/topics/250
02120aa_MERGE_20190711_551_1000020250198 column=aa:c5, timestamp=1562935948187, value=https://bbs.csdn.net/topics/20190519
02120aa_MERGE_20190711_551_1000020250198 column=aa:c2, timestamp=1562935948187, value=https://bbs.csdn.net/topics/0
02120aa_MERGE_20190711_551_1000020250198 column=aa:c3, timestamp=1562935948187, value=https://bbs.csdn.net/topics/551
02120aa_MERGE_20190711_551_1000020250198 column=aa:c1, timestamp=1562935948187, value=https://bbs.csdn.net/topics/aa_MERGE_20190711_551_1000030
02120aa_MERGE_20190711_551_1000020250201 column=aa:c1, timestamp=1562935948187, value=https://bbs.csdn.net/topics/aa_MERGE_20190711_551_1000020
02120aa_MERGE_20190711_551_1000020250201 column=aa:c3, timestamp=1562935948187, value=https://bbs.csdn.net/topics/160
02120aa_MERGE_20190711_551_1000020250201 column=aa:c4, timestamp=1562935948187, value=https://bbs.csdn.net/topics/167
02120aa_MERGE_20190711_551_1000020250201 column=aa:c2, timestamp=1562935948187, value=https://bbs.csdn.net/topics/0558
02120aa_MERGE_20190711_551_1000020250201 column=aa:c5, timestamp=1562935948187, value=https://bbs.csdn.net/topics/69
------------Expected result--------------
aa_MERGE_20190711_551_1000020|0558|160|167|69|02120aa_MERGE_20190711_551_1000020250201
aa_MERGE_20190711_551_1000030|0|551|250|20190519|02120aa_MERGE_20190711_551_1000020250198
uj5u.com helpful user replied:
I don't quite understand what you mean.
uj5u.com helpful user replied:
Recommended: http://www.zsythink.net/archives/tag/awk (good stuff!)
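# Requires GNU awk (gawk 4.0+): arrays of arrays and PROCINFO["sorted_in"].
# Assumes whitespace-separated fields: $1 = row key, $2 = column=aa:cN, $4 = value=...
$2 ~ /^column=/ {
    col = $2
    sub(/^.*:c/, "", col)        # "column=aa:c4," -> "4,"
    val = $4
    # Slashes inside a regex literal must be escaped.
    sub(/^value=https:\/\/bbs\.csdn\.net\/topics\//, "", val)
    a[$1][col + 0] = val         # "4," + 0 -> 4, so columns sort numerically
}
END {
    # Without this, for-in order is unspecified and columns may come out shuffled.
    PROCINFO["sorted_in"] = "@ind_num_asc"
    for (i in a) {
        line = ""
        for (k in a[i])
            line = line a[i][k] "|"
        print line i
    }
}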
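The script above depends on GNU awk extensions (arrays of arrays). Where only a POSIX awk such as mawk is available, the same pivot can probably be done with the sketch below. It is not the original poster's method. The function name pivot_hbase and the variable names are made up for illustration, and it assumes the same layout as the sample: row key in field 1, column=aa:cN in field 2, and value=https://bbs.csdn.net/topics/... in field 4.

```shell
# Hypothetical helper: pivot HBase scan lines into one line per row key.
pivot_hbase() {
    awk '
    $2 ~ /^column=/ {
        col = $2
        sub(/^.*:c/, "", col)      # "column=aa:c4," -> "4,"
        col += 0                   # keep the leading number: "4," -> 4
        val = $4
        sub(/^value=https:\/\/bbs\.csdn\.net\/topics\//, "", val)
        # Remember each row key once, in order of first appearance.
        if (!($1 in seen)) { seen[$1] = 1; rows[++n] = $1 }
        cell[$1, col] = val
        if (col > max) max = col
    }
    END {
        for (r = 1; r <= n; r++) {
            line = ""
            # Walk column numbers upward; missing columns are skipped.
            for (c = 1; c <= max; c++)
                if ((rows[r], c) in cell)
                    line = line cell[rows[r], c] "|"
            print line rows[r]
        }
    }' "$@"
}
```

Usage would be something like pivot_hbase scan.txt, where scan.txt is a placeholder for a file holding the raw lines. Rows are printed in the order their keys first appear in the input, which may differ from the order shown in the expected result; pipe the output through sort if a particular order matters.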
