我想檢查列值是否在值中包含“COIN”等。是否有可能更改我的正則運算式以不包含“CRYPTOCOIN|KUCOIN|COINBASE”?我想要類似
“與 COIN 單詞|BTCBIT.NET 相關的正則運算式”
請在下面找到我的附加代碼:
val CRYPTO_CARD_INDICATOR: String = ("BTCBIT.NET|KUCOIN|COINBASE|CRYPTCOIN")
val CryptoCheckDataset = df.withColumn("is_crypto_indicator",when(upper(col("company_name")).rlike(CRYPTO_CARD_INDICATOR), 1).otherwise(0))
uj5u.com熱心網友回復:
我認為以下應該有效:
COIN|BTCBIT.NET
PySpark 中的完整測驗:
from pyspark.sql.functions import *
CRYPTO_CARD_INDICATOR = "COIN|BTCBIT.NET"
df = spark.createDataFrame([('kucoin',), ('coinbase',), ('crypto',)], ['company_name'])
CryptoCheckDataset = df.withColumn("is_crypto_indicator", when(upper(col("company_name")).rlike(CRYPTO_CARD_INDICATOR), 1).otherwise(0))
CryptoCheckDataset.show()
# ------------ -------------------
# |company_name|is_crypto_indicator|
# ------------ -------------------
# | kucoin| 1|
# | coinbase| 1|
# | crypto| 0|
# ------------ -------------------
轉載請註明出處,本文鏈接:https://www.uj5u.com/ruanti/521185.html
上一篇:Java驗證車輛登記
