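Is there a way to list all available versions of an Azure ML dataset? Not through the UI, but with the SDK. Also, how can we get the version just before the latest version of that Azure ML dataset?
The main goal here is to detect changes in data trends between versions.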
Answer from a uj5u.com user:
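Create a resource group and a workspace in Azure Machine Learning studio. Upload the dataset several times under the same name; each upload is registered as a new version of that dataset.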


Use the following code block to fetch every uploaded version of the dataset along with its details.
Code block 1
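from azureml.core import Dataset, Workspace

# Assumption: a workspace config.json is available locally; otherwise build ws another way.
ws = Workspace.from_config()
# Dataset.get_all returns the registered datasets keyed by name (latest version of each).
datasets = Dataset.get_all(workspace=ws)
latest = datasets['Diabetes123'].version
versions = [Dataset.get_by_name(workspace=ws, name='Diabetes123', version=v) for v in range(1, latest + 1)]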
Code block 2
versions
Output
[{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_055538_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": "Your ID",
"name": "Diabetes123",
"version": 1,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
},
{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_055914_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": " Your ID ",
"name": "Diabetes123",
"version": 2,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
},
{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_060011_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": " Your ID ",
"name": "Diabetes123",
"version": 3,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
},
{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_070300_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": " Your ID ",
"name": "Diabetes123",
"version": 4,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
},
{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_093655_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": " Your ID ",
"name": "Diabetes123",
"version": 5,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
}]
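The listing step above can be wrapped in a small reusable helper. This is only a sketch: the function name `list_dataset_versions` is hypothetical, and it assumes version numbers run contiguously from 1 up to the latest registered version, as they do in the output above.

```python
def list_dataset_versions(workspace, name):
    """Return every registered version of the dataset `name`, oldest first."""
    # Imported lazily so the helper can be defined even where azureml-core is absent.
    from azureml.core import Dataset

    # With no explicit version, get_by_name returns the latest registered version.
    latest = Dataset.get_by_name(workspace, name=name)
    # Assumption: versions are numbered 1..latest.version with no gaps.
    return [
        Dataset.get_by_name(workspace, name=name, version=v)
        for v in range(1, latest.version + 1)
    ]
```

With the workspace object `ws` from Code block 1, `list_dataset_versions(ws, 'Diabetes123')` should give the same list as `versions`.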
To get the version just before the latest one, use the code block below.
Code block:
versions[-2]
Output
{
"source": [
"('workspaceblobstore', 'UI/2022-10-14_070300_UTC/')"
],
"definition": [
"GetDatastoreFiles",
"ParseDelimited",
"DropColumns",
"SetColumnTypes"
],
"registration": {
"id": "your ID",
"name": "Diabetes123",
"version": 4,
"workspace": "Workspace.create(name='cancerset', subscription_id=your subscription ID', resource_group='your resource group')"
}
}
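Note that indexing with `[-2]` raises an `IndexError` when only one version is registered, and it relies on the list being ordered by version. A slightly more defensive sketch follows; the helper name `previous_version` is hypothetical, and it only assumes each item exposes a numeric `version` attribute, as the registered datasets above do.

```python
def previous_version(versions):
    """Return the dataset registered just before the latest one, or None."""
    # Sort by version number so the result does not depend on input order.
    ordered = sorted(versions, key=lambda ds: ds.version)
    # With fewer than two versions there is no "previous" one to return.
    return ordered[-2] if len(ordered) >= 2 else None
```

For tabular datasets, calling `to_pandas_dataframe()` on both the latest and the previous version and comparing summary statistics is one possible way to look for the trend changes mentioned in the question.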
