我正在嘗試使用 LinearRegression() 演算法來預測房屋的價格。
這是我的代碼:
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
df = pd.read_csv('data.csv')
df = df.drop(columns=['date', 'street', 'city', 'statezip', 'country'])
X = df.drop(columns=['price'])
y = df['price']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
lr = LinearRegression()
lr.fit(X_train, y_train)
pred = lr.predict(X_test)
pred.reshape((-1, 1))
acc = lr.score(pred, y_test)
但是,我不斷收到此錯誤:
Reshape your data either using array.reshape(-1, 1) if your data has a single feature or array.reshape(1, -1) if it contains a single sample.
我試圖重塑我的資料中的所有屬性,但我唯一能夠重塑的是pred,并且在這樣做之后我仍然得到同樣的錯誤?
我應該如何解決這個錯誤?
提前致謝。
uj5u.com熱心網友回復:
基于檔案sklearn.linear_model.LinearRegression.score:
分數(X,y,sample_weight=None)
回傳 self.predict(X) 的 R^2 分數。是的。
您需要X作為第一個引數傳遞,如下所示:
lr.fit(X_train, y_train)
acc = lr.score(X_test, y_test)
print(acc)
或者您可以使用sklearn.metrics.r2_score:
from sklearn.metrics import r2_score
acc = r2_score(y_test, pred)
print(acc)
例子:
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
X = np.array([[1, 1], [1, 2], [2, 2], [2, 3]])
y = np.dot(X, np.array([1, 2])) 3
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4, random_state=42)
lr = LinearRegression()
lr.fit(X_train, y_train)
pred = lr.predict(X_test)
acc = lr.score(X_test, y_test)
print(acc)
# Or
from sklearn.metrics import r2_score
acc = r2_score(y_test, pred)
print(acc)
輸出:
0.8888888888888888
0.8888888888888888
轉載請註明出處,本文鏈接:https://www.uj5u.com/ruanti/454200.html
上一篇:在python中創建一個滾動條到一個完整的視窗tkinter
下一篇:ModuleNotFoundError:沒有名為“sklearn.neighbors._dist_metrics”的模塊
