regression | 易学教程

Scikit Learn sklearn.linear_model.LinearRegression: View the results of the model generated

Question: I can get sklearn.linear_model.LinearRegression to process my data, or at least to run the script without raising any exceptions or warnings. The only issue is that I am not trying to plot the results with matplotlib; instead, I want to see the coefficient estimates and diagnostic statistics for the model. How can I get a model summary, such as the intercept and slope (B0, B1), adjusted R-squared, etc., to display in the console or populate a variable instead of plotting? This is a generic
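As a minimal sketch of what the question asks for: a fitted LinearRegression exposes the intercept and slopes as the attributes intercept_ and coef_, and score() returns R-squared. scikit-learn does not report adjusted R-squared, so it is computed by hand here from the usual formula. The synthetic data and the `summary` dictionary are assumptions made for illustration, not part of the original post.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data (an assumption for illustration): y = 2 + 3x plus small noise.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(50, 1))
y = 2.0 + 3.0 * X[:, 0] + rng.normal(scale=0.5, size=50)

model = LinearRegression().fit(X, y)

n, p = X.shape                 # number of observations and predictors
r2 = model.score(X, y)         # coefficient of determination R^2
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p - 1)  # adjusted R^2, computed by hand

# Collect the results in a variable instead of plotting them.
summary = {
    "intercept": float(model.intercept_),  # B0
    "slope": float(model.coef_[0]),        # B1
    "r2": float(r2),
    "adj_r2": float(adj_r2),
}
print(summary)
```

For standard errors and p-values, a statistics-oriented library such as statsmodels may be a better fit, since its OLS results include a full summary table.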

Gaussian Process Regression: standard deviation meaning

Question: In the following code for Gaussian Process Regression (GPR):

    from sklearn.datasets import make_friedman2
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import DotProduct, WhiteKernel

    X, y = make_friedman2(n_samples=500, noise=0, random_state=0)
    kernel = DotProduct() + WhiteKernel()
    gpr = GaussianProcessRegressor(kernel=kernel, random_state=0).fit(X, y)

    # print() calls, so the snippet runs on Python 3
    print(gpr.score(X, y))
    print(gpr.predict(X[:2, :], return_std=True))

What is the
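For the return_std=True call in the snippet above, here is a hedged sketch of one way to read the second returned array: it is the standard deviation of the model's predictive distribution at each query point, so it can be turned into an approximate interval around the predicted mean. The 1.96 multiplier assumes a Gaussian predictive distribution, and the names `lower` and `upper` are introduced here for illustration.

```python
import numpy as np
from sklearn.datasets import make_friedman2
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import DotProduct, WhiteKernel

X, y = make_friedman2(n_samples=500, noise=0, random_state=0)
kernel = DotProduct() + WhiteKernel()
gpr = GaussianProcessRegressor(kernel=kernel, random_state=0).fit(X, y)

# predict() returns one mean and one standard deviation per query row.
mean, std = gpr.predict(X[:2, :], return_std=True)

# Approximate 95% band around each prediction (Gaussian assumption).
lower = mean - 1.96 * std
upper = mean + 1.96 * std

for m, s, lo, hi in zip(mean, std, lower, upper):
    print(f"mean={m:.1f}  std={s:.1f}  approx. 95% interval=[{lo:.1f}, {hi:.1f}]")
```

A large standard deviation relative to the mean suggests the model is quite uncertain about that prediction under the chosen kernel.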

Running multiple simple linear regressions from a nested dataframe/tibble

Question: I am trying to run multiple simple linear regressions based on data from a nested data frame and store the regression fit coefficients in a data frame using tidy(). My code block is as follows:

    library(tidyverse)
    library(broom)
    library(reshape2)
    library(dplyr)

    Factors <- as.factor(c("A", "B", "C", "D"))
    set.seed(5)
    DF <- data.frame(Factors, X = rnorm(4), Y = rnorm(4), Z = rnorm(4))
    MDF <- melt(DF, id.vars = c("Factors", "X"))
    DFF <- MDF %>% nest(-Factors)

If it is a single dataframe with many columns,

Most efficient way to run regression models for multiple independent variables on the same list of 80 dependent outcomes?

Question: What is the most efficient way to run regression models for a list of 20 independent variables (e.g. genetic variants, each of which will be tested alone) and 40 dependent variables? I am a beginner in R. I found a solution, but it would work only if I had one independent variable, and I am not sure how to proceed with many (http://techxhum.dk/loop-multiple-variables/). Thanks for your time.

Answer 1: Here's a somewhat dense solution that uses the mfastLmCpp() function from the MESS

Extract lists of p-values for each regression coefficients (1104 linear regressions) with R

Question: I am trying to run 1104 linear regressions with the same model. My independent variable does not change, but my dependent variable does: I have 1104 dependent variables. So far I can only extract the coefficients (intercepts included), t-statistics and R-squared statistics. I would also like to extract the list of p-values for each coefficient across the 1104 linear regressions. Is there an easy way to do that? Here is my code:

    # run 1104 regressions for M1
    bigtest <- as.data.frame(bigtest)
    test <-

R: Dynamically update formula

Question: How can I dynamically update a formula? Example:

    myvar <- "x"
    update(y ~ 1 + x, ~ . -x)            # y ~ 1 (works as intended)
    update(y ~ 1 + x, ~ . -myvar)        # y ~ x (doesn't work as intended)
    update(y ~ 1 + x, ~ . -eval(myvar))  # y ~ x (doesn't work as intended)

Answer 1: You can use paste() within the update() call.

    myvar <- "x"
    update(y ~ 1 + x, paste(" ~ . -", myvar))  # y ~ 1

Edit: As @A.Fischer noted in the comments, this won't work if myvar is a vector of length > 1:

    myvar <- c("k", "l")
    update(y ~ 1 + k

What is causing this error? Coefficients not defined because of singularities

Question: I'm trying to find a model for my data, but I get the message "Coefficients: (3 not defined because of singularities)". These occur for winter, large and high_flow. I found this: https://stats.stackexchange.com/questions/13465/how-to-deal-with-an-error-such-as-coefficients-14-not-defined-because-of-singu which said the cause may be incorrect dummy variables, but I've checked that none of my columns are duplicates. When I use the function alias() I get:

    Model : S ~ A + B + C + D + E + F + G + spring +
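The message quoted above generally indicates that some predictors are exact linear combinations of others, which can happen even when no two columns are identical. The following is a minimal sketch in Python with NumPy (not the asker's R code, and the season labels are made up) of one common cause: including a dummy column for every level of a categorical variable alongside an intercept. Whether this is what happened in the asker's model is an assumption, since the snippet is cut off.

```python
import numpy as np

# Hypothetical season labels for 12 observations (illustrative data only).
levels = ["spring", "summer", "autumn", "winter"]
seasons = np.array(levels * 3)

intercept = np.ones((len(seasons), 1))
# One 0/1 dummy column per season; no two columns are duplicates.
dummies = np.column_stack([(seasons == lev).astype(float) for lev in levels])

# Intercept plus all four dummies: the dummies sum to the intercept column.
X_full = np.hstack([intercept, dummies])
# Dropping one dummy (winter becomes the baseline) removes the dependency.
X_drop = np.hstack([intercept, dummies[:, :-1]])

rank_full = np.linalg.matrix_rank(X_full)
rank_drop = np.linalg.matrix_rank(X_drop)

print("columns:", X_full.shape[1], "rank:", rank_full)
print("columns:", X_drop.shape[1], "rank:", rank_drop)
```

When the rank is lower than the number of columns, at least one coefficient cannot be estimated uniquely, which is the situation the "not defined because of singularities" message describes.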