Why do ROSE or SMOTE function changing numeric values?
ROSE and SMOTE function are changing numeric values and I don't know why.
I'm trying to predict hospital readmission in diabetic inpatients and the dataset is highly imbalanced. So I want to use oversampling method to balance my dataset - I've already tried ROSE and SMOTE function.
My dataset contains only numeric values (dummy variables) as I wanted to apply xgboost. But I notice that ROSE and SMOTE are generating non-integer values within variable that is binary.
Should those funtions change original values?
set.seed(1994)
d_split <- initial_split(d.fin, prop = .8)
train <- as.data.table(training(d_split))
test <- as.data.table(testing(d_split))
data.rose <- as.data.table(ROSE(readmitted~., data=train, seed=3)$data)
Original dataset
After applying ROSE function
r modeling oversampling
New contributor
add a comment |
ROSE and SMOTE function are changing numeric values and I don't know why.
I'm trying to predict hospital readmission in diabetic inpatients and the dataset is highly imbalanced. So I want to use oversampling method to balance my dataset - I've already tried ROSE and SMOTE function.
My dataset contains only numeric values (dummy variables) as I wanted to apply xgboost. But I notice that ROSE and SMOTE are generating non-integer values within variable that is binary.
Should those funtions change original values?
set.seed(1994)
d_split <- initial_split(d.fin, prop = .8)
train <- as.data.table(training(d_split))
test <- as.data.table(testing(d_split))
data.rose <- as.data.table(ROSE(readmitted~., data=train, seed=3)$data)
Original dataset
After applying ROSE function
r modeling oversampling
New contributor
1
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convertinsulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.
– Julius Vainora
2 days ago
add a comment |
ROSE and SMOTE function are changing numeric values and I don't know why.
I'm trying to predict hospital readmission in diabetic inpatients and the dataset is highly imbalanced. So I want to use oversampling method to balance my dataset - I've already tried ROSE and SMOTE function.
My dataset contains only numeric values (dummy variables) as I wanted to apply xgboost. But I notice that ROSE and SMOTE are generating non-integer values within variable that is binary.
Should those funtions change original values?
set.seed(1994)
d_split <- initial_split(d.fin, prop = .8)
train <- as.data.table(training(d_split))
test <- as.data.table(testing(d_split))
data.rose <- as.data.table(ROSE(readmitted~., data=train, seed=3)$data)
Original dataset
After applying ROSE function
r modeling oversampling
New contributor
ROSE and SMOTE function are changing numeric values and I don't know why.
I'm trying to predict hospital readmission in diabetic inpatients and the dataset is highly imbalanced. So I want to use oversampling method to balance my dataset - I've already tried ROSE and SMOTE function.
My dataset contains only numeric values (dummy variables) as I wanted to apply xgboost. But I notice that ROSE and SMOTE are generating non-integer values within variable that is binary.
Should those funtions change original values?
set.seed(1994)
d_split <- initial_split(d.fin, prop = .8)
train <- as.data.table(training(d_split))
test <- as.data.table(testing(d_split))
data.rose <- as.data.table(ROSE(readmitted~., data=train, seed=3)$data)
Original dataset
After applying ROSE function
r modeling oversampling
r modeling oversampling
New contributor
New contributor
edited 2 days ago
Mr Shunz
66911726
66911726
New contributor
asked 2 days ago
Michiru
12
12
New contributor
New contributor
1
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convertinsulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.
– Julius Vainora
2 days ago
add a comment |
1
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convertinsulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.
– Julius Vainora
2 days ago
1
1
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convert
insulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.– Julius Vainora
2 days ago
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convert
insulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.– Julius Vainora
2 days ago
add a comment |
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Michiru is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53945527%2fwhy-do-rose-or-smote-function-changing-numeric-values%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Michiru is a new contributor. Be nice, and check out our Code of Conduct.
Michiru is a new contributor. Be nice, and check out our Code of Conduct.
Michiru is a new contributor. Be nice, and check out our Code of Conduct.
Michiru is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53945527%2fwhy-do-rose-or-smote-function-changing-numeric-values%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
I'm not sure about the range of new values, but if a variable is considered numeric, then SMOTE creates new values with convex combinations of the existent ones. Hence, all values between 0 and 1 become possible. What you want then, I guess, is to convert
insulin
to a factor variable. In that case, stackoverflow.com/a/53653693/1320535 is relevant.– Julius Vainora
2 days ago