# How to change the number of bins?

## How to change the number of bins?

I wish to calculate the weight of evidence (WOE) of a variable x that is heavily positively skewed: over 6000 of the observations are 999, and only about 200 fall in the range 1 to 27. I used the code

    IV <- create_infotables(data = Test[, -1], y = "class", bins = 10)

However, no matter what number I pass to the `bins` parameter, I only get 2 bins, [1,27] and [999,999]. Is there any way to look at the [1,27] range more closely, because those values represent a lot? The output from R is shown below:

    Table$pdays
          pdays    N    Percent        WOE        IV
    1    [1,27]  243 0.03807584  2.6743166 0.5267751
    2 [999,999] 6139 0.96192416 -0.2230081 0.5707022

Thank you very much!!

## Re: How to change the number of bins?

Seems rather likely that 999 is not really a measured value but rather is a missing value indicator.

-- David.

On 3/10/19 1:54 PM, wong bowie wrote:

> I wish to calculate the weight of evidence of a variable x, which is
> positively skewed, with over 6000 of the observations are 999 but only 200
> range from 1-27. I used the code,
>
> "IV<-create_infotables(data=Test[,-1],y="class",bins=10)"
>
> However, no matter what number I used in bins parameter, I can only get 2
> bins, [1,27] and [999,999]. Is there any way I can look into the [1,27]
> closely because they represent a lot? The output from R is shown below,
>
> Table$pdays
>      pdays        N   Percent     WOE       IV
>      1 [1,27]    243  0.03807584  2.6743166 0.5267751
>      2 [999,999] 6139 0.96192416 -0.2230081 0.5707022
>
> Thank you very much!!

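To illustrate why a sentinel value like 999 can defeat the `bins` argument, here is a minimal base-R sketch. It assumes the bins are formed from quantiles (equal-frequency binning), which is an assumption about the package's behaviour rather than something stated in this thread, and the `pdays` vector is simulated to match the counts in the posted output, not the poster's actual data.

```r
# Simulated stand-in for pdays (hypothetical data, not the poster's):
# 6139 sentinel values of 999 and 243 real values between 1 and 27.
set.seed(1)
pdays <- c(rep(999, 6139), sample(1:27, 243, replace = TRUE))

# Decile cut points, as an equal-frequency binning with 10 bins would request.
breaks <- quantile(pdays, probs = seq(0, 1, by = 0.1))
print(breaks)

# Only the distinct cut points can delimit bins.
distinct_breaks <- unique(breaks)
print(distinct_breaks)
```

Because about 96% of the values are 999, every decile above the minimum is 999, so the distinct cut points reduce to the minimum and 999 regardless of how many bins are requested.
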
## Re: How to change the number of bins?

You are right. This variable actually represents the number of days that passed after a client was contacted; 999 means the client has never been contacted.

But I am not supposed to change the value, am I?

On Sun, Mar 10, 2019 at 10:48 PM, David Winsemius <[hidden email]> wrote:

> Seems rather likely that 999 is not really a measured value but rather
> is a missing value indicator.

## Re: How to change the number of bins?

On 3/10/19 5:29 PM, wong bowie wrote:

> You are right. Actually this variable represents the number of day
> passed after contacting a client, 999 means the client has never been
> contacted.
>
> But I am not supposed to change the value, am I?

I certainly would. SAS allows one to specify a value such as 999 to be missing, but R needs to have it changed to NA:

    is.na(Table$pdays) <- Table$pdays == 999

-- David
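
The recoding suggested above can be sketched end to end in base R. This is an illustrative sketch under stated assumptions, not the package's internal method: the `Test` data frame is simulated (hypothetical event rates of 10% and 60%), the recode is applied to the raw data frame rather than to the printed table, quartile cut points are an arbitrary choice, and WOE is taken as the log of the share of events over the share of non-events. The recoded data could presumably also be passed back to `create_infotables()`; base R is used here only so the example runs without the package.

```r
# Hypothetical stand-in for the poster's `Test` data frame.
set.seed(42)
n_real <- 243
Test <- data.frame(
  pdays = c(rep(999, 6139), sample(1:27, n_real, replace = TRUE)),
  class = c(rbinom(6139, 1, 0.10), rbinom(n_real, 1, 0.60))
)

# Treat the sentinel as missing, as suggested above.
is.na(Test$pdays) <- Test$pdays == 999

# Equal-frequency cut points computed from the observed values only.
breaks <- unique(quantile(Test$pdays, probs = seq(0, 1, by = 0.25), na.rm = TRUE))
bin <- cut(Test$pdays, breaks = breaks, include.lowest = TRUE)

# Keep "never contacted" as its own bin rather than dropping those rows.
bin <- addNA(bin)
levels(bin)[is.na(levels(bin))] <- "never contacted"

# Weight of evidence per bin: log(share of events / share of non-events).
events     <- tapply(Test$class == 1, bin, sum)
non_events <- tapply(Test$class == 0, bin, sum)
woe <- log((events / sum(events)) / (non_events / sum(non_events)))

result <- data.frame(
  bin = levels(bin),
  N   = as.vector(table(bin)),
  WOE = round(as.vector(woe), 3)
)
print(result)
```

Keeping the missing values as a separate "never contacted" bin preserves the information that 999 carried, while the quantiles of the remaining values are free to split the 1 to 27 range.
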