Data Cleaning with PHP
Handling Missing Values with Rubix
RubixML provides the MissingDataImputer transformer for handling missing values. It replaces each missing value with a guess from a strategy such as Mean (the column mean), Percentile (the median when set to the 50th percentile), or Constant (a fixed value).
Dataset:
age,income,spending_score,tag
25,55000,45,yes
32,?,75,yes
40,72000,?,yes
?,82000,60,yes
28,63000,30,yes
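The imputation step for this dataset might look roughly like the sketch below. It is an illustration under stated assumptions, not the code that produced the output that follows: it assumes Rubix ML 1.x or later installed through Composer (so the strategies live under Rubix\ML\Strategies), it drops the tag column, and it writes each "?" as NAN by hand, on the assumption that the imputer treats NAN as the missing marker in continuous columns. The choice of Mean and Prior is also an assumption.

```php
<?php

declare(strict_types=1);

// Assumes Rubix ML was installed with Composer (rubix/ml).
require __DIR__ . '/vendor/autoload.php';

use Rubix\ML\Datasets\Unlabeled;
use Rubix\ML\Strategies\Mean;
use Rubix\ML\Strategies\Prior;
use Rubix\ML\Transformers\MissingDataImputer;

// The three numeric columns of the dataset above: age, income, spending_score.
// Assumption: every "?" has been rewritten as NAN by hand, because the imputer
// is assumed to look for NAN in continuous columns.
$samples = [
    [25, 55000, 45],
    [32, NAN, 75],
    [40, 72000, NAN],
    [NAN, 82000, 60],
    [28, 63000, 30],
];

$dataset = new Unlabeled($samples);

// Mean fills gaps in continuous columns. Prior would fill categorical gaps
// marked with "?"; there are none here, it is passed only to make both
// strategies explicit.
$imputer = new MissingDataImputer(new Mean(), new Prior(), '?');

// apply() fits the imputer on the dataset if needed, then transforms it in place.
$dataset->apply($imputer);

$imputed = $dataset->samples();

// Print the imputed rows as CSV lines.
foreach ($imputed as $row) {
    echo implode(',', $row), PHP_EOL;
}
```

With the Mean strategy, each gap is filled with the mean of the observed values in its column (31.25 for age, 68000 for income, 52.5 for spending_score), so the values can differ from a run that used another strategy.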
Result:
Memory: 0.242 Mb
Time running: 0.009 sec.
After Imputation:
---------------
25,55000,45
32,55000,75
40,72000,45
32,82000,60
28,63000,30