{"library":"sklearn-pandas","title":"sklearn-pandas","description":"sklearn-pandas (current version 2.2.0) provides a bridge between Scikit-Learn's machine learning methods and pandas DataFrames. It allows users to map DataFrame columns to different scikit-learn transformations, which are then recombined into features for model training. The library aims to streamline data preprocessing workflows involving both pandas and scikit-learn.","language":"python","status":"active","last_verified":"Mon May 18","install":{"commands":["pip install sklearn-pandas"],"cli":null},"imports":["from sklearn_pandas import DataFrameMapper","from sklearn.impute import SimpleImputer # Replaced in sklearn-pandas >= 2.0.0"],"auth":{"required":false,"env_vars":[]},"quickstart":{"code":"import pandas as pd\nfrom sklearn_pandas import DataFrameMapper\nfrom sklearn.preprocessing import LabelBinarizer, StandardScaler\n\ndata = pd.DataFrame({\n    'pet': ['cat', 'dog', 'dog', 'fish', 'cat', 'dog', 'cat', 'fish'],\n    'children': [4., 6, 3, 3, 2, 3, 5, 4],\n    'salary': [90., 24, 44, 27, 32, 59, 36, 27]\n})\n\n# Map DataFrame columns to Scikit-learn transformations\nmapper = DataFrameMapper([\n    ('pet', LabelBinarizer()),\n    (['children'], StandardScaler()),\n    ('salary', None) # 'None' keeps the column without transformation\n], df_out=True) # Set df_out=True to get a DataFrame output (requires pandas >= 1.0)\n\ntransformed_data = mapper.fit_transform(data.copy())\nprint(transformed_data.head())\nprint(transformed_data.columns)","lang":"python","description":"This quickstart demonstrates how to use `DataFrameMapper` to apply different scikit-learn transformers to specific columns of a pandas DataFrame. Categorical 'pet' column is binarized, 'children' is standardized, and 'salary' is kept as is. Setting `df_out=True` (requires pandas >= 1.0) ensures the output is a DataFrame rather than a NumPy array.","tag":null,"tag_description":null,"last_tested":null,"results":[]},"compatibility":{"tag":null,"tag_description":null,"last_tested":"2026-05-18","installed_version":"2.2.0","pypi_latest":"2.2.0","is_stale":false,"summary":{"python_range":"3.10–3.9","success_rate":50,"avg_install_s":13.6,"avg_import_s":2.63,"wheel_type":"wheel"},"results":[{"runtime":"python:3.10-alpine","python_version":"3.10","os_libc":"alpine (musl)","variant":"sklearn-pandas","exit_code":1,"wheel_type":null,"failure_reason":"build_error","import_side_effects":null,"install_time_s":null,"import_time_s":null,"mem_mb":null,"disk_size":null},{"runtime":"python:3.10-slim","python_version":"3.10","os_libc":"slim (glibc)","variant":"sklearn-pandas","exit_code":0,"wheel_type":"wheel","failure_reason":null,"import_side_effects":"broken","install_time_s":13.3,"import_time_s":null,"mem_mb":null,"disk_size":"341M"},{"runtime":"python:3.11-alpine","python_version":"3.11","os_libc":"alpine (musl)","variant":"sklearn-pandas","exit_code":1,"wheel_type":null,"failure_reason":"build_error","import_side_effects":null,"install_time_s":null,"import_time_s":null,"mem_mb":null,"disk_size":null},{"runtime":"python:3.11-slim","python_version":"3.11","os_libc":"slim (glibc)","variant":"sklearn-pandas","exit_code":0,"wheel_type":"wheel","failure_reason":null,"import_side_effects":"broken","install_time_s":12.6,"import_time_s":null,"mem_mb":null,"disk_size":"363M"},{"runtime":"python:3.12-alpine","python_version":"3.12","os_libc":"alpine (musl)","variant":"sklearn-pandas","exit_code":1,"wheel_type":null,"failure_reason":"build_error","import_side_effects":null,"install_time_s":null,"import_time_s":null,"mem_mb":null,"disk_size":null},{"runtime":"python:3.12-slim","python_version":"3.12","os_libc":"slim (glibc)","variant":"sklearn-pandas","exit_code":0,"wheel_type":"wheel","failure_reason":null,"import_side_effects":"broken","install_time_s":13.3,"import_time_s":null,"mem_mb":null,"disk_size":"344M"},{"runtime":"python:3.13-alpine","python_version":"3.13","os_libc":"alpine (musl)","variant":"sklearn-pandas","exit_code":1,"wheel_type":null,"failure_reason":"build_error","import_side_effects":null,"install_time_s":null,"import_time_s":null,"mem_mb":null,"disk_size":null},{"runtime":"python:3.13-slim","python_version":"3.13","os_libc":"slim (glibc)","variant":"sklearn-pandas","exit_code":0,"wheel_type":"wheel","failure_reason":null,"import_side_effects":"broken","install_time_s":13.8,"import_time_s":null,"mem_mb":null,"disk_size":"342M"},{"runtime":"python:3.9-alpine","python_version":"3.9","os_libc":"alpine (musl)","variant":"sklearn-pandas","exit_code":1,"wheel_type":null,"failure_reason":"build_error","import_side_effects":null,"install_time_s":null,"import_time_s":null,"mem_mb":null,"disk_size":null},{"runtime":"python:3.9-slim","python_version":"3.9","os_libc":"slim (glibc)","variant":"sklearn-pandas","exit_code":0,"wheel_type":"wheel","failure_reason":null,"import_side_effects":"clean","install_time_s":15.1,"import_time_s":2.63,"mem_mb":58.6,"disk_size":"356M"}]}}