GitHub - Shellcat-Zero/skmca: A scikit-learn compatible implementation of MCA

Use https://github.com/MaxHalford/Prince instead

skmca

A scikit-learn pipeline API compatible implementation of Multiple Correspondence Analysis (MCA).

Usage

import pandas as pd
from skmca import MCA

df = pd.read_csv('http://www.statoek.wiso.uni-goettingen.de/'
                 'CARME-N/download/wg93.txt',
                 sep='\t', dtype='category')
mca = MCA()
mca.fit(df)

Crucially, the input to MCA.fit must be a pandas.DataFrame where all the columns have a category dtype. This is necessary to ensure that the dummy encoding of the columns is consistent across training and test datasets.

Background

MCA is like `PCA`_, but for categorical data. You can use it to visualize high-dimensional datasets. It can also be useful as a pre-processing step for clustering, to avoid the curse of dimensionality.

skmca requires pandas and scikit-learn.

References

This library follows the setup in `Nenadic and Greenacre (2005)`_.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
skmca		skmca
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
setup.cfg		setup.cfg
setup.py		setup.py
versioneer.py		versioneer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

skmca

Usage

Background

References

About

Uh oh!

Releases

Packages

Languages

License

Shellcat-Zero/skmca

Folders and files

Latest commit

History

Repository files navigation

skmca

Usage

Background

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages