Counting 3,367 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

Last Commit
Nov. 19, 2018
Mar. 24, 2017

A data mining suite for gene expression data

candis is an open source data mining suite (released under the GNU General Public License v3) for gene expression data that consists of a wide collection of tools you require, right from Data Extraction to Model Deployment. candis is built on top of the toolkit - CancerDiscover written by the bioinformaticians at HelikarLab.

WARNING: candis currently is still in dev mode and not production-ready yet. In case if you run across bugs or errors, raise an issue over here.

Table of Contents


Assuming you've installed dependencies, simply

$ pip install candis


$ curl -sL | python # with dependencies

... and lauch candis's development server:

$ candis

To install candis right from scratch, check out our exhaustive guides:

Docker Image

You can also attempt to install candis via Docker as follows:

$ docker pull helikarlab/candis

... and simply run the image optionally mapping the port 5000.

$ docker run -p 8888:5000 helikarlab/candis


Launching the RIA (Rich Internet Application)

via CLI

$ candis


$ python -m candis

via Python

>>> import candis
>>> candis.main()

Using the CLI (Command Line Interface)

$ candis --cdata path/to/data.cdata --config path/to/config.json


  • Converting a CDATA to an ARFF file

     >>> import candis
     >>> cdata ='path/to/data.cdata')

    Then, simply use the CData.toARFF API:

     >>> cdata.toARFF('path/to/data.arff')
  • Running a Pipeline.

     >>> pipe = candis.Pipeline()
     >>> while pipe.status == candis.Pipeline.RUNNING:
     ...     # do something while pipeline is running


  • Production Dependencies
    • R
    • WEKA (NOTE: Requires Java)
    • Python 3.6+ and PIP (Python's Package Manager)
    • NumPy
  • Development Dependencies


Dr. Tomas Helikar, Ph.D

Principal Investigator

Dr. Akram Mohammed, Ph.D

Author and Maintainer

Achilles Rasquinha

Author and Maintainer

Rupav Jain

Author and Maintainer


This software has been released under the GNU General Public License v3.