MLC@Home: Machine Learning Comprehension @ Home
As of October 2022, the MLC@Home BOINC project has completed its initial goals and is shutting down until there are new experiments to run. Please see here for more info, and "MLDS Datasets" to access the generated datasets.

MLC@Home is a distributed computing project dedicated to understanding and interpreting complex machine learning models, with an emphasis on neural networks. It uses the BOINC distributed computing platform.

Opening the Black Box
XKCD #1838

Neural Networks have fuelled a machine learning revolution over the past decade that has led to machines accomplishing amazingly complex tasks. However, these models are largely black boxes: we know they work, but they are so complex (up to hundreds of millions of parameters!) that we struggle to understand the limits of such systems. Yet understanding networks becomes extremely important as networks are deployed in safety critical fields, like medicine and autonomous vehicles. Models must be vetted for robustness against adversarial examples, biases need to be identified and compensated for, and boundaries for what the network will produce need to be identified.

What MLC@Home Does

MLC@Home provides an open, collaborative platform for researchers studying machine learning comprehension. It allows us to train thousands of networks in parallel, with tightly controlled inputs, hyperparameters, and network structures. We use this to gain insights into these complex models.

MLC@Home's initial project, the Machine Learning Dataset Generator (MLDS), will generate a large dataset of simple networks trained with both clean and adversarial data. To our knowledge, this is the first dataset of its kind. MLC@Home also welcomes project proposals from other researchers aligned with this research area. MLC@Home requests that all data generated by our supported projects be made available to the public, and that, where possible, any papers and analysis be made public as well.

How You Can Help

We ask for volunteers to donate some of their background computing time to help us continue our research. We use the same time-tested BOINC distributed computing infrastructure as SETI@home and Rosetta@home. Follow these steps to join:

  • Install BOINC [ download ]
  • Open BOINC Manager
  • Click "Tools"->"Add Project"
  • Select "MLC@Home" from the menu
  • Follow the prompts to create an account

You're done! BOINC takes over from there, running in the background doing science when your computer is idle. You can follow your progress on this project by choosing "BOINC Project Page" from the menu at the top of this page.

MLDS Live Status
Dataset 1: 50000/50000
ParityMachine 100 500 1000 5000 10000
EightBitMachine 100 500 1000 5000 10000
SimpleXORMachine 100 500 1000 5000 10000
SingleDirectMachine 100 500 1000 5000 10000
SingleInvertMachine 100 500 1000 5000 10000
Dataset 2: 50000/50000
ParityModified 100 500 1000 5000 10000
EightBitModified 100 500 1000 5000 10000
SimpleXORModified 100 500 1000 5000 10000
SingleDirectModified 100 500 1000 5000 10000
SingleInvertModified 100 500 1000 5000 10000
Dataset 3
Milestone 1 (100x100) : COMPLETE (10000/10000)
Milestone 2 (1000x100) : COMPLETE (100000/100000)
Milestone 3 (10000x100) : COMPLETE (1000000/1000000)

Dataset 4
Dataset NameDense

0% < 25% 25% - 75% >75% 100%
Last update: 2022-10-03 15:46:39.630718