This is a library for handling Data in Java. It is inspired in part by NumPy and aims to provide similar functionality.
The Eclipse Dataset project is a fork of the Eclipse Dawnsci project that extracts Datasets and its associated Maths libraries.
- Familiar. Provide familiar functionality, especially to NumPy users.
- Robust. Has test suite and is used in production heavily at Diamond Light Source.
- No more passing double[]. IDataset provide a consistent object for basing APIs on with significantly improved clarity over using double arrays or similar.
- Optimized. Optimized for speed and getting better all the time.
- Scalable. Allows handling of data sets larger than available memory with "Lazy Datasets".
- Focus on your algorithms. By reusing this library it allows you to focus on your code.
Clone the examples repository and have a look. Browse over to the examples repository for more information.
The current Javadocs are published here http://jonahkichwacoders.github.io/org.eclipse.dataset/javadoc/
At the moment this is work in progress and essentially a proposal on the way forward. Join us for discussion https://dev.eclipse.org/mailman/listinfo/science-iwg
The code is distributed under the Eclipse Public License.