Apache Mahout 0.6 released
Today the Apache Software Foundation (ASF) released version 0.6 of the machine learning framework Mahout. The release brings many bug fixes and improvements; in total, 182 JIRA issues were closed. Among them are the bug reports [MAHOUT-807] - Wrong prefixes in PrefixAdditionFilter and [MAHOUT-809] - Bad bug in ChunkedWriter by Florian Bausch.
Also of interest are the reports [MAHOUT-890] - Performance issue in FPGrowth and [MAHOUT-921] - FPG uses a lot of boxed primitives - this patch eliminates a bunch of List<Integer>, which fix several problems, some of them severe, in the implemented FP-Growth algorithm.
Here is the announcement from the mailing list:
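The boxed-primitive issue named in MAHOUT-921 can be illustrated in general terms. The following is only a minimal sketch and not Mahout's actual code: the class `BoxedVsPrimitive` and its methods are hypothetical names chosen for illustration. It contrasts a `List<Integer>`, where every element is a separate `Integer` object that has to be unboxed on each read, with a plain `int[]`, which stores the values directly.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration, not taken from the Mahout code base.
public class BoxedVsPrimitive {

    // Boxed variant: each element is an Integer object; the loop unboxes on every read.
    public static long sumBoxed(List<Integer> supports) {
        long total = 0;
        for (Integer s : supports) {
            total += s; // implicit unboxing via Integer.intValue()
        }
        return total;
    }

    // Primitive variant: values are stored contiguously in the array, no wrapper objects.
    public static long sumPrimitive(int[] supports, int size) {
        long total = 0;
        for (int i = 0; i < size; i++) {
            total += supports[i];
        }
        return total;
    }

    public static void main(String[] args) {
        int n = 1_000;
        List<Integer> boxed = new ArrayList<>(n);
        int[] primitive = new int[n];
        for (int i = 0; i < n; i++) {
            boxed.add(i);      // autoboxing: may allocate an Integer per element
            primitive[i] = i;  // plain store into the array
        }
        // Both variants compute the same result; they differ in memory use and indirection.
        System.out.println(sumBoxed(boxed));
        System.out.println(sumPrimitive(primitive, n));
    }
}
```

Both methods return the same sum. The difference lies in allocation and pointer chasing, which tends to matter in inner loops that run over many item lists, as frequent-pattern mining typically does.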
Apache Mahout has reached version 0.6. All developers are encouraged to begin using version 0.6, as much has changed and will continue to do so as we march inexorably toward a 1.0 release. Highlights of 0.6 include:
- Improved Decision Tree performance and added support for regression problems
- New LDA implementation using Collapsed Variational Bayes 0th Derivative Approximation
- Reduced runtime of LanczosSolver tests
- K-Trusses, Top-Down and Bottom-Up clustering, Random Walk with Restarts implementation
- Reduced runtime of dot product between vectors
- Added MongoDB and Cassandra DataModel support
- Increased efficiency of parallel ALS matrix factorization
- SSVD enhancements
- Performance improvements in RowSimilarityJob, TransposeJob
- Added numerous clustering display examples
- Many bug fixes, refactorings, and other small improvements
Changes in 0.6 are detailed in the release notes (https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310751&version=12316364).
Downloads of all releases available from Apache mirrors. (http://www.apache.org/dyn/closer.cgi/mahout/)
Enjoy!
Regards, Shannon
Links:
- Release notes
- Download Links
- PFP: Parallel FP-Growth for Query Recommendation, H. Li, Y. Wang, D. Zhang, M. Zhang, E. Chang
This article was published on 02/06/2012 11:05 p.m. and is stored in the category Development.
It is tagged with Apache, Artificial Intelligence, Cluster, Hadoop, Java, Linux, Mahout.