The last two weeks have been very interesting. Testing of the implemented evaluation metrics is complete and the code is now ready for the pull request. The tests conducted on the implementations checked the accuracy and of the results they produced by comparing their results with those of their corresponding scikit-learn implementations. This was done … Continue reading The Seventh and Eighth weeks
The Sixth Week
A great amount of progress was made this week. The completed implementations of the Chi squared feature test, the contingency table, and the Silhouette coefficient were uploaded to the forked Github repository. With this, all but the implementation of the AUC test have been completed and are ready for testing. The goal for the next … Continue reading The Sixth Week
The Fifth Week
A good amount of progress was made this week. Prototypes for the implementations of all components required in the implementation of the chi squared feature selection test were created, namely the contingency table the algorithm to perform the test. Regarding the issues raised the previous week, both issues have been resolved. The information required by … Continue reading The Fifth Week
The Fourth Week
The goal for the week was to begin the implementation of the metrics designed in order of priority and importance. Each implementation was committed onto a branch of a Github repository that had been forked from the HPCC Systems ML_Core repository. The simple metrics such as Hamming Loss and F-Score were completed first, their implementations … Continue reading The Fourth Week
The Third Week
The objective for the third week was to create a set of proposals detailing the implementation of the various evaluation metrics, while adding a few metrics to the list to make it richer. The proposals for the metrics can be found here. They contain a brief summary of the metric, the parameters needed, their dependencies, … Continue reading The Third Week
The Second Week
The second week of the summer internship at HPCC systems went by without any issues. The goal for the week was to create a summary of the existing evaluation metrics that are available in the HPCC systems Machine Learning bundles, and to then come up with a list of evaluation metrics that could potentially be … Continue reading The Second Week
The First Week
The first week progressed smoothly and as planned. The goal was to understand the fundamentals of ECL, the programming language used to perform operations on an HPCC Systems cluster, and, upon completion of which, to acquaint myself with the Machine Learning bundles shipped with the platform and their use. With the guidance of my mentors, … Continue reading The First Week
Introduction
Hi! This is Suryanarayanan, and this blog is a journal of my thoughts, ups and downs, achievements, and everything in between, as I journey along through my summer internship at HPCC Systems. Follow along if you'd like to check in on my progress or learn of the solutions I've gathered to interesting problems tackled along … Continue reading Introduction