SIT720 Machine Learning - Clustering and Dimensionality Reduction - Assignment Help

Get 25% Off Order New Solution
Added on: 2020-09-05 08:45:47
Order Code:
Question Task Id: 0

Connect with Assignment Expert Now




This assessment task is for student to apply skills for data clustering and dimensionality reduction. Students will be required to demonstrate ability in data representation, and competency in applying suitable clustering/dimensionality reduction techniques in a real-world scenario.


Students should insert Python code or text responses into the cell followed by the question into the supplied ipynb (Jupyter Notebook) file. For answers regarding discussion or explanation, maximum five sentences are suggested. Rename this Jupyter notebook file appending your student ID. For example, for student ID 1234, the submitted file name should be A2_1234.ipynb. Insert your student ID and name in the appropriate cell inside that file.

Part-1: Clustering (15 marks) Dataset and the ipynb files are provided in zip format (available in ‘Assessment 2 – T2 2020 - Dataset’ link) in the assessment section (Assessment->Assessment 2) of the unit site.

1. Download the attached clustering.csv file. Read the file and separate the class and feature matrix. (2 marks)

2. Determine the number of clusters from the dataset. Is this same as the actual number of classes in the dataset? (1 marks)

3. Perform K-Means clustering on the complete dataset and report purity score. (2 marks)

4. There are several distance metrics for K-Means such as Euclidean, Squared Euclidean, Manhattan, Chebyshev, Minkowski. [Hints: See the pyclustering library for python.]

 Your job is to compare the purity score of k-means clustering for different distance metrics. (5 marks)

 Select the best distance metric and explain why this distance metric is best for the given dataset. (2 marks)


This Machine Learning Assignment has been solved by our Machine Learning Experts at TVAssignmentHelp. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics.Our Experts are well trained to follow all marking rubrics & referencing style.

Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

  • Uploaded By : Pearl
  • Posted on : September 05th, 2018
  • Downloads : 0

Order New Solution

Review Question

Please enter your email

Can't find what you're looking for?

Why are you waiting for Assignment Deadline?
Book Assignment Today & Get 500 Words Free
Order Now TnC Apply

Choose a Plan


80 USD
  • All in Gold, plus:
  • 30-minute live one-to-one session with an expert
    • Understanding Marking Rubric
    • Understanding task requirements
    • Structuring & Formatting
    • Referencing & Citing


30 50 USD
  • Get the Full Used Solution
    (Solution is already submitted and 100% plagiarised.
    Can only be used for reference purposes)
Save 33%


20 USD
  • Journals
  • Peer-Reviewed Articles
  • Books
  • Various other Data Sources – ProQuest, Informit, Scopus, Academic Search Complete, EBSCO, Exerpta Medica Database, and more