﻿<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article
    xmlns:mml="http://www.w3.org/1998/Math/MathML"
    xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="mini-review">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IJMEBAC</journal-id>
      <journal-title-group>
        <journal-title>International Journal of Mathematical, Engineering, Biological and Applied Computing</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2832-5273</issn>
      <issn pub-type="ppub"></issn>
      <publisher>
        <publisher-name>Science Publications</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.31586/ijmebac.2022.340</article-id>
      <article-id pub-id-type="publisher-id">IJMEBAC-340</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Mini Review</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>
          Movie Recommendation System using an Item-based Collaborative Filtering
        </article-title>
      </title-group>
      <contrib-group>
<contrib contrib-type="author">
<name>
<surname>Marappan</surname>
<given-names>Raja</given-names>
</name>
<xref rid="af1" ref-type="aff">1</xref>
<xref rid="cr1" ref-type="corresp">*</xref>
</contrib>
      </contrib-group>
<aff id="af1"><label>1</label>School of Computing, SASTRA Deemed University, Thanjavur, India</aff>
<author-notes>
<corresp id="c1">
<label>*</label>Corresponding author at: School of Computing, SASTRA Deemed University, Thanjavur, India
</corresp>
</author-notes>
      <pub-date pub-type="epub">
        <day>21</day>
        <month>06</month>
        <year>2022</year>
      </pub-date>
      <volume>1</volume>
      <issue>1</issue>
      <history>
        <date date-type="received">
          <day>21</day>
          <month>06</month>
          <year>2022</year>
        </date>
        <date date-type="rev-recd">
          <day>21</day>
          <month>06</month>
          <year>2022</year>
        </date>
        <date date-type="accepted">
          <day>21</day>
          <month>06</month>
          <year>2022</year>
        </date>
        <date date-type="pub">
          <day>21</day>
          <month>06</month>
          <year>2022</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>&#xa9; Copyright 2022 by authors and Trend Research Publishing Inc. </copyright-statement>
        <copyright-year>2022</copyright-year>
        <license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
          <license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p>
        </license>
      </permissions>
      <abstract>
        Everyone likes movies irrespective of color, gender, age, location, and race. The most important thing is how the users are getting our unique combinations of choices concerning the preferences of the movies. This article focuses on the creation of a movie recommendation system using item-based collaborative filtering.
      </abstract>
      <kwd-group>
        <kwd-group><kwd>Movie Recommendation</kwd>
<kwd>Recommendation System</kwd>
<kwd>Movies Preferences</kwd>
<kwd>Collaborative Filtering</kwd>
</kwd-group>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
<title>Introduction</title><p>The recommendation system is a program that filters the information to predict the preference or rating of the item. In the case of movies, the system filters and predicts the movies that the users would prefer based on their preferences [
<xref ref-type="bibr" rid="R1">1</xref>,<xref ref-type="bibr" rid="R2">2</xref>].</p>
</sec><sec id="sec2">
<title>Movie Recommendation System</title><p>The movie recommendation system recommends the top movies using item-based collaborative filtering (CF) [
<xref ref-type="bibr" rid="R1">1</xref>,<xref ref-type="bibr" rid="R2">2</xref>]. The dataset applied is the Movielens small dataset as shown inTable <xref ref-type="table" rid="tab1">1</xref>. The movie dataset has movie id and genre columns. The rating dataset has the columns: user id, movie id, and rating as shown inTable <xref ref-type="table" rid="tab2">2</xref> [
<xref ref-type="bibr" rid="R3">3</xref>,<xref ref-type="bibr" rid="R4">4</xref>,<xref ref-type="bibr" rid="R5">5</xref>,<xref ref-type="bibr" rid="R6">6</xref>,<xref ref-type="bibr" rid="R7">7</xref>].</p>
<p>The steps in the proposed model using Python are as follows:</p>
<p>Import the required libraries.</p>
<p>Specify the CSV file path and import the dataset.</p>
<p>Search the files using the command: dataframe.head()<bold> </bold>to print the dataset rows. </p>
<p>Construct a new data frame with user id and movie id columns.</p>
<p>Remove the movies with minimal ratings. Assume that at least 15 votes per movie and the user should vote for at least 50 movies.</p>
<p>Apply the function csr_matrix to reduce the sparsity.</p>
<p>Apply the KNN method to calculate the cosine similarity measure.</p>
<p>Find similar movies and sort them out.</p>
<p>Identify the top movies.</p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<table-wrap id="tab1">
<label>Table 1</label>
<caption>
<p>Movielens datasets &#x02013; ML-100K and ML-1M</p>
</caption>
<table> <tr>  <td>  <p> </p>  </td>  <td>  <p><b >ML-100K</b></p>  </td>  <td>  <p><b >ML-1M</b></p>  </td> </tr> <tr>  <td>  <p>Number of users</p>  </td>  <td>  <p>943</p>  </td>  <td>  <p>6,040</p>  </td> </tr> <tr>  <td>  <p>Number of movies</p>  </td>  <td>  <p>1,682</p>  </td>  <td>  <p>3,952</p>  </td> </tr> <tr>  <td>  <p>Number of ratings</p>  </td>  <td>  <p>10,0000</p>  </td>  <td>  <p>1,000,209</p>  </td> </tr> <tr>  <td>  <p>Number of all genres</p>  </td>  <td>  <p>19</p>  </td>  <td>  <p>18</p>  </td> </tr> <tr>  <td>  <p>Average number of  genres</p>  </td>  <td>  <p>1.7</p>  </td>  <td>  <p>1.6</p>  </td> </tr> <tr>  <td>  <p>Rating scales</p>  </td>  <td>  <p>1-5</p>  </td>  <td>  <p>1-5</p>  </td> </tr></table>
</table-wrap><table-wrap id="tab2">
<label>Table 2</label>
<caption>
<p>Rating dataset</p>
</caption>
<table> <tr>  <td>  <p> </p>  </td>  <td>  <p><b >userId</b></p>  </td>  <td>  <p><b >movieId</b></p>  </td>  <td>  <p><b >rating</b></p>  </td>  <td>  <p><b >timestamp</b></p>  </td> </tr> <tr>  <td>  <p>0</p>  </td>  <td>  <p>1</p>  </td>  <td>  <p>110</p>  </td>  <td>  <p>1.0</p>  </td>  <td>  <p>1425941529</p>  </td> </tr> <tr>  <td>  <p>1</p>  </td>  <td>  <p>1</p>  </td>  <td>  <p>147</p>  </td>  <td>  <p>4.5</p>  </td>  <td>  <p>1425942435</p>  </td> </tr> <tr>  <td>  <p>2</p>  </td>  <td>  <p>1</p>  </td>  <td>  <p>858</p>  </td>  <td>  <p>5.0</p>  </td>  <td>  <p>1425941523</p>  </td> </tr> <tr>  <td>  <p>3</p>  </td>  <td>  <p>1</p>  </td>  <td>  <p>1221</p>  </td>  <td>  <p>5.0</p>  </td>  <td>  <p>1425941546</p>  </td> </tr> <tr>  <td>  <p>4</p>  </td>  <td>  <p>1</p>  </td>  <td>  <p>1246</p>  </td>  <td>  <p>5.0</p>  </td>  <td>  <p>1425941556</p>  </td> </tr></table>
</table-wrap></sec><sec id="sec3">
<title>Conclusions &#x00026;#x26; Future Work</title><p>This article models the creation of a movie recommendation system using an item-based CF. in the future other soft computing approaches can be integrated to create a better recommendation of movies [
<xref ref-type="bibr" rid="R8">8</xref>,<xref ref-type="bibr" rid="R9">9</xref>,<xref ref-type="bibr" rid="R10">10</xref>,<xref ref-type="bibr" rid="R11">11</xref>,<xref ref-type="bibr" rid="R12">12</xref>].</p>
<p></p>
</sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      
<ref id="R1">
<label>[1]</label>
<mixed-citation publication-type="other">Dataset: https://grouplens.org/datasets/movielens/
</mixed-citation>
</ref>
<ref id="R2">
<label>[2]</label>
<mixed-citation publication-type="other">Dataset: https://www.themoviedb.org/documentation/api
</mixed-citation>
</ref>
<ref id="R3">
<label>[3]</label>
<mixed-citation publication-type="other">Harper, F.M.; Konstan, J.A. The MovieLens datasets: History and context. ACM Trans. Interact. Intell. Syst. 2015, 5, 19:1-19:19, doi: 10. 1145/2827872.
</mixed-citation>
</ref>
<ref id="R4">
<label>[4]</label>
<mixed-citation publication-type="other">Marappan, R., &#x00026; Bhaskaran, S. (2022). Movie Recommendation System Modeling Using Machine Learning. International Journal of Mathematical, Engineering, Biological and Applied Computing 2022, 1(1), 12-16. DOI: 10.31586/ijmebac.2022.291
</mixed-citation>
</ref>
<ref id="R5">
<label>[5]</label>
<mixed-citation publication-type="other">Raja Marappan, S Bhaskaran. Datasets Finders and Best Public Datasets for Machine Learning and Data Science Applications. COJ Rob Artificial Intel. 2(1). COJRA. 000530. 2022.
</mixed-citation>
</ref>
<ref id="R6">
<label>[6]</label>
<mixed-citation publication-type="other">Bhaskaran, S.; Marappan, R.; Santhi, B. Design and Comparative Analysis of New Personalized Recommender Algorithms with Specific Features for Large Scale Datasets. Mathematics 2020, 8, 1106. https://doi.org/10.3390/math8071106
</mixed-citation>
</ref>
<ref id="R7">
<label>[7]</label>
<mixed-citation publication-type="other">Bhaskaran, S.; Marappan, R.; Santhi, B. Design and Analysis of a Cluster-Based Intelligent Hybrid Recommendation System for E-Learning Applications. Mathematics 2021, 9, 197. https://doi.org/10.3390/math9020197
</mixed-citation>
</ref>
<ref id="R8">
<label>[8]</label>
<mixed-citation publication-type="other">Marappan, R., Sethumadhavan, G. Solving Graph Coloring Problem Using Divide and Conquer-Based Turbulent Particle Swarm Optimization. Arab J Sci Eng (2021). https://doi.org/10.1007/s13369-021-06323-x
</mixed-citation>
</ref>
<ref id="R9">
<label>[9]</label>
<mixed-citation publication-type="other">Marappan, R.; Sethumadhavan, G. Complexity Analysis and Stochastic Convergence of Some Well-known Evolutionary Operators for Solving Graph Coloring Problem. Mathematics 2020, 8, 303. https://doi.org/10.3390/math8030303
</mixed-citation>
</ref>
<ref id="R10">
<label>[10]</label>
<mixed-citation publication-type="other">Marappan, R., Sethumadhavan, G. Solution to Graph Coloring Using Genetic and Tabu Search Procedures. Arab J Sci Eng 43, 525-542 (2018). https://doi.org/10.1007/s13369-017-2686-9
</mixed-citation>
</ref>
<ref id="R11">
<label>[11]</label>
<mixed-citation publication-type="other">R. Marappan and G. Sethumadhavan, "A New Genetic Algorithm for Graph Coloring," 2013 Fifth International Conference on Computational Intelligence, Modelling and Simulation, 2013, pp. 49-54, doi: 10.1109/CIMSim.2013.17.
</mixed-citation>
</ref>
<ref id="R12">
<label>[12]</label>
<mixed-citation publication-type="other">G. Sethumadhavan and R. Marappan, "A genetic algorithm for graph coloring using single parent conflict gene crossover and mutation with conflict gene removal procedure," 2013 IEEE International Conference on Computational Intelligence and Computing Research, 2013, pp. 1-6, doi: 10.1109/ICCIC.2013.6724190.
</mixed-citation>
</ref>
    </ref-list>
  </back>
</article>