合肥生活安徽新闻合肥交通合肥房产生活服务合肥教育合肥招聘合肥旅游文化艺术合肥美食合肥地图合肥社保合肥医院企业服务合肥法律

代做CE 314、代写Python/Java编程

时间:2024-01-05  来源:合肥网hfw.cc  作者:hfw.cc 我要纠错



CE 314/887 Assignment 2
Text classification
December 2023
Deadline: Please follow deadline on FASER

Build a text classifier on the IMDB sentiment classification dataset, you can use any classification method, but you must training your model on the first 40000 instances and testing your model on the last 10000 instances. The IMDB dataset will be uploaded on the moodle page for you to download.

Your code should include:

1: Read the file, incorporate the instances into the training set and testing set. 

2: Pre-processing the text, you can choose whether you need stemming, removing stop words, removing non-alphabetical words. (Not all classification models need this step, it is OK if you think your model can perform better without this step, and you can give some justification in the report.)

3: Analysing the feature of the training set, report the linguistic features of the training dataset.

4: Build a text classification model, train your model on the training set and test your model on the test set.

5: Summarize the performance of your model (You can gain additional marks if you have some graph visualization).

6: (Optional) You can speculate how you can improve your works based on your proposed model.

After you build such a model and test on the test set, you should write a report (no longer than three pages in A4, with Arial 11 fonts) to summarize your work.  

(You can use the existing algorithms on github or kaggle, but you must not directly copy and paste their code!
However, you are not allowed to use the Naïve Bayes algorithm and VADER classifier, which practiced in Lab 4)
 
Suggestion: some bonus points:

Have necessary comments on your code
Have proper reference on your report 
Have graph visualization on your report 
Investigate more evaluation methods, like not only show the P R F score, but also run multiple times and show the standard derivation on P R F (I am sure you can find more evaluation methods.)  
Write your report like a mini-conference paper (you can learn from this paper:
请加QQ:99515681 或邮箱:99515681@qq.com   WX:codehelp

扫一扫在手机打开当前页
  • 上一篇:ENVS363代做、R设计编程代写
  • 下一篇:代做COMP2209、代写Java,Python程序
  • 无相关信息
    合肥生活资讯

    合肥图文信息
    流体仿真外包多少钱_专业CFD分析代做_友商科技CAE仿真
    流体仿真外包多少钱_专业CFD分析代做_友商科
    CAE仿真分析代做公司 CFD流体仿真服务 管路流场仿真外包
    CAE仿真分析代做公司 CFD流体仿真服务 管路
    流体CFD仿真分析_代做咨询服务_Fluent 仿真技术服务
    流体CFD仿真分析_代做咨询服务_Fluent 仿真
    结构仿真分析服务_CAE代做咨询外包_刚强度疲劳振动
    结构仿真分析服务_CAE代做咨询外包_刚强度疲
    流体cfd仿真分析服务 7类仿真分析代做服务40个行业
    流体cfd仿真分析服务 7类仿真分析代做服务4
    超全面的拼多多电商运营技巧,多多开团助手,多多出评软件徽y1698861
    超全面的拼多多电商运营技巧,多多开团助手
    CAE有限元仿真分析团队,2026仿真代做咨询服务平台
    CAE有限元仿真分析团队,2026仿真代做咨询服
    钉钉签到打卡位置修改神器,2026怎么修改定位在范围内
    钉钉签到打卡位置修改神器,2026怎么修改定
  • 短信验证码 宠物饲养 十大卫浴品牌排行 suno 豆包网页版入口 wps 目录网 排行网

    关于我们 | 打赏支持 | 广告服务 | 联系我们 | 网站地图 | 免责声明 | 帮助中心 | 友情链接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥网 版权所有
    ICP备06013414号-3 公安备 42010502001045