Homework 6 (60 points)
Due Thursday, April 25th at 11:59pm (via blackboard)

AKA “Project” Milestone #2

In this assignment, we take the raw data we obtained in HW5 and build a data model for it.
The model can be anything you like (for example: SQL relationships, a class hierarchy,
Pandas dataframes, SQLAlchemy, etc. This list is not exhaustive!). You have the
freedom to interface with your data however you’d like, but keep in mind that regardless of
how simple you think the data is, your solution will be graded on how useful, extensible,
modular and robust it is. Better solutions get better scores!

You are to turn in your Python code for your project so far, including the code you wrote in
HW5 (i.e. this new code should integrate with the old code). You can turn in any number of
supporting files (libraries, modules, etc.) but you must follow the same format as before:

Name your script: LASTNAME_FIRSTNAME_hw6.py (you will LOSE points if you don’t do this!)

Your script should be modular in that it allows you to obtain the data from the scraper/API (as
in HW5) but also obtain it from local storage. How you implement this (text files, CSV,
cached webpages, SQL files, Feather serialized dataframes, etc.) is up to you. There should be a
--source=remote or --source=local command line parameter (remember the lecture on args and
kwargs!)
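
As a minimal sketch of the --source switch, the standard argparse module can restrict the parameter to the two allowed values and a small dispatch table can pick the matching loader. The function names load_remote and load_local, the sample record, and the choice of local as the default are assumptions made for illustration; your own HW5 code would replace the placeholders.

```python
import argparse


def parse_args(argv=None):
    """Parse the command line; --source selects where the data comes from."""
    parser = argparse.ArgumentParser(description="HW6 data model driver")
    parser.add_argument(
        "--source",
        choices=["remote", "local"],
        default="local",  # assumption: fall back to local so no network is needed
        help="remote = scraper/API as in HW5; local = data saved on disk",
    )
    return parser.parse_args(argv)


def load_remote():
    # Hypothetical placeholder: call your HW5 scraper/API code here.
    return [{"lat": 34.02, "lon": -118.28}]


def load_local():
    # Hypothetical placeholder: read your stored copy (CSV, SQL file, ...) here.
    return [{"lat": 34.02, "lon": -118.28}]


def load_data(source):
    """Dispatch to the loader that matches the --source value."""
    loaders = {"remote": load_remote, "local": load_local}
    return loaders[source]()


def main(argv=None):
    args = parse_args(argv)
    records = load_data(args.source)
    print(f"Loaded {len(records)} record(s) from the {args.source} source")
    return records


if __name__ == "__main__":
    main()
```

With this layout the script would be invoked as, for example, python LASTNAME_FIRSTNAME_hw6.py --source=remote. Because both loaders return the same structure, the rest of the program does not need to know where the data came from.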

When invoked, your Python script should grab the data (either locally or remotely), stick it into
your data model, and then retrieve it and manipulate it in some way. How you do this is up to
you; just imagine doing one of whatever computation you’ll end up doing for the final project.
For example, if your data sources were, say, lat/long combinations, a Google API and voting
records, you might grab the lat/long, ask the Google API for the closest city, and then get the
voting records for that city. You’d display a “result” (just one!). [You’ll save the “final”
result/conclusion for the last part of the project.]
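
One possible shape for this flow, sketched with the standard sqlite3 module: load the raw rows into related tables, then follow the relationships to produce a single result. The table names, column names, and every value below are made-up placeholders, not real coordinates-to-city lookups or voting data; any of the other modeling options would work just as well.

```python
import sqlite3

# Made-up placeholder rows standing in for whatever your HW5 code returned.
LOCATIONS = [(1, 34.02, -118.28), (2, 40.71, -74.01)]  # id, lat, lon
CITIES = [(1, "City A"), (2, "City B")]                # location_id, city name
VOTES = [("City A", 1200), ("City B", 3400)]           # city name, ballots cast


def build_model(conn):
    """Create three related tables and load the raw rows into them."""
    conn.executescript(
        """
        CREATE TABLE location (id INTEGER PRIMARY KEY, lat REAL, lon REAL);
        CREATE TABLE city (location_id INTEGER REFERENCES location(id), name TEXT);
        CREATE TABLE votes (city_name TEXT, ballots INTEGER);
        """
    )
    conn.executemany("INSERT INTO location VALUES (?, ?, ?)", LOCATIONS)
    conn.executemany("INSERT INTO city VALUES (?, ?)", CITIES)
    conn.executemany("INSERT INTO votes VALUES (?, ?)", VOTES)
    conn.commit()


def one_result(conn, location_id):
    """Follow location -> city -> votes and return a single joined row."""
    return conn.execute(
        """
        SELECT l.lat, l.lon, c.name, v.ballots
        FROM location AS l
        JOIN city AS c ON c.location_id = l.id
        JOIN votes AS v ON v.city_name = c.name
        WHERE l.id = ?
        """,
        (location_id,),
    ).fetchone()


conn = sqlite3.connect(":memory:")  # an on-disk file would serve as local storage
build_model(conn)
print(one_result(conn, 1))
```

The point of the sketch is the separation of steps: loading, storing in the model, and querying are independent functions, so a new data source or a different computation only touches one of them.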

In addition, you should turn in a plain text file named LASTNAME_FIRSTNAME_hw6.txt (NO
DOC, PDF, OR ANYTHING ELSE), that answers the following questions:
1. What are the strengths of your data modeling format?
2. What are the weaknesses? (Does your data model support sorting the information?
Re-ordering it? Obtaining only a certain subset of the information?)
3. How do you store your data on disk?
4. Let’s say you find another data source that relates to all 3 of your data sources (i.e. a
data source that relates to your existing data). How would you extend your model to
include this new data source? How would that change the interface?
5. How would you add a new attribute to your data? (For example, imagine you had a lat/long
column in a database. You might use that to access an API to get a city name. How
would you add city name to your data?)

The rubric for HW6 is as follows:

Python Code
  o Code is modular and robust: /20
  o Code displays a result: /5
  o Remote and local source command line parameter: /5
  o Code is poorly documented: -5
  o Runtime error: -5

Data Model
  o Question 1: /3
  o Question 2: /5
  o Question 3: /2
  o Question 4: /10
  o Question 5: /10
