Difference between revisions of "HWsql"

Revision as of 18:25, 11 March 2020

See also the lecture

SQL can store the results from a query directly in a table, but in this task you should instead read each row of the SELECT query in Python and store it by running an INSERT command from Python.
Also, do not forget to create the new table in the database with appropriate column names and types. You can execute the CREATE TABLE command from Python.
The cursor from the SELECT query is needed while you iterate over the results. Therefore, create two cursors: one for reading the database and one for writing.
If you change your database during debugging, you can start over by running the command for creating the database above
Store the script as taskC.py.
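
The two-cursor pattern described above can be sketched as follows. This is only a minimal illustration on toy data, not the expected solution: the episodes table, its columns, the column names chosen for seasons, and the helper name build_seasons are all assumptions made for this example, and an in-memory database stands in for series.db.

```python
import sqlite3


def build_seasons(conn):
    """Read a SELECT row by row and store each row with a separate INSERT."""
    reader = conn.cursor()  # stays open while we iterate over the SELECT results
    writer = conn.cursor()  # used for CREATE TABLE and INSERT

    # Dropping the table first lets the script be rerun during debugging
    # (a convenience assumed here, not required by the task).
    writer.execute("DROP TABLE IF EXISTS seasons")
    writer.execute(
        "CREATE TABLE seasons ("
        "series_id INTEGER, season INTEGER, "
        "episode_count INTEGER, avg_rating REAL)"
    )

    # Hypothetical source query: one row per (series, season).
    reader.execute(
        "SELECT series_id, season, COUNT(*), AVG(rating) "
        "FROM episodes GROUP BY series_id, season"
    )
    for row in reader:
        writer.execute("INSERT INTO seasons VALUES (?, ?, ?, ?)", row)

    conn.commit()


# Toy data standing in for the real database (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE episodes (series_id INTEGER, season INTEGER, rating REAL)"
)
conn.executemany(
    "INSERT INTO episodes VALUES (?, ?, ?)",
    [(1, 1, 8.0), (1, 1, 9.0), (1, 2, 7.0)],
)
build_seasons(conn)
rows = conn.execute(
    "SELECT * FROM seasons ORDER BY series_id, season"
).fetchall()
print(rows)
```

The INSERT uses ? placeholders so that each row read from the first cursor is passed to the second cursor as parameters rather than pasted into the SQL string.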

To check that your table was created, you can run the command

sqlite3 series.db "SELECT * FROM seasons;"

This will print many lines, including this one: "5|1|8|9.3" which is for season 1 of series 5 (True Detective).

Submit your script taskC.py and the modified database series.db.

Task D (SQL, optionally Python)

For each pair of consecutive seasons within each series, compute how much the average rating has increased or decreased.

For example, in the Sherlock series, season 1 had rating 8.825 and season 2 rating 9.26666666666667, and thus the difference in ratings is 0.44166666666667
Print a table containing series name, season number x, average rating in season x and average rating in season x+1
The table should be ordered by the difference between the last two columns, i.e. from seasons with the highest increase to seasons with the highest drop.
One option is to run a query in SQL in which you join the seasons table from task C with itself and select rows that belong to the same series and successive seasons
You can also read the rows of the seasons table in Python, combine information from rows for successive seasons of the same series and create the final report by sorting
Submit your code as taskD.py or taskD.sql and the resulting table as taskD.txt
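
The self-join option mentioned above might look roughly like the sketch below. It runs on a small in-memory database filled with the four ratings quoted in this task; the table layouts, column names (series.id, series.title, seasons.series_id, seasons.season, seasons.avg_rating) and the series ids are assumptions for illustration and may differ from the real series.db.

```python
import sqlite3

# Toy database with a hypothetical schema; only the ratings come from the task text.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE series (id INTEGER, title TEXT);
    CREATE TABLE seasons (series_id INTEGER, season INTEGER, avg_rating REAL);
    INSERT INTO series VALUES (1, 'Sherlock');
    INSERT INTO series VALUES (2, 'Breaking Bad');
    INSERT INTO seasons VALUES (1, 1, 8.825);
    INSERT INTO seasons VALUES (1, 2, 9.26666666666667);
    INSERT INTO seasons VALUES (2, 4, 9.0);
    INSERT INTO seasons VALUES (2, 5, 9.375);
    """
)

# Join seasons with itself: alias a is season x, alias b is season x+1
# of the same series. Sort by the rating change, largest increase first.
query = """
    SELECT s.title, a.season, a.avg_rating, b.avg_rating
    FROM seasons AS a
    JOIN seasons AS b
      ON a.series_id = b.series_id AND b.season = a.season + 1
    JOIN series AS s
      ON s.id = a.series_id
    ORDER BY b.avg_rating - a.avg_rating DESC
"""
report = conn.execute(query).fetchall()
for title, season, rating_x, rating_next in report:
    print(title, season, rating_x, rating_next)
```

The join condition b.season = a.season + 1 pairs each season only with its direct successor, so the last season of a series produces no row.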

The output should start like this (the formatting may differ):

series      season x    rating for x  rating for x+1  
----------  ----------  ------------  ----------------
Sherlock    1           8.825         9.26666666666667
Breaking B  4           9.0           9.375

When using SQL without Python, include the following two lines in taskD.sql

.mode column
.headers on

and run your query as sqlite3 series.db < taskD.sql > taskD.txt



