2.The LAG function is used to showdatafrom the precedingrowor the table. When lining up rows the datais partitioned by country nameand ordered by the data whn. That means that onlydatafrom Italy is considered. Modify the querytoshow confirmed for the day before.
SELECTname, DAY(whn), confirmed, LAG(confirmed, 1) OVER (PARTITIONBYnameORDERBY whn) FROM covid WHEREname = 'Italy' ANDMONTH(whn) = 3 ORDERBY whn
1 2 3 4 5 6 7 8
3.The number of confirmed case is cumulative - but we can use LAG torecover the numberofnew cases reported foreach day. Show the numberofnew cases foreachday, for Italy, for March.
SELECTname, DAY(whn), confirmed - LAG(confirmed, 1) OVER (PARTITIONBYnameORDERBY whn) FROM covid WHEREname = 'Italy' ANDMONTH(whn) = 3 ORDERBY whn
1 2 3 4 5 6 7 8 9
4.The data gathered are necessarily estimates and are inaccurate. However by taking a longer time span we can mitigate some of the effects. You can filter the data to view only Monday's figures WHERE WEEKDAY(whn) = 0. Show the numberofnew cases in Italy foreachweek - show Monday only.
SELECTname, DATE_FORMAT(whn,'%Y-%m-%d'), confirmed-LAG(confirmed, 1) OVER (PARTITIONBYnameORDERBY whn) FROM covid WHEREname = 'Italy' ANDWEEKDAY(whn) = 0 ORDERBY whn
1 2 3 4 5 6 7 8 9 10 11
5.You can JOIN a table using DATE arithmetic. This will give different results if data is missing. Show the numberofnew cases in Italy foreachweek - show Monday only. In the samplequery we JOIN this week tw withlastweek lw using the DATE_ADD function.
SELECT tw.name, DATE_FORMAT(tw.whn,'%Y-%m-%d'), tw.confirmed - lw.confirmed FROM covid tw LEFTJOIN covid lw ON DATE_ADD(lw.whn, INTERVAL1WEEK) = tw.whn AND tw.name=lw.name WHERE tw.name = 'Italy'andWEEKDAY(tw.whn) = 0 ORDERBY tw.whn
1 2 3 4 5 6 7 8 9 10 11
6.Include the ranking for the number of deaths in the table.
SELECT name, confirmed, RANK() OVER (ORDERBY confirmed DESC) rc, deaths, RANK() OVER (ORDERBY deaths DESC) rc1 FROM covid WHERE whn = '2020-04-20' ORDERBY confirmed DESC
1 2 3 4 5 6 7 8 9 10 11 12
7.The query shown includes a JOIN t the world table so we can access the total population of each country and calculate infection rates (in cases per 100,000). Show the infect rate ranking foreach country. Onlyinclude countries with a population ofatleast10 million.
SELECT world.name, ROUND(100000*confirmed/population,0)as infect_rate, RANK() OVER(ORDERBY infect_rate) FROM covid JOIN world ON covid.name=world.name WHERE whn = '2020-04-20'AND population > 10000000 ORDERBY population DESC
感觉答案好像出错了,不正常...
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
8.For each country that has had at last 1000 new cases in a single day, show the dateof the peak numberofnew cases.
select table1.name,DATE_FORMAT(table1.whn,'%Y-%m-%d'),table1.c1 from (select newcases.name,newcases.whn, rank() over(partitionbynameorderby c1 desc)as rk,newcases.c1 from (selectname,whn, (confirmed-lag(confirmed,1) over(partitionbynameorderby whn))as c1 from covid )as newcases )as table1 where table1.rk = 1and table1.c1 >= 1000 orderby table1.whn