User:Mill 1/Project Chaining back the Years/Statistics
The statistics cover the period 1990-2005 and are organized in separate categories.
General statistics and facts
[edit]Information on the dpm's per 6 November 2024:
- Total number of entries: 42,765
- Total number of references: 27,268
- Overall reference density[1]: 63.76% (27,268/42,765)
- Total size of combined text (approx.): 10.7 megabytes
- Which translates to approx. 1,850 pages (A4)[2]
- Average number of entries per death day: 7.32 (42,765/5,843)
- Average number of references per death day: 4.6666 (27,268/5,843)
- Average number of entries per dpm: 222.73 (42,765/(12 * 16))
- Average number of references per dpm: 142.02 (27,268/(12 * 16))
- Death day with the most entries (74): Deaths in September 2001#11 (details)
- Dpm with the most entries (310): Deaths in December 2005
- Dpm with the most references (229): Deaths in December 1995
- Dpm with highest reference density[1] (82.33%): Deaths in January 1999
- Minimum reference density regarding all processed days: 30%
- Month with the most deaths: December (3,980) (2nd: January (3,901)) (details)
- Month with the least deaths: June (3,326) (details)
- Total number of views for all dpm's per year (2023): 846,402 (details)
Data on entries, references and page sizes
[edit]This section shows the development of the number of entries, references and page sizes over time.[3] To do that I had to establish a baseline first. This is the status of the dpm's before I started work on them. As explained work was done in specific rounds but not all dpm's were handled in each round (which is shown here). As a consequence data on Round 1 is skipped. Data regarding Round 2 does not exist for the years 1990 – 1995. In those cases the baseline counts are used. Regarding the years 1993 and 1994 no dpm's or dpy's existed to use as a baseline. Therefore the corresponding Year-page acted as the baseline.[4]
Charts & tables
[edit]The charts in this section visualize the results over time. Beneath the chart a table states its data. To consult the underlying data of a particular year, click on the corresponding title in the table.
Number of entries
[edit]As you can see in the chart below the number of (notable) entries increased considerably over time. In total I doubled the overall number of entries by 21,174 to 42,765. This is including the entries I removed during the course of the project. Strangely the inital dpm's of the year 1990 contained a lot of entries (and zero references). This was also true for the first five months of 1991. It turned out that a particular wikipedian in an unprecedented editing frenzy had been adding entries to dpm's between January 1990 and May 1993. And he really went to town regarding the first 17 dpm's adding entries indiscriminately and unreferenced. When reprocessing them I filtered out more than a third of unworthy entries which improved the lists considerably.
Year | 1990 | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Baseline | 4,121 | 2,513 | 1,597 | 247 | 239 | 1,697 | 1,033 | 907 | 420 | 536 | 885 | 1,394 | 958 | 1,066 | 1,549 | 2,429 | 21,591 |
After Round 2 | 4,121 | 2,513 | 1,597 | 247 | 239 | 1,697 | 2,191 | 2,233 | 2,143 | 2,166 | 2,227 | 2,789 | 2,487 | 2,350 | 2,139 | 2,701 | 33,840 |
Current | 2,556 | 2,368 | 2,405 | 2,470 | 2,401 | 2,937 | 2,508 | 2,795 | 2,751 | 2,796 | 2,733 | 2,938 | 2,764 | 2,689 | 2,726 | 2,928 | 42,765 |
Number of references
[edit]Citing the date (and cause) of death of the listed dead did not have a big priority to the Wikipedians before me who concerned themselves with the death lists. Looking at the for the nineties, excluding 1995 (which was a special case) only 450 references existed on a total of 11,613 entries. This meant an abysmal reference density of 3.9%. And the other months didn't fare much better, leading to a total number of refs of 4,561 (on 21,591 entries) concerning the whole period. So it's not a big surprise that Round 2 would yield big rewards. As explained not all dpm's were handled in Round 2; only the years 1997-2005 and 8 months of 1996; 116 dpm's in total.[5]
Generated references
[edit]The use of the references tool led to a big increase of NYTimes obituary citations. In a single edit many references in a dpm were added and updated. For instance take a look at the edit regarding Deaths in December 2001: afterwards the tool had added 29 NYTimes citations and had replaced 14 existing references with the more reliable and future proof NYTimes obituaries. Also, the edit caused a netto size increase of 10,934 bytes. All dpm's benefited this way. You can find a list of dpm's whose size increase most because of it here.
Olympedia
Year | 1990 | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Baseline | 22 | 3 | 49 | 4 | 13 | 1,503 | 5 | 15 | 4 | 335 | 592 | 873 | 29 | 40 | 457 | 617 | 4,561 |
After Round 2 | 22 | 3 | 49 | 4 | 13 | 1,503 | 1,075 | 898 | 845 | 909 | 860 | 1,109 | 1,369 | 1,187 | 1,034 | 917 | 11,797 |
Current | 1,577 | 1,408 | 1,406 | 1,515 | 1,429 | 2,164 | 1,464 | 1,737 | 1,739 | 1,772 | 1,729 | 1,860 | 1,884 | 1,798 | 1,817 | 1,969 | 27,268 |
Reference density
[edit]TODO conclusies
Year | 1990 | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Baseline | 0.5% | 0.1% | 3.1% | 1.6% | 5.4% | 88.6% | 0.5% | 1.7% | 1.0% | 62.5% | 66.9% | 62.6% | 3.0% | 3.8% | 29.5% | 25.4% | 22.3% |
After Round 2 | 0.5% | 0.1% | 3.1% | 1.6% | 5.4% | 88.6% | 49.1% | 40.2% | 39.4% | 42.0% | 38.6% | 39.8% | 55.0% | 50.5% | 48.3% | 34.0% | 33.5% |
Current | 61.7% | 59.5% | 58.5% | 61.3% | 59.5% | 73.7% | 58.4% | 62.1% | 63.2% | 63.4% | 63.3% | 63.3% | 68.2% | 66.9% | 66.7% | 67.2% | 63.5% |
Article sizes
[edit]The increase in the size of the dpm articles perhaps is the area where most progress was realised. About half of the content is currently represented by (generated) citations.
Year | 1990 | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Baseline | 278,197 | 183,945 | 139,458 | 22,176 | 21,744 | 510,480 | 77,603 | 68,818 | 33,106 | 112,690 | 216,886 | 334,912 | 73,896 | 90,287 | 276,163 | 385,407 | 2,825,768 |
After Round 2 | 278,197 | 183,945 | 139,458 | 22,176 | 21,744 | 510,480 | 463,766 | 438,279 | 411,236 | 428,967 | 428,277 | 548,770 | 616,674 | 560,766 | 494,002 | 513,721 | 6,060,458 |
Current | 605,208 | 546,099 | 553,140 | 588,934 | 569,257 | 787,423 | 585,610 | 683,271 | 679,157 | 689,906 | 679,035 | 761,645 | 749,906 | 721,308 | 723,522 | 778,297 | 10,701,718 |
Pageviews
[edit]The total number of views for all dpm's in 2023 was 846,402. This translates to 4,408 views per page per year, 367 views per dpm per month and about 12 views per dpm daily.
Number of pageviews per dpm
[edit]The table below shows the number of pageviews stated per dpm. The nineties seem to be less popular than the zero's. Also, the month of January is viewed considerable more than the other months for some reason (a new year?). The first month of 1990 and 2000[6] especially generate more interest. This is also true for the last month of the millenium.
Spikes
[edit]For different reasons other dpm's also have significantly more pageviews:
- A celebrity died in that month:[7]
- November 1991 (death of Freddie Mercury on the 24th)[8]
- April 1994 (death of Kurt Cobain on the 5th)[9]
- March 1995 (death of Selena on the 31st)[10]
- September 1996 (death of Tupac Shakur on the 13th)[11]
- March 1997 (death of The Notorious B.I.G. on the 9th)[12]
- August 1997 (death of Princess Diana on the 31st)[13]
- August 2001 (death of Aaliyah on the 25th)[14]
- Because of a specific incident:
- Or a certain dpm was 20 or 25 years ago. Notice the large views spike regarding the corresponding month:
- January 2003 (20 years since then)[17][18]
- August 1998 (25 years since then)[19]
- September 1997 (25 years since September 2022)[20][21]
Number of pageviews per month in 2023. Total 846,402: | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Day | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
1990 | 8,889 | 3,594 | 3,240 | 3,019 | 3,371 | 3,193 | 2,657 | 2,569 | 2,459 | 2,484 | 2,272 | 2,814 |
1991 | 4,134 | 2,603 | 2,544 | 2,603 | 2,511 | 2,672 | 2,473 | 2,408 | 2,598 | 2,342 | 3,303 | 3,843 |
1992 | 4,175 | 2,585 | 2,538 | 2,594 | 2,441 | 2,362 | 2,319 | 2,355 | 2,389 | 2,451 | 2,393 | 2,780 |
1993 | 5,241 | 3,122 | 3,311 | 3,401 | 2,862 | 3,139 | 3,270 | 3,157 | 2,905 | 3,115 | 2,781 | 3,389 |
1994 | 5,328 | 3,040 | 3,262 | 4,602 | 3,745 | 3,064 | 2,990 | 2,519 | 2,771 | 2,700 | 3,147 | 3,046 |
1995 | 4,844 | 3,129 | 4,113 | 3,206 | 3,032 | 3,249 | 3,139 | 3,258 | 2,981 | 3,114 | 2,942 | 3,705 |
1996 | 4,740 | 3,337 | 2,941 | 2,717 | 2,941 | 3,014 | 3,061 | 2,930 | 4,464 | 3,124 | 2,873 | 3,137 |
1997 | 5,166 | 3,191 | 3,946 | 3,487 | 3,536 | 3,343 | 3,671 | 5,764 | 4,304 | 3,620 | 3,338 | 4,099 |
1998 | 5,406 | 3,573 | 3,491 | 3,718 | 4,149 | 4,442 | 3,727 | 3,969 | 3,578 | 3,463 | 3,382 | 4,068 |
1999 | 6,265 | 3,896 | 3,988 | 4,791 | 4,482 | 4,183 | 4,456 | 3,912 | 4,053 | 4,368 | 4,371 | 9,567 |
2000 | 11,090 | 5,232 | 4,070 | 4,032 | 4,235 | 3,997 | 4,092 | 3,771 | 3,988 | 4,032 | 3,781 | 5,075 |
2001 | 6,296 | 4,835 | 4,708 | 3,960 | 3,955 | 4,660 | 4,032 | 5,036 | 24,739 | 4,172 | 4,575 | 4,441 |
2002 | 5,973 | 4,828 | 4,540 | 4,848 | 4,342 | 4,022 | 4,073 | 4,293 | 4,178 | 4,208 | 4,152 | 4,689 |
2003 | 8,173 | 5,663 | 5,233 | 5,233 | 4,890 | 5,176 | 5,152 | 4,898 | 5,555 | 4,975 | 4,898 | 4,757 |
2004 | 6,652 | 4,522 | 4,605 | 4,460 | 4,625 | 5,037 | 4,635 | 5,004 | 4,671 | 5,377 | 4,900 | 57,865 |
2005 | 7,512 | 5,033 | 5,219 | 5,014 | 5,183 | 5,056 | 4,830 | 5,208 | 5,282 | 5,547 | 4,998 | 4,976 |
Total | 99,884 | 62,183 | 61,749 | 61,685 | 60,300 | 60,609 | 58,577 | 61,051 | 80,915 | 59,092 | 58,106 | 122,251 |
Wikipedia edit count
[edit]Although having recovered from editcountitis I still wanted to estimate the number of edit I spent on this project. The number of edits is calculated by analyzing the edits of the envolved articles. Edits on project documentation are excluded as are sandbox and Talk Page changes.
Edits on dpm's, dpy's and Year-pages
[edit]Work on the actual listing pages was divided between the dpm's, dpy's and the Year-pages like 2000. I used Xtools edit counter to resolve the the data regarding the dpy's and Year-pages. Especially the number of edits on Deaths in 1999 stood out.
Regarding the dpm's I decided to use extrapolation to calculate an approximation. I would select the two most 'average'-edited months and based on that would calculate the total regarding all months. Causality between the number of entries and the number of edits exists. So, looking at the total number of edits distributed per month I chose the months October and November. They are closest to the average of 3,563.75 entries per month of the year. Next table shows per year the edit count results for the three types of lists:
Year | Year-pages[22] | dpy's | dpm's Oct | dpm's Nov | dpm's Year[23] |
---|---|---|---|---|---|
1990 | 0 | 2 | 68[24] | 58[25] | 756 |
1991 | 0 | 2 | 72 | 70 | 852 |
1992 | 0 | 2 | 73 | 87 | 960 |
1993 | 9 | 2 | 5 | 5 | 60 |
1994 | 12[26] | 2 | 18 | 17 | 210 |
1995 | 1 | 2 | 75 | 77 | 912 |
1996 | 3 | 57 | 57 | 52 | 654 |
1997 | 17 | 93 | 75 | 67 | 852 |
1998 | 5 | 30 | 79 | 80 | 954 |
1999 | 3 | 631[27] | 93 | 76 | 1,014 |
2000 | 6 | 26 | 102 | 146 | 1,488 |
2001 | 4 | 48 | 128 | 119 | 1,482 |
2002 | 15 | 255 | 108 | 99 | 1,242 |
2003 | 2 | 202 | 93 | 73 | 996 |
2004 | 5 | 2 | 82 | 84 | 996 |
2005 | 5 | 2 | 112 | 106 | 1,308 |
Total | 87 | 1,358 | 14,736 |
Processing pages
[edit]As Mill 1 I used two processing pages to compile dpm's whose content was copied to the actual article when a month was done.
I set up this page exclusively for creating dpm content. Therefore all edits in this page can be attributed to the project. Total edits: 5,078[28]
Although I used this page to test my software and other stuff I mainly used it to for the project. Since 5 November 2022 I used it for other purposes so those edits are left out. Total count: 494
Category edits
[edit]Edits to new categories added up to a whopping total of 12:
Other page types
[edit]During the project I also needed to edit other page types like templates whose counts are stated in the next section.
Edit count summary
[edit]All edit counts add up to a total of 21,900:
Page type | Count |
---|---|
Dpm edits | 14,736 |
Dpy edits | 1,358 |
Year-page edits | 87 |
User:Mill 1/Months/December | 5,078 |
User:Mill 1/tmp | 494 |
Lists of deaths by year[29] | 17 |
Template edits[30] | 24 |
Category edits | 12 |
Sonictonic[31] | 94 |
Total | 21,900 |
Top 50 most edited pages
[edit]Miscellaneous data
[edit]Death dates with the highest number of notable entries
[edit]Total number of entries per month of dpm
[edit]Next table shows the combined number of notable entries regarding all dpm's per month of the year. So for example a total of 3,901 deaths are listed for January regarding the dpm's 1990-2005.
Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|
3,901 | 3,512 | 3,665 | 3,462 | 3,500 | 3,326 | 3,387 | 3,453 | 3,464 | 3,540 | 3,575 | 3,980 | 42,765 |
Average number of entries per month: 3,563.75.
Top 10 largest revisions when adding NYTimes references (in bytes)
[edit]TODO zelfde tabel met top (1995) ref. replacements bijv. Dec.1995: 44 (gebruik prev./ next)
Thanks to my strict edit summaries I was able to compile this list. The full list can be found here.
Rank | Dpm | Added bytes | Added refs | Replaced refs[32] | Revision date |
---|---|---|---|---|---|
1 | Deaths in December 2001 | +10,934 | 29 | 14 | 9 May 2021 |
2 | Deaths in October 2000 | +10,874 | 30 | 8 | 12 December 2020 |
3 | Deaths in October 1999 | +10,491 | 29 | 5 | 23 November 2020 |
4 | Deaths in November 1996 | +9,921 | 28 | 1 | 18 September 2021 |
5 | Deaths in November 2001 | +9,864 | 26 | 11 | 7 May 2021 |
6 | Deaths in September 2004 | +9,806 | 25 | 8 | 26 August 2021 |
7 | Deaths in October 2001 | +9,744 | 26 | 14 | 5 May 2021 |
8 | Deaths in September 2003 | +9,732 | 26 | 7 | 18 August 2021 |
9 | Deaths in November 1998 | +9,703 | 26 | 7 | 18 November 2020 |
10 | Deaths in August 2002 | +9,643 | 23 | 13 | 10 August 2021 |
References
[edit]- ^ a b The reference density is the number of references / number of entries
- ^ About half of the content consists of citations
- ^ I created a separate web application to generate the result per month/year.
- ^ I had to hack the web application somewhat to get those numbers.
- ^ Coincidentally today (10 Dec. 2024) we picked up our campervan DIY kit at the 'de Vantast' and learned we are the 116th customer :)
- ^ Pageviews Analysis Deaths in January 2000
- ^ "Celebrity Deaths That Changed Music History: Gone Too Soon". Rolling Stone. August 14, 2017. Retrieved 29 November 2024.
- ^ Pageviews Analysis Deaths in November 1991
- ^ Pageviews Analysis Deaths in April 1994
- ^ Pageviews Analysis Deaths in March 1995
- ^ Pageviews Analysis Deaths in September 1996
- ^ Pageviews Analysis Deaths in March 1997
- ^ Pageviews Analysis Deaths in August 1997
- ^ Pageviews Analysis Deaths in August 2001
- ^ Pageviews Analysis Deaths in September 2001
- ^ Pageviews Analysis Deaths in December 2004
- ^ Pageviews Analysis Deaths in January 2003
- ^ Strangely also a big spike in December which I cannot explain.
- ^ Pageviews Analysis Deaths in August 1998
- ^ Pageviews Analysis Deaths in September 1997
- ^ The death and funeral of Diana, Princess of Wales will also have contibutes to this spike.
- ^ The Deaths and (Births) sections have since been removed from the Year-pages so it is fortunate that I didn't put much time into it.
- ^ Extrapolated to an entire year based on the data in columns dpm's Oct and dpm's Nov
- ^ Mill 1 - Deaths in October 1990 - Top Edits - XTools
- ^ Mill 1 - Deaths in November 1990 - Top Edits - XTools
- ^ Mill 1 - 1994 - Top Edits - XTools
- ^ https://xtools.wmcloud.org/topedits/en.wikipedia.org/Mill%201/0/Deaths%20in%201999 Mill 1 - 1994 - Top Edits - XTools
- ^ Page History User:Mill 1/Months/December - XTools
- ^ Mill 1 - Lists of deaths by year - Top Edits - XTools
- ^ Mill 1 - Template:Deaths by month and year - Top Edits - XTools
- ^ Apparently I had my reasons to process April and May 1993 this way
- ^ Existing citations that were replaced with more durable NYTimes references. As you can see in the revision diffs many existing NYTimes refs were also replaced, adding meta data.