User:Najawin/Sandbox 10

From Tardis Wiki, the free Doctor Who reference

Wow, I should have gotten to this project sooner, but the fork gives me reason to return to it.

Forum:Temporary forums/Categories for stories without summaries established that this category tree should exist, and that it should look a specific way. However, generating the specific categories takes a bit of work because it requires figuring out where all our, say, PROSE works are, and making sure that we can figure out a relatively simple definition of a PROSE work using our category system. (Obviously we could start from the root Category:Prose sources and work down to the leaves, but this is a monumental amount of work for the bot run and isn't guaranteed to be error free.)

Category tree

Category:Missing summaries

Category:Missing plot summaries
Clearly to be in this category the bot will need to detect that the "plot" section on whatever page is being tested says some variation of "To be added" or similar. (Perhaps if there both exists a plot section, and it is smaller than a certain length? We'd have to be careful about Vrs (short story) and a few others if we do that, but it's plausible.)
Category:Missing PROSE plot summaries
Category:Missing AUDIO plot summaries
Generate subcategories based on "Category:Audio stories by company", preserving subcategories within this structure. (So, eg, Category:Big Finish audio stories plot summaries, followed by Category:BFBS audio stories plot summaries)
Category:Missing COMIC plot summaries
Generate subcategories based on "Category:Comic stories by company", preserving subcategories within this structure.
Category:Missing HOMEVID plot summaries
No subcategories, generated entirely from the HOMEVID cat defined below
Category:Missing WC plot summaries
Category:Missing GAME plot summaries
Category:Missing NOTVALID plot summaries
Category:Missing publisher summaries
Same as before, but for publisher summary.
Category:Missing PROSE publisher summaries
Category:Missing AUDIO publisher summaries
Generate subcategories based on "Category:Audio stories by company", preserving subcategories within this structure. (So, eg, Category:Big Finish audio stories plot summaries, followed by Category:BFBS audio stories plot summaries)
Category:Missing COMIC publisher summaries
Generate subcategories based on "Category:Comic stories by company", preserving subcategories within this structure.
Category:Missing HOMEVID publisher summaries
No subcategories, generated entirely from the HOMEVID cat defined below
Category:Missing WC publisher summaries
Category:Missing GAME publisher summaries
Category:Missing NOTVALID publisher summaries
Category:Missing feature summaries
More difficult, probably has to be done by hand, since we don't have a full handle on standards for "feature" articles across the wiki yet.

Transversal of tree

Not sure if this will help the bot, but I'm specifying this just in case, our category tree is rather complicated, and it will also allow me to simplify my notation above.

NOTVALID = Floor 500->Real world->Sources->Non-DWU sources->Non-DWU sources by medium‎ (UNION of all subcategories)

Novel = Floor 500->Real world->Sources->PROSE stories->Novels->Novels by year of release (UNION of all subcategories) SETMINUS NOTVALID

Feature = Floor 500->Real world->Sources->Features->Features by year of release (UNION of all subcategories) SETMINUS NOTVALID

Short Story = Floor 500->Real world->Sources->PROSE stories->Short stories->Short stories by year of release (UNION of all subcategories) SETMINUS NOTVALID

PROSE = Short Story UNION Novel UNION Feature

WC = Floor 500->Real world->Sources->Webcasts (UNION Of all subcategories) SETMINUS NOTVALID ------- Need to redo this, there are some problems in categorization here.

Film = Floor 500->Real world->Sources->Theatrical films (UNION Of all subcategories) SETMINUS NOTVALID

Tv = Floor 500->Real world->Sources->Television stories->Television stories by airdate (UNION of all subcategories) SETMINUS NOTVALID

TV=Film UNION Tv

GAME = Floor 500->Real world->Sources->Real world video games->Video games by year of release (UNION of all subcategories) SETMINUS NOTVALID

HOMEVID = Floor 500->Real world->Sources->Home videos->Home videos by year of release (UNION of all subcategories) SETMINUS NOTVALID

COMIC = Floor 500->Real world->Sources->Comic stories->Comic stories by year of release (UNION of all subcategories) SETMINUS NOTVALID

AUDIO = Floor 500->Real world->Sources->Audio stories->Audio stories by year of release (UNION of all subcategories) SETMINUS NOTVALID