User:Najawin/Sandbox 10: Difference between revisions
Revision as of 02:08, 12 October 2024
(Note to self, as I return to this. Checked my old work on Homevid + WC, it's right, made sure that Game should be smooth sailing. Now come the tedious ones.)
Wow, I should have gotten to this project sooner, but the fork gives me reason to return to it.
Forum:Temporary forums/Categories for stories without summaries established that this category tree should exist, and that it should look a specific way. However, generating the specific categories takes a bit of work, because it requires figuring out where all our, say, PROSE works are, and making sure that we can come up with a relatively simple definition of a PROSE work using our category system. (Obviously we could start from the root Category:Prose sources and work down to the leaves, but this is a monumental amount of work for the bot run and isn't guaranteed to be error-free.)
Category tree
- Category:Missing plot summaries
- Clearly, to be in this category the bot will need to detect that the "plot" section on whatever page is being tested says some variation of "To be added" or similar. (Perhaps if a plot section both exists and is shorter than a certain length? We'd have to be careful about Vrs (short story) and a few others if we do that, but it's plausible.)
- Category:Missing PROSE plot summaries
- Category:Missing AUDIO plot summaries
- Generate subcategories based on "Category:Audio stories by company", preserving subcategories within this structure. (So, eg, Category:Big Finish audio stories plot summaries, followed by Category:BFBS audio stories plot summaries)
- Category:Missing COMIC plot summaries
- Generate subcategories based on "Category:Comic stories by publication", preserving subcategories within this structure.
- Category:Missing HOMEVID plot summaries
- No subcategories, generated entirely from the HOMEVID cat defined below
- Category:Missing WC plot summaries
- Category:Missing Arcbeatle WC plot summaries
- Comes from leaves in Category:Arcbeatle webcasts INTERSECT WC
- Category:Missing BBCi WC plot summaries
- Comes from (leaves in Category:BBCi, minus BBC Red Button, and then the leaves of Category:TARDIS Cam stories) INTERSECT WC
- Category:Missing BBV WC plot summaries
- Comes from leaves in Category:BBV webcasts INTERSECT WC
- Category:Missing Big Finish WC plot summaries
- Comes from leaves in Category:Big Finish webcasts INTERSECT WC
- Category:Missing Doom's Day WC plot summaries
- Comes from leaves in Category:Doom's Day webcasts INTERSECT WC
- Category:Missing Lego Dimensions WC plot summaries
- Comes from leaves in Category:LEGO Dimensions trailers INTERSECT WC
- Category:Missing Monster Files plot summaries
- Comes from leaves in Category:Monster Files stories INTERSECT WC
- Category:Missing P.R.O.B.E. plot summaries
- Comes from leaves in Category:P.R.O.B.E. webcasts INTERSECT WC
- Category:Missing SJA WC plot summaries
- Comes from Category:SJA webcasts's leaves, not the subcategory. INTERSECT WC
- Category:Missing Mr Smith's data updates WC plot summaries
- Comes from Category:Mr Smith's data updates's leaves. INTERSECT WC
- Category:Missing TLV WC plot summaries
- Comes from Category:TLV webcasts's leaves, not the subcategory. INTERSECT WC
- Category:Missing UNIT Field Log WC plot summaries
- Comes from Category:UNIT Field Log webcasts's leaves. INTERSECT WC
- Category:Missing GAME plot summaries
- Take the leaves from this category + the leaves from the subcategories of Category:Video games by Doctor
- Category:Missing NOTVALID plot summaries
- Category:Missing publisher summaries
- Same as before, but for publisher summary.
- Category:Missing PROSE publisher summaries
- Category:Missing AUDIO publisher summaries
- Generate subcategories based on "Category:Audio stories by company", preserving subcategories within this structure. (So, eg, Category:Big Finish audio stories publisher summaries, followed by Category:BFBS audio stories publisher summaries)
- Category:Missing COMIC publisher summaries
- Generate subcategories based on "Category:Comic stories by publication", preserving subcategories within this structure.
- Category:Missing HOMEVID publisher summaries
- No subcategories, generated entirely from the HOMEVID cat defined below
- Category:Missing WC publisher summaries
- Category:Missing Arcbeatle WC publisher summaries
- Comes from leaves in Category:Arcbeatle webcasts INTERSECT WC
- Category:Missing BBCi WC publisher summaries
- Comes from (leaves in Category:BBCi, minus BBC Red Button, and then the leaves of Category:TARDIS Cam stories) INTERSECT WC
- Category:Missing BBV WC publisher summaries
- Comes from leaves in Category:BBV webcasts INTERSECT WC
- Category:Missing Big Finish WC publisher summaries
- Comes from leaves in Category:Big Finish webcasts INTERSECT WC
- Category:Missing Doom's Day WC publisher summaries
- Comes from leaves in Category:Doom's Day webcasts INTERSECT WC
- Category:Missing Lego Dimensions WC publisher summaries
- Comes from leaves in Category:LEGO Dimensions trailers INTERSECT WC
- Category:Missing Monster Files publisher summaries
- Comes from leaves in Category:Monster Files stories INTERSECT WC
- Category:Missing P.R.O.B.E. publisher summaries
- Comes from leaves in Category:P.R.O.B.E. webcasts INTERSECT WC
- Category:Missing SJA WC publisher summaries
- Comes from Category:SJA webcasts's leaves, not the subcategory. INTERSECT WC
- Category:Missing Mr Smith's data updates WC publisher summaries
- Comes from Category:Mr Smith's data updates's leaves. INTERSECT WC
- Category:Missing TLV WC publisher summaries
- Comes from Category:TLV webcasts's leaves, not the subcategory. INTERSECT WC
- Category:Missing UNIT Field Log WC publisher summaries
- Comes from Category:UNIT Field Log webcasts's leaves. INTERSECT WC
- Category:Missing GAME publisher summaries
- Take the leaves from this category + the leaves from the subcategories of Category:Video games by Doctor
- Category:Missing NOTVALID publisher summaries
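The placeholder check described at the top of this tree could be sketched roughly as follows. This is only a sketch in Python: the section names passed in, the exact placeholder wording, and the `min_length` cutoff are all assumptions, and it works on raw wikitext handed over as a string rather than calling any real bot framework.

```python
import re

# Assumed placeholder wording; real pages may use other variations.
PLACEHOLDER = re.compile(r"^\W*to be added\W*$", re.IGNORECASE)


def section_body(wikitext, heading):
    """Return the text under a '== heading ==' section, or None if absent."""
    pattern = re.compile(
        r"^(=+)[ \t]*" + re.escape(heading) + r"[ \t]*\1[ \t]*$",
        re.IGNORECASE | re.MULTILINE,
    )
    match = pattern.search(wikitext)
    if match is None:
        return None
    rest = wikitext[match.end():]
    # The section ends at the next heading line of any level.
    next_heading = re.search(r"^=+.*=+[ \t]*$", rest, re.MULTILINE)
    body = rest[: next_heading.start()] if next_heading else rest
    return body.strip()


def is_missing(wikitext, heading, min_length=0):
    """True if the section exists but is empty, a placeholder, or too short."""
    body = section_body(wikitext, heading)
    if body is None:
        return False  # no such section at all; leave these for manual review
    if body == "" or PLACEHOLDER.match(body):
        return True
    return len(body) < min_length
```

Leaving `min_length` at 0 turns the length test off, which sidesteps the problem of legitimately short stories; any non-zero cutoff would need an exception list for pages like Vrs (short story).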
Traversal of tree
(For reference, Category:Sources) Not sure if this will help the bot, but I'm specifying this just in case, since our category tree is rather complicated. It will also allow me to simplify my notation above.
NOTVALID = Floor 500->Real world->Sources->Non-DWU sources->Category:Non-DWU sources by medium (UNION of all subcategories)
Novel = Floor 500->Real world->Sources->PROSE stories->Novels->Category:Novels by year of release (UNION of all subcategories) SETMINUS NOTVALID
Feature = Floor 500->Real world->Sources->PROSE stories->Features->Category:Features by year of release (UNION of all subcategories) SETMINUS NOTVALID
Short Story = Floor 500->Real world->Sources->PROSE stories->Short stories->Category:Short stories by year of release (UNION of all subcategories) SETMINUS NOTVALID
PROSE = Short Story UNION Novel UNION Feature
WC = Floor 500->Real world->Sources->Webcasts->Category:Webcasts by year of release (UNION of all subcategories)
Film = Floor 500->Real world->Sources->Category:Theatrical films (UNION of all subcategories) SETMINUS NOTVALID
Tv = Floor 500->Real world->Sources->Television stories->Category:Television stories by airdate (UNION of all subcategories) SETMINUS NOTVALID
TV = Film UNION Tv
GAME = Floor 500->Real world->Sources->Real world video games->Category:Video games by year of release (UNION of all subcategories) SETMINUS NOTVALID
(nb: As far as I can tell the other groups can't consistently have plot summaries, they're just not the sorts of things where that makes sense, and almost always have the publisher summaries listed already. Personally I don't see the need for subcategories here for mediums, since the others should be effectively rounding errors. Also, in general I'm skeptical about doing this one by bot run, a lot of these aren't written in such a way that plot summary / publisher's summary are present. Might be better to completely ignore the GAME sources in the bot run and just add them by hand.)
HOMEVID = Floor 500->Real world->Sources->Home videos->Category:Home videos by year of release (UNION of all subcategories) SETMINUS NOTVALID
COMIC = Floor 500->Real world->Sources->Comic stories->Category:Comic stories by year of release (UNION of all subcategories) SETMINUS NOTVALID
AUDIO = Floor 500->Real world->Sources->Audio stories->Category:Audio stories by year of release (UNION of all subcategories) SETMINUS NOTVALID
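The notation above maps fairly directly onto set operations. As a rough illustration only, the sketch below treats the category tree as an in-memory dictionary, collects the leaves under a category by walking down through its subcategories, and then applies UNION, SETMINUS and INTERSECT as Python set operators. The `SUBCATS` and `PAGES` tables and every page title in them are made-up toy data, and it assumes "leaves" means the pages sitting in a category; a real bot run would have to fetch category members from the wiki instead.

```python
# Toy data (hypothetical): category -> direct subcategories,
# and category -> pages filed directly in it.
SUBCATS = {
    "Webcasts by year of release": ["2020 webcasts", "2021 webcasts"],
    "Non-DWU sources by medium": ["Non-DWU webcasts"],
    "Big Finish webcasts": [],
}
PAGES = {
    "2020 webcasts": {"Webcast A", "Webcast B"},
    "2021 webcasts": {"Webcast C", "Parody D"},
    "Non-DWU webcasts": {"Parody D"},
    "Big Finish webcasts": {"Webcast B", "Behind the scenes E"},
}


def leaves(category, seen=None):
    """UNION of the pages in a category and in all of its subcategories."""
    seen = set() if seen is None else seen
    if category in seen:  # guard against category loops
        return set()
    seen.add(category)
    result = set(PAGES.get(category, set()))
    for sub in SUBCATS.get(category, []):
        result |= leaves(sub, seen)
    return result


NOTVALID = leaves("Non-DWU sources by medium")
WC = leaves("Webcasts by year of release")          # no SETMINUS, as written above
big_finish_wc = leaves("Big Finish webcasts") & WC  # INTERSECT
valid_wc = WC - NOTVALID                            # SETMINUS, for illustration
```

With the toy data, `big_finish_wc` keeps only the page that is both under Big Finish webcasts and in WC, which is the point of the INTERSECT WC step in the tree above: it filters out anything in the company category that is not actually a webcast story.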