Story
Resolution: Done
Medium
None
S19 SIMPLY September 3 - 17
We have three "collection" facets - "Full Collection", "Main Collection", and "Featured Books".
"Full Collection" is everything.
"Featured Books" is everything whose .quality is above the "featurable" level, which is 0.65 by default.
"Main Collection" is everything except for open-access books whose .quality is less than 0.3.
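To make the three definitions above concrete, here is a minimal sketch of them as membership tests. The `Book` class, the `in_facet` function, the facet keys, and the constant names are hypothetical and chosen for illustration; only the thresholds (0.65 and 0.3) come from this description, and the real filtering code may look quite different.

```python
from dataclasses import dataclass

# Thresholds taken from the description above; the names are hypothetical.
FEATURABLE_QUALITY = 0.65        # default "featurable" level
MAIN_MINIMUM_QUALITY = 0.3       # open-access cutoff for "Main Collection"


@dataclass
class Book:
    """Hypothetical stand-in for a work in a library's collection."""
    title: str
    quality: float
    open_access: bool


def in_facet(book: Book, facet: str) -> bool:
    """Return True if `book` belongs to the given collection facet."""
    if facet == "full":
        # "Full Collection" is everything.
        return True
    if facet == "featured":
        # "Featured Books": quality above the featurable level.
        return book.quality > FEATURABLE_QUALITY
    if facet == "main":
        # "Main Collection": everything except low-quality open-access books.
        return not (book.open_access and book.quality < MAIN_MINIMUM_QUALITY)
    raise ValueError(f"unknown facet: {facet}")


# An open-access book with no quality information has quality 0.
unrated = Book("Unrated open-access report", quality=0.0, open_access=True)
licensed = Book("Licensed title", quality=0.1, open_access=False)
```

Under these assumptions, `in_facet(unrated, "main")` is false while `in_facet(unrated, "full")` is true: an open-access book with no quality score is kept out of "Main Collection" regardless of how important it is.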
The "Main Collection" facet was created for a specific purpose that no longer obtains: filtering out thousands of non-classic works imported through Project Gutenberg. These works were swamping search results with books that held little interest for contemporary readers, so we created a facet just to exclude them.
As far as I know, NYPL is the only library that imported these works into its collection before we replaced Gutenberg's public domain ebooks with CC-licensed editions from Feedbooks. As part of my work on https://jira.nypl.org/browse/SIMPLY-355 I've removed almost all of those low-interest works from NYPL's collection.
I've also discovered that a few open-access works we consider very important, such as "Report On The Investigation Into Russian Interference In The 2016 Presidential Election", are not showing up in "Main Collection" simply because we have no quality information about them, so their quality is 0. Because "Main Collection" is the default facet, it is excluding good books from the discovery process (though not from the search process).
Going forward it's safe to assume that a book is in a library's collection because the library specifically wants it there. So the distinction between "Main Collection" and "Full Collection" can go away. We can implement this by removing the "Main Collection" facet and making "Full Collection" the default. This will simplify the user interface – both in the admin interface and in the mobile client.
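The proposed change could be sketched roughly as follows for a single library's facet settings. The settings shape (a dict with `enabled` and `default` keys), the facet keys, and the `drop_main_collection` function are assumptions made for illustration, not the actual configuration format. The sketch also assumes a default other than "main" is left alone.

```python
def drop_main_collection(settings: dict) -> dict:
    """Return a copy of hypothetical facet settings without "main".

    `settings` is assumed to look like:
        {"enabled": ["full", "main", "featured"], "default": "main"}
    """
    # Remove the "Main Collection" facet from the enabled list.
    enabled = [facet for facet in settings["enabled"] if facet != "main"]

    # "Full Collection" must be available, since it becomes the default.
    if "full" not in enabled:
        enabled.insert(0, "full")

    # Libraries that defaulted to "main" now default to "full".
    default = settings["default"]
    if default == "main":
        default = "full"

    return {"enabled": enabled, "default": default}


before = {"enabled": ["full", "main", "featured"], "default": "main"}
after = drop_main_collection(before)
```

With the example input, `after` has "full" and "featured" enabled and "full" as the default, and `before` is left unmodified.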