pending files code generates goofy dataset dimensions
The current pending files code generates lots of
union (file_name located)
.... ( file_name '%' and ...
which are ostensibly no-ops, but which are not neccesarily fast when
converted to a database query. We should for now filter them out with
string replacment, so that we don't bog down SAM.