What do we need to do to make --print-available-modules and --module-description work?
--module-description switches don't work well with our SRT build.
nova --print-available-modules gives me a large number of errors like this:
In: getModuleType ---- Configuration BEGIN Library specificaton "": does not correspond to any library in LD_LIBRARY_PATH of type "module" ---- Configuration END In: getFilePath ---- Configuration BEGIN Library specificaton "": does not correspond to any library in LD_LIBRARY_PATH of type "module" ---- Configuration END In: getDescription ---- Configuration BEGIN Library specificaton "": does not correspond to any library in LD_LIBRARY_PATH of type "module" ---- Configuration END
before providing the module list. Built in modules have
Source location, but all of our modules have
[ error ] and
[ not found ] for those.
The result for
nova --module-description is consistent, error messages and blank fields. Nor does it show anything in the
Allowed configuration block for a module I converted over to
Interestingly services seem to fare better, they get correct Type and Location fields. I haven't tried converting a service configuration yet.
What are the requirements for these features to work? Do we need to set environment variables or have the module libraries named according to a certain convention?
This is all for art 1.17.03
#2 Updated by Kyle Knoepfel over 3 years ago
- Tracker changed from Support to Bug
- Estimated time set to 2.00 h
Short answer: NOvA uses a library-naming scheme that does not support plugin-ambiguity resolution.
Full answer: Consider a module whose
.so is named:
To enable this module in an
art job, a user has two configuration options:
module_type: "MyModule" # or module_type: "dir1/MyModule"
The first specification is the "short spec", and the second is the "long spec" or full spec.
Using the long spec is essential for the following situation. Consider the source code paths for two modules:
Following the encouraged naming pattern, the
.so's for these modules is:
This way, users can specify
module_type: "dir1/dir2/MyModule" # or module_type: "dir1/dir3/MyModule"
in order to disambiguate the modules. It would not be possible otherwise, and
art, in fact, throws an exception if there is an ambiguity that cannot be resolved. NOvA, however, does not follow the naming pattern as mentioned above -- thus, no long spec is available.
--module-description facilities are called, the long spec is used -- thus the unhelpful printout you are seeing. For services, the long spec is never used by
art, and the
--service-description facilities therefore use the short spec, which is why you saw no problems with the services.
Bottom line: Even though NOvA does not name
.so's in a way that allows for ambiguity resolution, there's no reason why the
--module-description facility should require the long specification. This will be adjusted.
#3 Updated by Christopher Backhouse over 3 years ago
OK, so short answer is we should sit tight? Though I thought there were not going to be any more 1.17 releases? so it could be a while before we're able to use this.
Isn't a short spec equivalent to a long spec in the case that the library happens not to be in any directory? Imagine we just prefixed "a_" to all our library names, regardless of what directory the source came from.
#5 Updated by Christopher Backhouse over 3 years ago
I'd always assumed, without really thinking about it, that the behaviour was "multiple matches = error". But SRT test releases rely on first-match-in-LD_LIBRARY_PATH behaviour and definitely work reliably, so it sounds like the premise of your issue (and all the replies to it) is wrong?
#6 Updated by Kyle Knoepfel over 3 years ago
It is the stated policy that
art continues its feature development on top of a 1.18 branch. However, that does not preclude us from back-porting feature requests to 1.17, which we have already done. Also, I would characterize this issue as a bug, and we typically implement and release bug fixes in patch versions of the same minor series (1.17, in this case).
As for "first-match-in-
LD_LIBRARY_PATH wins" (which is a tangential discussion), that is typically true for any
.so. However, for plugins (i.e. modules, services, sources, etc.),
art explicitly loads them through its plugin manager, which throws an exception if a given library spec can be matched to more than one
.so file. For example, consider the two libraries:
The library spec for both is
dir2 are both on
art (more specifically
cetlib::LibraryManager) will throw an exception.
If you are seeing behavior contrary to this, please file a bug report giving a specific example so we can recreate it. Last I checked, there were no duplicate library specifications in novasoft (based on the list obtained running
nova --print-available-modules. So unless you added a module whose library name clashed with another, where both directories were included on
LD_LIBRARY_PATH, I would not expect you to see any exceptions being thrown.
As Rob already mentioned, there is a feature request for allowing the first
.so name match to be loaded.
#7 Updated by Christopher Backhouse over 3 years ago
OK, I'd be happy to see this fixed in a bugfix 1.17 release.
Whenever a user adds a package to their test release the modules in that package appear twice, as:
There's no error in this case, just first module wins (which is exactly what we want).
Perhaps this is OK because the
dir2 part in your example is identical (ie they're located in the same place relative to the element of
$LD_LIBRARY_PATH they're under).
#8 Updated by Kyle Knoepfel over 3 years ago
- Status changed from Assigned to Resolved
- % Done changed from 0 to 100
The fix has been implemented with commit art:a3946f2. There has been an additional request to include updates to
fhiclcpp in a future 1.17 release. We would likely fold this bug fix in when that happens.
Regarding the first-match-in-
LD_LIBRARY_PATH behavior. You are correct, Chris; I was mistaken. Indeed, the first match for a given
.so wins. The disadvantage to not using the recommend library-naming scheme (as alluded to previously) is that if a user specifies a certain module, they may not realize that they will be getting a module from an
LD_LIBRARY_PATH directory that is sooner in the list than what they intend (if the modules have the same base name). This undesirable situation can be avoided by using the recommend library-naming pattern, as the
LibraryManager is then able to detect an ambiguity, throw an exception and ask the user to resolve the ambiguity via a full specification.