Suppose I have a module file like this:
# my_module.py
print(\"hello\")
Then I have a simple script:
# my_script.py
import
I first needed to better understand the import
operation. Fortunately, this is well explained in the importlib documentation and scratching through the source code helped too.
This import
process is actually split in two parts. First, a finder is in charge of parsing the module name (including dot-separated packages) and instantiating an appropriate loader. Indeed, built-in are not imported as local modules for example. Then, the loader is called based on what the finder returned. This loader get the source from a file or from a cache, and executed the code if the module was not previously loaded.
This is very simple. This explains why I actually did not need to use abstract classes from importutil.abc
: I do not want to provide my own import process. Instead, I could create a subclass inherited from one of the classes from importuil.machinery
and override get_source()
from SourceFileLoader
for example. However, this is not the way to go because the loader is instantiated by the finder so I do not have the hand on which class is used. I cannot specify that my subclass should be used.
So, the best solution is to let the finder do its job, and then replace the get_source()
method of whatever Loader has been instantiated.
Unfortunately, by looking trough the code source I saw that the basic Loaders are not using get_source()
(which is only used by the the inspect
module). So my whole idea could not work.
In the end, I guess get_source()
should be called manually, then the returned source should be modified, and finally the code should be executed. This is what Martin Valgur detailed in his answer.
If compatibility with Python 2 is needed, I see no other way than reading the source file:
import imp
import sys
import types
module_name = "my_module"
file, pathname, description = imp.find_module(module_name)
with open(pathname) as f:
source = f.read()
source = source.replace('hello', 'world')
module = types.ModuleType(module_name)
exec(source, module.__dict__)
sys.modules[module_name] = module