I'm having trouble loading a file from dagster code (setup code, not the pipelines themselves). Suppose I have the following project structure:
pipelines
-app/
--environments
----schedules.yaml
--repository.py
--repository.yaml
When I run dagit from the project folder ($ cd project && dagit -y app/repository.yaml), that folder becomes the working directory, so inside repository.py I can load a file using a path relative to the project root:
# repository.py
with open('app/environments/schedules.yaml', 'r') as f:
    # do something with the file
    ...
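However, once I set up schedules, the pipelines in the project do not run. Judging from the cron logs, the open line raises a file-not-found exception. I suspect this happens because the working directory is different when cron executes the job.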
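To illustrate the suspicion, here is a minimal sketch of how the same relative path can succeed or fail depending on the current working directory. It builds a throwaway copy of the layout in a temporary directory; the helper name can_open and the temporary directories are assumptions made for the demonstration, not part of my project or of dagster.

```python
import os
import tempfile
from pathlib import Path


def can_open(relative_path: str) -> bool:
    """Return True if relative_path can be opened from the current working directory."""
    try:
        with open(relative_path):
            return True
    except FileNotFoundError:
        return False


# Throwaway copy of the layout: <tmp>/app/environments/schedules.yaml
project_root = Path(tempfile.mkdtemp())
(project_root / "app" / "environments").mkdir(parents=True)
(project_root / "app" / "environments" / "schedules.yaml").write_text("schedules: []\n")

# An unrelated, empty directory standing in for wherever cron starts the job.
elsewhere = Path(tempfile.mkdtemp())

original_cwd = os.getcwd()
try:
    os.chdir(project_root)  # like running dagit from the project folder
    found_from_root = can_open("app/environments/schedules.yaml")
    os.chdir(elsewhere)     # like a scheduled run starting somewhere else
    found_from_elsewhere = can_open("app/environments/schedules.yaml")
finally:
    os.chdir(original_cwd)  # restore the caller's working directory

print(found_from_root, found_from_elsewhere)
```

If the scheduled run really does start in a different directory, it would behave like the second case here.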
For context, I'm loading a config file with a cron_schedules parameter for each pipeline. Also, in my case, this is the tail of the stack trace:
File "/home/user/.local/share/virtualenvs/pipelines-mfP13m0c/lib/python3.8/site-packages/dagster/core/definitions/handle.py", line 190, in from_yaml
return LoaderEntrypoint.from_file_target(
File "/home/user/.local/share/virtualenvs/pipelines-mfP13m0c/lib/python3.8/site-packages/dagster/core/definitions/handle.py", line 161, in from_file_target
module = import_module_from_path(module_name, os.path.abspath(python_file))
File "/home/user/.local/share/virtualenvs/pipelines-mfP13m0c/lib/python3.8/site-packages/dagster/seven/__init__.py", line 75, in import_module_from_path
spec.loader.exec_module(module)
File "<frozen importlib._bootstrap_external>", line 783, in exec_module
File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
File "/home/user/pipelines/app/repository.py", line 28, in <module>
schedule_builder = ScheduleBuilder(settings.CRON_PRESET, settings.ENV_DICT)
File "/home/user/pipelines/app/schedules.py", line 12, in __init__
self.cron_schedules = self._load_schedules_yaml()
File "/home/user/pipelines/app/schedules.py", line 16, in _load_schedules_yaml
with open(path) as f:
FileNotFoundError: [Errno 2] No such file or directory: 'app/environments/schedules.yaml'
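If the working directory is indeed the cause, one possible workaround would be to build the path from the location of the module rather than from the current directory. The sketch below assumes schedules.yaml lives in an environments folder next to schedules.py, as the paths in the stack trace suggest; the function name schedules_yaml_path is hypothetical.

```python
from pathlib import Path


def schedules_yaml_path(module_file: str) -> Path:
    """Return the absolute path of environments/schedules.yaml next to module_file.

    The result does not depend on the current working directory, because it is
    derived from the location of the module file itself.
    """
    module_dir = Path(module_file).resolve().parent
    return module_dir / "environments" / "schedules.yaml"


# In app/schedules.py this would be called with the module's own location:
#     path = schedules_yaml_path(__file__)
#     with open(path) as f:
#         ...
```

With this, the same file should be found whether the code is started by dagit from the project folder or by cron from another directory.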