dolphinscheduler-sdk-python icon indicating copy to clipboard operation
dolphinscheduler-sdk-python copied to clipboard

[impv] Auto create the whole task workspace to make the task the same workspace when running locally and on the DolphinScheduler platform

Open jieguangzhou opened this issue 3 years ago • 0 comments

For example, If I have the workspace like this: https://github.com/pytorch/examples/tree/main/word_language_model image

We can submit the task

# [start workflow]
from pydolphinscheduler.core import Workflow
from pydolphinscheduler.core.resource import Resource, LocalResource
from pydolphinscheduler.tasks import Shell

with Workflow(
    name="auto_resources_example",
) as workflow:
    # [start use_exists_resources]
    task_use_resource = Shell(
        name="example",
        command=f"python main.py --cuda",
        auto_resource=True
    )
    # [end use_exists_resources]

    workflow.run()
# [end workflow]

Then, all the files will be uploaded to the resource center. And there is another best way to track the file version, If this is the git repository, we only record the git repository, version, and diff. Then, we can reproduce the workspace in dolphinscheduler. (Inspired by ClearML) So, After developing a project locally, we can directly submit it to DolphinScheduler and run it directly. Except for some inconsistent environment variables.

jieguangzhou avatar Dec 21 '22 06:12 jieguangzhou