Skip to content

Add datasets image to Operate First #31

@pacospace

Description

@pacospace

Is your feature request related to a problem? Please describe.
As Thoth contributor,

I want to learn about Thoth datasets directly from Operate First, spawning an image in JH.

As Thoth Maintainer,
I would like to publish datasets about knowledge collected by Thoth and make it available in Operate First.

Describe the solution you'd like
Move datasets to Ceph Public Bucket on Operate First and modify notebooks to retrieve data from there (remove git LFS dependency). At the end add the new image created to Operate First to be spawned.

Tasks:

  • Learn how to access notebooks in JupyterHub[1] available through Open Data Hub [8] on Operate First [2] spawning Experimental Elyra image and learn about Thoth datasets [3].
  • Verify/Request the credentials of the public bucket available on Operate First.
  • Place datasets on that bucket under thoth/datasets/{dataset-name} using aws/s3 CLI [4] from your local terminal.
  • Modify notebooks to retrieve data from s3 instead of using local datasets.
  • Push changes to thoth-station/datasets using Git extension [5].
  • Create a release using Kebechet [6] and AICoE-CI [7]
  • Add image to JupyterHub list of images, so that users can learn about thoth datasets.
  • Demo

References:

Metadata

Metadata

Assignees

No one assigned

    Labels

    hacktoberfestIssues targeting the hacktoberfest participants.help wantedDenotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.kind/demoThis is an Issue or PR someone want to give a demo or request a demo.kind/featureCategorizes issue or PR as related to a new feature.lifecycle/frozenIndicates that an issue or PR should not be auto-closed due to staleness.priority/important-soonMust be staffed and worked on either currently, or very soon, ideally in time for the next release.sig/user-experienceIssues or PRs related to the User Experience of our Services, Tools, and Libraries.

    Type

    No type

    Projects

    Status

    🆕 New

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions