github.com/pachyderm/pachyderm@v1.13.4/doc/docs/1.10.x/concepts/data-concepts/repo.md (about)

     1  # Repository
     2  
     3  A Pachyderm repository is a location where you store your data inside
     4  Pachyderm. A Pachyderm repository is a top-level data object that contains
     5  files and folders. Similar to Git, a Pachyderm repository tracks all
     6  changes to the data and creates a history of data modifications that you
     7  can access and review. You can store any type of file in a Pachyderm repo,
     8  including binary and plain text files.
     9  
    10  Unlike a Git repository that stores history in a `.git` file in your copy
    11  of a Git repo, Pachyderm stores the history of your commits in a centralized
    12  location. Because of that, you do not run into
    13  merge conflicts as you often do with Git commits when you try to merge
    14  your `.git` history with the master copy of the repo. With large datatsets
    15  resolving a merge conflict might not be possible.
    16  
    17  A Pachyderm repository is the first entity that you configure when you want
    18  to add data to Pachyderm. You can create a repository with the `pachctl create repo`
    19  command, or by using the Pachyderm UI. After creating the repository, you can
    20  add your data by using the `pachctl put file` command.
    21  
    22  A Pachyderm repo name can include alphanumeric characters, dashes, and underscores,
    23  and should be no more than 63 characters long.
    24  
    25  The following types of repositories exist in Pachyderm:
    26  
    27  Input repositories
    28  :   Users or external applications outside of Pachyderm can add data to
    29      the input repositories for further processing.
    30  
    31  Output repositories
    32  :   Pachyderm automatically creates output repositories
    33      pipelines write results of computations into these repositories.
    34  
    35  You can view the list of repositories in your Pachyderm cluster
    36  by running the `pachctl list repo` command.
    37  
    38  !!! example
    39      ```shell
    40      pachctl list repo
    41      ```
    42  
    43      **System Response:**
    44  
    45      ```shell
    46      NAME     CREATED     SIZE (MASTER)
    47      raw_data 6 hours ago 0B
    48      ```
    49  
    50  The `pachctl inspect repo` command provides a more detailed overview
    51  of a specified repository.
    52  
    53  !!! example
    54      ```shell
    55      pachctl inspect repo raw_data
    56      ```
    57  
    58      **System Response:**
    59  
    60      ```shell
    61      Name: raw_data
    62      Description: A raw data repository
    63      Created: 6 hours ago
    64      Size of HEAD on master: 5.121MiB
    65      ```
    66  
    67  If you need to delete a repository, you can run the
    68  `pachctl delete repo` command. This command deletes all
    69  data and the information about the specified
    70  repository, such as commit history. The delete
    71  operation is irreversible and results in a
    72  complete cleanup of your Pachyderm cluster.
    73  If you run the delete command with the `--all` flag, all
    74  repositories will be deleted.
    75  
    76  !!! note "See Also:"
    77      [Pipeline](../pipeline-concepts/pipeline/index.md)