These standards bind every repository a data creator opens as their primary workspace. They are not recommendations. Exceptions require a documented waiver in the PR.

1. Repo artifacts

Every data-product repo ships the following files. Missing any of them blocks PR merge:

2. Extension recommendations

3. Workspace settings

Important

Workspace settings commit to the repo. Never place a secret, a personal auth token, or a user-specific path (with /Users/you/... in it) in .vscode/settings.json. Use ${env:VAR} or ${workspaceFolder} substitutions.

4. Launch configs

5. Tasks

6. Devcontainers

7. Agent configuration

If the team uses AI agents (most do), additional rules apply:

See the MCP servers guide and AI agents for the specifics.

8. Git hygiene

Warning

Committing .venv/ to Git is the classic junior-engineer mistake that bloats the repo forever. Audit the initial commit. If it lands, prune with git filter-repo and force-push before the repo gets cloned widely.

9. Pre-commit hooks

Every repo ships a .pre-commit-config.yaml running, at minimum:

VS Code respects pre-commit; failures surface in the Problems pane and in the Source Control panel.

10. Shared formatters

Mixing formatters within a repo is a lint-noise generator. Pick one per language, pin the version in both pyproject.toml / .sqlfluff and CI.

11. Per-stack rules

Python

dbt

Databricks

Airflow

12. Anti-patterns

13. The review checklist

PRs that touch workspace configuration must satisfy:

14. Governance

Important

Workspace standards are what make onboarding a day, not a week. The discipline of committing the paved path to every repo compounds: new engineers clone, open, and are productive in an hour. Teams that skip this consistently onboard in weeks, not hours.

See also