mirror of https://github.com/github/semantic.git synced 2024-11-22 05:43:42 +03:00
Parsing, analyzing, and comparing source code across many languages

Semantic

semantic is a Haskell library and command line tool for parsing, analyzing, and comparing source code.

In a hurry? Check out our documentation of example uses for the semantic command line tool.

Table of Contents
Usage
Language support
Development
Technology and architecture
Licensing

Usage

Run semantic --help for a complete, up-to-date list of options.

Parse

Usage: semantic parse [--sexpression | (--json-symbols|--symbols) |
                        --proto-symbols | --show | --quiet] [FILES...]
  Generate parse trees for path(s)

Available options:
  --sexpression            Output s-expression parse trees (default)
  --json-symbols,--symbols Output JSON symbol list
  --proto-symbols          Output protobufs symbol list
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --quiet                  Don't produce output, but show timing stats
  -h,--help                Show this help text
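
As an illustration of the usage text above, here is a minimal sketch of a script that assembles a semantic parse invocation. The flag names are copied from the help output; the helper names (build_parse_command, run_parse) and the file names are hypothetical, and the sketch assumes the semantic executable is on your PATH when run_parse is called.

```python
import shutil
import subprocess

# Output-format flags copied from the usage text; the usage line shows them
# as alternatives, so exactly one is passed per invocation.
OUTPUT_FLAGS = {
    "sexpression": "--sexpression",      # default according to the help text
    "json-symbols": "--json-symbols",
    "proto-symbols": "--proto-symbols",
    "show": "--show",
    "quiet": "--quiet",
}


def build_parse_command(files, output="sexpression"):
    """Return the argv list for `semantic parse` with a single output flag."""
    if output not in OUTPUT_FLAGS:
        raise ValueError(f"unknown output format: {output!r}")
    return ["semantic", "parse", OUTPUT_FLAGS[output], *files]


def run_parse(files, output="sexpression"):
    """Run the command and return its stdout (needs semantic on PATH)."""
    if shutil.which("semantic") is None:
        raise FileNotFoundError("semantic executable not found on PATH")
    result = subprocess.run(
        build_parse_command(files, output),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # app.rb is a hypothetical input file; this only prints the command line.
    print(" ".join(build_parse_command(["app.rb"], "json-symbols")))
```

Building the argument list separately from running it keeps the flag handling easy to inspect without having semantic installed.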

Language support

Language Parse AST Symbols† Stack graphs
Ruby
JavaScript
TypeScript 🚧
Python 🚧
Go
PHP
Java 🚧
JSON
JSX
TSX
CodeQL
Haskell 🚧 🚧

† Used for code navigation on github.com.

  • — Supported
  • 🔶 — Partial support
  • 🚧 — Under development
  • - N/A

Development

semantic requires at least GHC 8.10.1 and Cabal 3.0. We strongly recommend using ghcup to sandbox GHC versions, as GHC packages installed through your OS's package manager may not install statically-linked versions of the GHC boot libraries. semantic currently builds only on Unix systems; users of other operating systems may wish to use the Docker images.
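
To check the minimum versions stated above before building, a small script along the following lines may help. It is only a sketch: the helper names are hypothetical, and it assumes you feed it the version strings printed by ghc --numeric-version and cabal --numeric-version. Versions are compared as integer tuples because a plain string comparison would wrongly rank 8.8.4 above 8.10.1.

```python
import re

# Minimum versions stated in this README.
MIN_GHC = (8, 10, 1)
MIN_CABAL = (3, 0)


def parse_version(text):
    """Extract the first dotted version number in text as a tuple of ints."""
    match = re.search(r"\d+(?:\.\d+)+", text)
    if match is None:
        raise ValueError(f"no dotted version number found in {text!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_minimum(version_text, minimum):
    """Return True if the version in version_text is at least minimum."""
    return parse_version(version_text) >= minimum


if __name__ == "__main__":
    # Hypothetical version strings, as the tools might print them.
    print(meets_minimum("8.10.7", MIN_GHC))     # True
    print(meets_minimum("8.8.4", MIN_GHC))      # False: 8 < 10 in the second component
    print(meets_minimum("3.6.2.0", MIN_CABAL))  # True
```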

We use cabal's Nix-style local builds for development. To get started quickly:

git clone git@github.com:github/semantic.git
cd semantic
script/bootstrap
cabal v2-build all
cabal v2-run semantic:test
cabal v2-run semantic:semantic -- --help

You can also use the Bazel build system for development. To learn more about Bazel and why it might give you a better development experience, check the build documentation.

git clone git@github.com:github/semantic.git
cd semantic
script/bootstrap-bazel
bazel build //...

stack as a build tool is not officially supported; there is unofficial stack.yaml support available, though we cannot make guarantees as to its stability.

Technology and architecture

Architecturally, semantic:

  1. Generates per-language Haskell syntax types based on tree-sitter grammar definitions.
  2. Reads blobs from a filesystem or provided via a protocol buffer request.
  3. Returns blobs or performs analysis.
  4. Renders output in one of many supported formats.
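
The data flow in steps 2 through 4 can be pictured with a toy sketch. This is not semantic's code (semantic is written in Haskell and its analyses work on tree-sitter syntax trees); every name here is hypothetical, the "analysis" is a stand-in that counts non-blank lines, and step 1 (generating syntax types from grammars) is left out entirely.

```python
import json
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Blob:
    """A source file held in memory: its path plus its contents."""
    path: str
    source: str


def read_blobs(paths):
    """Step 2: load each file from the filesystem into a Blob."""
    return [Blob(str(p), Path(p).read_text(encoding="utf-8")) for p in paths]


def analyze(blob):
    """Step 3 stand-in: a toy analysis that counts non-blank lines."""
    lines = [line for line in blob.source.splitlines() if line.strip()]
    return {"path": blob.path, "nonblank_lines": len(lines)}


def render(results, fmt="json"):
    """Step 4: render the results in one of several output formats."""
    if fmt == "json":
        return json.dumps(results, indent=2)
    if fmt == "text":
        return "\n".join(f"{r['path']}: {r['nonblank_lines']}" for r in results)
    raise ValueError(f"unsupported format: {fmt!r}")


if __name__ == "__main__":
    # An in-memory blob stands in for a file read from disk.
    blob = Blob("example.rb", "def hello\n\n  puts 'hi'\nend\n")
    print(render([analyze(blob)], fmt="text"))
```

Keeping reading, analysis, and rendering as separate functions mirrors the staged description above: each stage can be swapped without touching the others.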

Throughout its lifecycle, semantic has leveraged a number of interesting algorithms and techniques, including:

Contributions

Contributions are welcome! Please see our contribution guidelines and our code of conduct for details on how to participate in our community.

Licensing

Semantic is licensed under the MIT license.