1
1
mirror of https://github.com/github/semantic.git synced 2024-11-24 08:54:07 +03:00
Parsing, analyzing, and comparing source code across many languages
Go to file
2020-06-25 08:56:09 -04:00
.github/workflows build the lib specifically 2020-06-24 20:37:38 -04:00
.licenses/semantic/cabal Merge branch 'master' into sinister-orphan-machinations 2019-09-24 17:32:32 -04:00
bin Keep a bin directory around for deployments 2016-02-02 13:50:07 -06:00
build save a ton of code 2020-06-25 08:56:09 -04:00
docs Update docs to removing graphing and diffing 2020-06-11 12:55:28 -07:00
notices Remove docker section from third-party notice 2019-02-21 18:42:50 -05:00
proto Remove a la carte diffing and graphing 2020-06-11 13:53:12 -07:00
script This particular specification wasn't necessary. 2020-06-23 15:22:30 -04:00
semantic save a ton of code 2020-06-25 08:56:09 -04:00
semantic-analysis save a ton of code 2020-06-25 08:56:09 -04:00
semantic-ast save a ton of code 2020-06-25 08:56:09 -04:00
semantic-codeql save a ton of code 2020-06-25 08:56:09 -04:00
semantic-core save a ton of code 2020-06-25 08:56:09 -04:00
semantic-go save a ton of code 2020-06-25 08:56:09 -04:00
semantic-java save a ton of code 2020-06-25 08:56:09 -04:00
semantic-json save a ton of code 2020-06-25 08:56:09 -04:00
semantic-parse save a ton of code 2020-06-25 08:56:09 -04:00
semantic-php save a ton of code 2020-06-25 08:56:09 -04:00
semantic-proto save a ton of code 2020-06-25 08:56:09 -04:00
semantic-python save a ton of code 2020-06-25 08:56:09 -04:00
semantic-ruby save a ton of code 2020-06-25 08:56:09 -04:00
semantic-rust Merge branch 'master' into codegen-cleanup 2020-06-08 13:42:45 -04:00
semantic-scope-graph save a ton of code 2020-06-25 08:56:09 -04:00
semantic-source save a ton of code 2020-06-25 08:56:09 -04:00
semantic-tags save a ton of code 2020-06-25 08:56:09 -04:00
semantic-tsx save a ton of code 2020-06-25 08:56:09 -04:00
semantic-typescript save a ton of code 2020-06-25 08:56:09 -04:00
vendor add dependencies on node-types files 2020-06-10 12:08:45 -04:00
.bazelignore Add -threaded everywhere 2020-06-23 14:54:32 -04:00
.bazelrc remove deprecated src_strip_prefix thing 2020-06-23 11:29:02 -04:00
.dockerignore Exclude a bunch of one-off files. 2019-10-30 09:21:24 -04:00
.ghci.repl Give script/repl some better defaults. 2020-04-08 15:44:03 -04:00
.ghci.sample Correct the package id for the sample .ghci. 2019-12-20 11:25:32 -05:00
.gitattributes The fixtures are binary files, don't diff them. 2019-06-20 01:37:58 -04:00
.gitignore first attempt at doing bazel stuff - this is problematic 2020-05-19 12:16:16 -04:00
.gitmodules Remove vendored tree-sitter-ql submodule 2020-03-04 09:03:25 -08:00
.hie-bios.sh The tooling appears to work, which is cool. 2020-06-23 17:00:13 -04:00
.hlint.yaml move hlint to .hlint.yaml 2020-06-10 13:11:07 -04:00
.hspec Add .hspec for depths default, make sure hspec version supports 2017-02-17 08:27:51 -08:00
.stylish-haskell.yaml Change the stylish-haskell config to format language pragmas vertically and not align their closing tags. 2019-12-19 15:12:40 -05:00
BUILD.bazel save a ton of code 2020-06-25 08:56:09 -04:00
cabal.project Merge remote-tracking branch 'origin/master' into bazel-experiments 2020-06-09 14:35:24 -04:00
cabal.project.ci half-hearted attempt to unbreak CI 2020-06-24 20:34:37 -04:00
CODE_OF_CONDUCT.md Add code of conduct 2019-05-31 08:37:44 -04:00
CONTRIBUTING.md v2-. 2019-10-30 08:42:28 -04:00
Dockerfile Merge branch 'master' into symbol-node-type 2020-06-02 14:06:06 -07:00
HACKING.md Draft a HACKING.md file outlining best practices for development. 2020-01-31 15:46:05 -05:00
hie.yaml The tooling appears to work, which is cool. 2020-06-23 17:00:13 -04:00
README.md Update docs to removing graphing and diffing 2020-06-11 12:55:28 -07:00
stack-snapshot.yaml semantic:lib now builds 2020-06-09 15:20:08 -04:00
WORKSPACE save a ton of code 2020-06-25 08:56:09 -04:00

Semantic

semantic is a Haskell library and command line tool for parsing, analyzing, and comparing source code.

In a hurry? Check out our documentation of example uses for the semantic command line tool.

Table of Contents
Usage
Language support
Development
Technology and architecture
Licensing

Usage

Run semantic --help for complete list of up-to-date options.

Parse

Usage: semantic parse [--sexpression | (--json-symbols|--symbols) |
                        --proto-symbols | --show | --quiet] [FILES...]
  Generate parse trees for path(s)

Available options:
  --sexpression            Output s-expression parse trees (default)
  --json-symbols,--symbols Output JSON symbol list
  --proto-symbols          Output protobufs symbol list
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --quiet                  Don't produce output, but show timing stats
  -h,--help                Show this help text

Language support

Language Parse AST Symbols† Stack graphs
Ruby
JavaScript
TypeScript 🚧
Python 🚧
Go
PHP
Java 🚧
JSON
JSX
TSX
CodeQL
Haskell 🚧 🚧

† Used for code navigation on github.com.

  • — Supported
  • 🔶 — Partial support
  • 🚧 — Under development
  • - N/A

Development

semantic requires at least GHC 8.8.1 and Cabal 3.0. We strongly recommend using ghcup to sandbox GHC versions, as GHC packages installed through your OS's package manager may not install statically-linked versions of the GHC boot libraries. semantic currently builds only on Unix systems; users of other operating systems may wish to use the Docker images.

We use cabal's Nix-style local builds for development. To get started quickly:

git clone git@github.com:github/semantic.git
cd semantic
script/bootstrap
cabal v2-build
cabal v2-test
cabal v2-run semantic -- --help

stack as a build tool is not officially supported; there is unofficial stack.yaml support available, though we cannot make guarantees as to its stability.

Technology and architecture

Architecturally, semantic:

  1. Reads blobs.
  2. Generates parse trees for those blobs with tree-sitter (an incremental parsing system for programming tools).
  3. Assigns those trees into a generalized representation of syntax.
  4. Performs analysis, computes diffs, or just returns parse trees.
  5. Renders output in one of many supported formats.

Semantic leverages a number of interesting algorithms and techniques:

Contributions

Contributions are welcome! Please see our contribution guidelines and our code of conduct for details on how to participate in our community.

Licensing

Semantic is licensed under the MIT license.