mirror of https://github.com/github/semantic.git synced 2024-12-02 11:23:05 +03:00
Parsing, analyzing, and comparing source code across many languages

Semantic

semantic is a Haskell library and command line tool for parsing, analyzing, and comparing source code.

In a hurry? Check out our documentation of example uses for the semantic command line tool.

Table of Contents
Usage
Language support
Development
Technology and architecture
Licensing

Usage

Run semantic --help for a complete, up-to-date list of options.

Parse

Usage: semantic parse ([--sexpression] | [--json] | [--json-graph] | [--symbols]
                      | [--dot] | [--show] | [--quiet]) [FILES...]
  Generate parse trees for path(s)

Available options:
  --sexpression            Output s-expression parse trees (default)
  --json                   Output JSON parse trees
  --json-graph             Output JSON adjacency list
  --symbols                Output JSON symbol list
  --dot                    Output DOT graph parse trees
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --quiet                  Don't produce output, but show timing stats
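
As a rough illustration of the options above, the sketch below parses one file three ways. The scratch directory and the file example.rb are hypothetical and exist only for this example; the flags are taken from the help text above. The commands run only if semantic is already on your PATH.

```shell
# Hypothetical input: a tiny Ruby file in a scratch directory.
workdir=$(mktemp -d)
printf 'def greet(name)\n  puts "hi #{name}"\nend\n' > "$workdir/example.rb"

if command -v semantic >/dev/null 2>&1; then
  # No format flag: s-expression parse tree (the documented default).
  semantic parse "$workdir/example.rb"
  # JSON symbol list for the same file.
  semantic parse --symbols "$workdir/example.rb"
  # No tree output, only timing stats.
  semantic parse --quiet "$workdir/example.rb"
else
  echo "semantic not found on PATH; skipping" >&2
fi
```

The format flags are alternatives in the usage line, so each invocation passes at most one of them.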

Diff

Usage: semantic diff ([--sexpression] | [--json] | [--json-graph] |
                     [--dot] | [--show]) [FILE_A] [FILE_B]
  Compute changes between paths

Available options:
  --sexpression            Output s-expression diff tree (default)
  --json                   Output JSON diff trees
  --json-graph             Output JSON diff trees
  --dot                    Output the diff as a DOT graph
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
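
A minimal sketch of a diff run, assuming semantic is on your PATH. The two Ruby files are hypothetical stand-ins for the FILE_A and FILE_B arguments; only flags listed above are used.

```shell
# Hypothetical before/after versions of the same Ruby method.
workdir=$(mktemp -d)
printf 'def total(xs)\n  xs.sum\nend\n' > "$workdir/before.rb"
printf 'def total(xs)\n  xs.sum.round(2)\nend\n' > "$workdir/after.rb"

if command -v semantic >/dev/null 2>&1; then
  # Default output: s-expression diff tree.
  semantic diff "$workdir/before.rb" "$workdir/after.rb"
  # Same comparison as JSON, saved for later processing.
  semantic diff --json "$workdir/before.rb" "$workdir/after.rb" > "$workdir/diff.json"
else
  echo "semantic not found on PATH; skipping" >&2
fi
```

The older file goes first and the newer file second, matching the [FILE_A] [FILE_B] order in the usage line.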

Graph

Usage: semantic graph ([--imports] | [--calls]) [--packages] ([--dot] | [--json]
                      | [--show]) ([--root DIR] [--exclude-dir DIR]
                      DIR:LANGUAGE | FILE | --language ARG (FILES... | --stdin))
  Compute a graph for a directory or from a top-level entry point module

Available options:
  --imports                Compute an import graph (default)
  --calls                  Compute a call graph
  --packages               Include a vertex for the package, with edges from it
                           to each module
  --dot                    Output in DOT graph format (default)
  --json                   Output JSON graph
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --root DIR               Root directory of project. Optional, defaults to
                           entry file/directory.
  --exclude-dir DIR        Exclude a directory (e.g. vendor)
  --language ARG           The language for the analysis.
  --stdin                  Read a list of newline-separated paths to analyze
                           from stdin.
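
The sketch below uses the single FILE form of the graph command, which avoids having to name a language explicitly. The file main.py is a hypothetical entry point, and rendering the DOT output with Graphviz's dot tool is an optional extra step that is not part of semantic itself.

```shell
# Hypothetical entry-point module with one import and one call.
workdir=$(mktemp -d)
printf 'import os\n\ndef main():\n    print(os.getcwd())\n\nmain()\n' > "$workdir/main.py"

if command -v semantic >/dev/null 2>&1; then
  # Import graph in DOT format (both are the documented defaults).
  semantic graph --imports "$workdir/main.py" > "$workdir/imports.dot"
  # Call graph as JSON.
  semantic graph --calls --json "$workdir/main.py" > "$workdir/calls.json"
  # Optional: render the DOT file to SVG if Graphviz is installed.
  if command -v dot >/dev/null 2>&1; then
    dot -Tsvg "$workdir/imports.dot" -o "$workdir/imports.svg"
  fi
else
  echo "semantic not found on PATH; skipping" >&2
fi
```

For whole projects, the usage line above also offers the DIR:LANGUAGE and --language forms; the accepted language names are not listed here, so check semantic --help before relying on them.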

Language support

Priority Language Parse Assign Diff Symbols Import graph Call graph Control flow graph
1 Ruby 🚧
2 JavaScript 🚧
3 TypeScript 🚧
4 Python 🚧
5 Go 🚧
PHP 🚧 🚧 🚧 🚧
Java 🚧 N/A 🚧
JSON N/A N/A N/A N/A
JSX
Haskell 🚧 🚧 🚧 🚧
Markdown 🚧 🚧 🚧 N/A N/A N/A  
CodeQL N/A 🚧
  • — Supported
  • 🔶 — Partial support
  • 🚧 — Under development

Development

semantic requires at least GHC 8.8.1 and Cabal 3.0. We strongly recommend using ghcup to sandbox GHC versions, as GHC packages installed through your OS's package manager may not install statically-linked versions of the GHC boot libraries. semantic currently builds only on Unix systems; users of other operating systems may wish to use the Docker images.

We use cabal's Nix-style local builds for development. To get started quickly:

git clone git@github.com:github/semantic.git
cd semantic
script/bootstrap
cabal v2-build
cabal v2-test
cabal v2-run semantic -- --help

stack as a build tool is not officially supported; there is an unofficial stack.yaml available, though we cannot make guarantees as to its stability.

Technology and architecture

Architecturally, semantic:

  1. Reads blobs.
  2. Generates parse trees for those blobs with tree-sitter (an incremental parsing system for programming tools).
  3. Assigns those trees into a generalized representation of syntax.
  4. Performs analysis, computes diffs, or just returns parse trees.
  5. Renders output in one of many supported formats.
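
Of the steps above, the last is the easiest to observe from the command line. The following sketch, which assumes semantic is on your PATH and uses a hypothetical Go file, parses the same blob once per renderer and writes each result to its own file. The flags come from the parse help text; the output file names are arbitrary.

```shell
# Hypothetical input blob.
workdir=$(mktemp -d)
printf 'package main\n\nfunc main() {}\n' > "$workdir/example.go"

if command -v semantic >/dev/null 2>&1; then
  # Same input, three renderers: only the output format changes.
  for format in sexpression json dot; do
    semantic parse "--$format" "$workdir/example.go" > "$workdir/tree.$format"
  done
else
  echo "semantic not found on PATH; skipping" >&2
fi
```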

Semantic leverages a number of interesting algorithms and techniques.

Contributions

Contributions are welcome! Please see our contribution guidelines and our code of conduct for details on how to participate in our community.

Licensing

Semantic is licensed under the MIT license.