The Wayback Machine - https://web.archive.org/web/20200605155024/https://github.com/github/semantic
Parsing, analyzing, and comparing source code across many languages
  1. Haskell 99.0%
  2. Other 1.0%

Latest commit

maxbrunsfeld Merge pull request #563 from github/symbol-node-type
Add 'node_type' field to the Symbol protobuf message
Latest commit e485681 Jun 3, 2020

Files

Name Latest commit message Commit time
.github/workflows Make sure we always run cabal-cache. May 20, 2020
.licenses/semantic/cabal Merge branch 'master' into sinister-orphan-machinations Sep 24, 2019
app Move semanticd into it's own directory, update builds May 22, 2018
bench One more import to remove Apr 3, 2020
bin Keep a bin directory around for deployments Feb 2, 2016
docs make all github links permalinks Apr 27, 2020
notices Remove docker section from third-party notice Feb 21, 2019
proto Merge branch 'master' into symbol-node-type Jun 2, 2020
script Add corresponding package to cabal.project May 19, 2020
semantic-analysis Merge pull request #507 from github/precise-ql-tagging Apr 1, 2020
semantic-ast improvement of AST.Test.exampleParser May 20, 2020
semantic-codeql Fix Loc for CodeQL tags Apr 27, 2020
semantic-core Bump to semantic-source 0.1.0 Mar 23, 2020
semantic-go Fix Loc for Go tags Apr 27, 2020
semantic-java Fix Loc for Java tags Apr 27, 2020
semantic-json update main-is for test suites in language cabal files Apr 20, 2020
semantic-parse Bump to semantic-source 0.1.0 Mar 23, 2020
semantic-php Fix Loc for PHP tags Apr 27, 2020
semantic-python Fix annoying warning re. inter-project dependencies. Apr 30, 2020
semantic-ruby Fix Loc for Ruby tags Apr 27, 2020
semantic-rust semantic-rust should run corpus tests for macros.txt May 20, 2020
semantic-scope-graph Remove README Apr 1, 2020
semantic-source Roll back temporary changes to semantic-source Mar 31, 2020
semantic-tags s/Implements/Implementation in this comment. Apr 23, 2020
semantic-tsx Whitespace fixes Apr 27, 2020
semantic-typescript Whitespace fixes Apr 27, 2020
src Handle new symbol kinds when converting to protobuf Jun 3, 2020
test Update symbols test fixtures to reflect node_type addition Jun 3, 2020
.dockerignore
.ghci.repl Give script/repl some better defaults. Apr 8, 2020
.ghci.sample Correct the package id for the sample .ghci. Dec 20, 2019
.gitattributes The fixtures are binary files, don't diff them. Jun 20, 2019
.gitignore Merge pull request #377 from github/sha-sha-sha-cha-cha-cha Nov 7, 2019
.gitmodules Remove vendored tree-sitter-ql submodule Mar 4, 2020
.hlint.yaml Update .hlint.yaml Feb 13, 2020
.hspec Add .hspec for depths default, make sure hspec version supports Feb 17, 2017
.stylish-haskell.yaml Don’t align -#}s in LANGUAGE pragmas. Jan 13, 2020
CODE_OF_CONDUCT.md Add code of conduct May 31, 2019
CONTRIBUTING.md v2-. Oct 30, 2019
Dockerfile Merge branch 'master' into symbol-node-type Jun 2, 2020
HACKING.md Draft a HACKING.md file outlining best practices for development. Jan 31, 2020
LICENSE Use MIT license for code May 31, 2019
README.md Updates unofficial stack configuration information May 18, 2020
Setup.hs Use the simple build type. Jul 23, 2017
cabal.project add semantic-rust into packages list May 12, 2020
cabal.project.ci Add corresponding package to cabal.project May 19, 2020
hie.yaml Tell hie-bios about the dependencies. Nov 1, 2019
semantic.cabal Revert testing code. May 20, 2020

README.md

Semantic

semantic is a Haskell library and command line tool for parsing, analyzing, and comparing source code.

In a hurry? Check out our documentation of example uses for the semantic command line tool.

Table of Contents
Usage
Language support
Development
Technology and architecture
Licensing

Usage

Run semantic --help for a complete, up-to-date list of options.

Parse

Usage: semantic parse ([--sexpression] | [--json] | [--json-graph] | [--symbols]
                      | [--dot] | [--show] | [--quiet]) [FILES...]
  Generate parse trees for path(s)

Available options:
  --sexpression            Output s-expression parse trees (default)
  --json                   Output JSON parse trees
  --json-graph             Output JSON adjacency list
  --symbols                Output JSON symbol list
  --dot                    Output DOT graph parse trees
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --quiet                  Don't produce output, but show timing stats
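
To give a feel for how one parse tree can be rendered in more than one of the formats listed above, here is a minimal Python sketch. The `Node` type and both render functions are hypothetical and exist only for illustration; they do not reproduce semantic's actual output schema, only the general difference between an s-expression rendering and a JSON rendering.

```python
import json
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A toy parse-tree node (hypothetical; not semantic's real term type)."""
    kind: str
    children: List["Node"] = field(default_factory=list)


def to_sexpression(node: Node) -> str:
    """Render a tree as a nested s-expression string."""
    if not node.children:
        return f"({node.kind})"
    inner = " ".join(to_sexpression(child) for child in node.children)
    return f"({node.kind} {inner})"


def to_json(node: Node) -> str:
    """Render the same tree as JSON with 'kind' and 'children' keys."""
    def as_dict(n: Node) -> dict:
        return {"kind": n.kind, "children": [as_dict(c) for c in n.children]}
    return json.dumps(as_dict(node), sort_keys=True)


# A hand-built tree standing in for the result of parsing a file.
tree = Node("Program", [Node("Call", [Node("Identifier"), Node("Integer")])])
print(to_sexpression(tree))
print(to_json(tree))
```

The s-expression form is compact and easy to read by eye, while the JSON form is easier for other tools to consume.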

Diff

Usage: semantic diff ([--sexpression] | [--json] | [--json-graph] |
                     [--dot] | [--show]) [FILE_A] [FILE_B]
  Compute changes between paths

Available options:
  --sexpression            Output s-expression diff tree (default)
  --json                   Output JSON diff trees
  --json-graph             Output JSON diff trees
  --dot                    Output the diff as a DOT graph
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
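
As a rough illustration of what "computing changes between paths" means at the tree level, the Python sketch below compares two toy trees position by position and marks inserted and deleted subtrees. This is a deliberately naive stand-in and an assumption on our part: the `Node` type, the `{+ +}` / `{- -}` markers, and the positional matching are all hypothetical and are not the algorithm or output format semantic uses.

```python
from dataclasses import dataclass, field
from itertools import zip_longest
from typing import List, Optional


@dataclass
class Node:
    """A toy syntax-tree node (hypothetical)."""
    kind: str
    children: List["Node"] = field(default_factory=list)


def render(node: Node) -> str:
    """Render an unchanged subtree as an s-expression."""
    parts = [node.kind] + [render(c) for c in node.children]
    return "(" + " ".join(parts) + ")"


def diff(a: Optional[Node], b: Optional[Node]) -> str:
    """Naive positional diff: children are paired by index, not aligned."""
    if a is None:
        return "{+" + render(b) + "+}"   # present only in the new tree
    if b is None:
        return "{-" + render(a) + "-}"   # present only in the old tree
    if a.kind != b.kind:
        return "{-" + render(a) + "-} {+" + render(b) + "+}"
    parts = [a.kind] + [diff(x, y) for x, y in zip_longest(a.children, b.children)]
    return "(" + " ".join(parts) + ")"


before = Node("Program", [Node("Call", [Node("Identifier")])])
after = Node("Program", [Node("Call", [Node("Identifier"), Node("Integer")])])
print(diff(before, after))
```

A production differ would align children with an edit-script algorithm rather than pairing them by index, so that an insertion at the front of a list does not mark every later sibling as changed.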

Graph

Usage: semantic graph ([--imports] | [--calls]) [--packages] ([--dot] | [--json]
                      | [--show]) ([--root DIR] [--exclude-dir DIR]
                      DIR:LANGUAGE | FILE | --language ARG (FILES... | --stdin))
  Compute a graph for a directory or from a top-level entry point module

Available options:
  --imports                Compute an import graph (default)
  --calls                  Compute a call graph
  --packages               Include a vertex for the package, with edges from it
                           to each module
  --dot                    Output in DOT graph format (default)
  --json                   Output JSON graph
  --show                   Output using the Show instance (debug only, format
                           subject to change without notice)
  --root DIR               Root directory of project. Optional, defaults to
                           entry file/directory.
  --exclude-dir DIR        Exclude a directory (e.g. vendor)
  --language ARG           The language for the analysis.
  --stdin                  Read a list of newline-separated paths to analyze
                           from stdin.
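
To make the import-graph and --packages descriptions above more concrete, here is a small Python sketch that turns a hand-written module-to-imports mapping into DOT text. The function name, the input mapping, and the module names are hypothetical; semantic derives this information by analyzing source code, and its real DOT output may be shaped differently.

```python
from typing import Dict, List, Optional


def import_graph_to_dot(imports: Dict[str, List[str]],
                        package: Optional[str] = None) -> str:
    """Emit a DOT digraph with one edge per (module, imported module) pair.

    If `package` is given, also add a vertex for the package with an edge
    to each module, mirroring the description of the --packages option.
    """
    lines = ["digraph {"]
    if package is not None:
        for module in sorted(imports):
            lines.append(f'  "{package}" -> "{module}";')
    for module in sorted(imports):
        for target in sorted(imports[module]):
            lines.append(f'  "{module}" -> "{target}";')
    lines.append("}")
    return "\n".join(lines)


# Hypothetical project: main imports utils and models; models imports utils.
example = {"main": ["utils", "models"], "models": ["utils"], "utils": []}
print(import_graph_to_dot(example, package="my-package"))
```

The resulting text can be passed to any tool that reads DOT, such as Graphviz, to draw the graph.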

Language support

Priority Language Parse Assign Diff Symbols Import graph Call graph Control flow graph
1 Ruby 🚧
2 JavaScript 🚧
3 TypeScript 🚧
4 Python 🚧
5 Go 🚧
PHP 🚧 🚧 🚧 🚧
Java 🚧 N/A 🚧
JSON N/A N/A N/A N/A
JSX
Haskell 🚧 🚧 🚧 🚧
Markdown 🚧 🚧 🚧 N/A N/A N/A  
CodeQL N/A 🚧
  • — Supported
  • 🔶 — Partial support
  • 🚧 — Under development

Development

semantic requires at least GHC 8.8.1 and Cabal 3.0. We strongly recommend using ghcup to sandbox GHC versions, as GHC packages installed through your OS's package manager may not install statically-linked versions of the GHC boot libraries. semantic currently builds only on Unix systems; users of other operating systems may wish to use the Docker images.
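
If you want to check a toolchain against the minimums stated above before building, a comparison along the following lines may help. This is a sketch, not part of the project: the helper names are hypothetical, and it assumes versions are dot-separated integers such as those printed by `ghc --numeric-version` and `cabal --numeric-version`.

```python
import shutil
import subprocess
from typing import Optional, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a string like '8.8.1' into a tuple like (8, 8, 1)."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """True if `installed` is at least `minimum`, compared component-wise."""
    return parse_version(installed) >= parse_version(minimum)


def installed_version(tool: str) -> Optional[str]:
    """Return the tool's numeric version, or None if it is not on PATH."""
    if shutil.which(tool) is None:
        return None
    result = subprocess.run([tool, "--numeric-version"],
                            capture_output=True, text=True, check=True)
    return result.stdout.strip()


# Minimums taken from the paragraph above.
print(meets_minimum("8.10.1", "8.8.1"))  # a newer GHC than the minimum
print(meets_minimum("8.6.5", "8.8.1"))   # an older GHC than the minimum
```

Comparing tuples of integers avoids the classic mistake of comparing version strings lexically, where "8.10" would sort before "8.8".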

We use cabal's Nix-style local builds for development. To get started quickly:

git clone git@github.com:github/semantic.git
cd semantic
script/bootstrap
cabal v2-build
cabal v2-test
cabal v2-run semantic -- --help

stack as a build tool is not officially supported; there is unofficial stack.yaml support available, though we cannot make guarantees as to its stability.

Technology and architecture

Architecturally, semantic:

  1. Reads blobs.
  2. Generates parse trees for those blobs with tree-sitter (an incremental parsing system for programming tools).
  3. Assigns those trees into a generalized representation of syntax.
  4. Performs analysis, computes diffs, or just returns parse trees.
  5. Renders output in one of many supported formats.
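
The five steps above can be pictured as a chain of small functions. The Python sketch below is a toy model under loud assumptions: every function name is hypothetical, whitespace splitting stands in for tree-sitter, and the "analysis" is a simple count. It shows only the shape of the pipeline, not how semantic implements any stage.

```python
import json
from collections import Counter
from typing import Dict, List


def read_blob(source: str) -> str:
    """Step 1: read a blob (here the source text is passed in directly)."""
    return source


def parse(blob: str) -> List[str]:
    """Step 2: produce a 'parse tree' (a flat token list stands in for tree-sitter)."""
    return blob.split()


def assign(tokens: List[str]) -> List[Dict[str, str]]:
    """Step 3: map language-specific nodes onto generalized syntax terms."""
    return [{"term": "Integer" if tok.isdigit() else "Identifier", "text": tok}
            for tok in tokens]


def analyze(terms: List[Dict[str, str]]) -> Dict[str, int]:
    """Step 4: run an analysis (here, count terms of each kind)."""
    return dict(Counter(term["term"] for term in terms))


def render(result: Dict[str, int], fmt: str = "json") -> str:
    """Step 5: render the result in one of several output formats."""
    if fmt == "json":
        return json.dumps(result, sort_keys=True)
    return "\n".join(f"{kind}: {count}" for kind, count in sorted(result.items()))


def run(source: str, fmt: str = "json") -> str:
    """Compose the five stages in order."""
    return render(analyze(assign(parse(read_blob(source)))), fmt)


print(run("x 1 y"))
print(run("x 1 y", fmt="text"))
```

Keeping each stage a separate function means a new output format touches only the last step, and a new language touches only the parse and assign steps.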

Semantic leverages a number of interesting algorithms and techniques:

Contributions

Contributions are welcome! Please see our contribution guidelines and our code of conduct for details on how to participate in our community.

Licensing

Semantic is licensed under the MIT license.
