Skip to main content

Project SEMA-JOIN

Semantic Table Joins using PMI Scores

Easy to Use

Statistical Semantic Discovery

Automatically discovers semantic relationships by analyzing value co-occurrence patterns across table corpora. Handles entity variations, hierarchical relationships, and code mappings without manual rules.

Focus on What Matters

PMI-Based Join Quality

Uses Pointwise Mutual Information scores to quantify relationship strength. Two proven algorithms: CS-JP-LP for optimal quality and RS-JP for efficient performance with quality superior to traditional approaches.

Powered by React

Corpus-Driven Intelligence

Based on Microsoft Research paper. Stores co-occurrence patterns from your table corpus, then calculates semantic relationships on-demand for fast joins that understand your data domain without external knowledge bases.