Description
LynxKite is a complete graph data science platform for very large graphs and other datasets. It seamlessly combines the benefits of a friendly graphical interface and a powerful Python API.
LynxKite alternatives and similar packages
Based on the "Science and Data Analysis" category.
Alternatively, view LynxKite alternatives based on common mentions on social networks and blogs.
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more -
statsmodels
Statsmodels: statistical modeling and econometrics in Python -
Getting Started
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis -
Biopython
Official git repository for Biopython (originally converted from CVS) -
Interactive Parallel Computing with IPython
IPython Parallel: Interactive Parallel Computing in Python -
Cubes
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis -
#<Sawyer::Resource:0x00007f547e829e00>
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites. -
bcbio-nextgen
Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis -
bccb
Incubator for useful bioinformatics code, primarily in Python and R -
Neupy
NeuPy is a Tensorflow based python library for prototyping and building neural networks -
PatZilla
PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources. -
Kotori
A flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple. -
dask-memusage
A low-impact profiler to figure out how much memory each task in Dask is using -
cclib
A library for parsing and interpreting the results of computational chemistry packages. -
ElasticBatch
Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames -
Open Babel
A chemical toolbox designed to speak the many languages of chemical data.
InfluxDB - Power Real-Time Data Analytics at Scale
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.
Do you think we are missing an alternative of LynxKite or a related project?
Popular Comparisons
README
LynxKite
LynxKite is a complete graph data science platform for very large graphs and other datasets. It seamlessly combines the benefits of a friendly graphical interface and a powerful Python API.
- Hundreds of scalable graph operations, including graph metrics like PageRank, embeddedness, and centrality, machine learning methods including GCNs, graph segmentations like modular clustering, and various transformation tools like aggregations on neighborhoods.
- The two main data types are graphs and relational tables. Switch back and forth between the two as needed to describe complex logical flows. Run SQL on both.
- A friendly web UI for building powerful pipelines of operation boxes. Define your own custom boxes to structure your logic.
- Tight integration with Python lets you implement custom transformations or create whole workflows through a simple API.
- Integrates with the Hadoop ecosystem. Import and export from CSV, JSON, Parquet, ORC, JDBC, Hive, or Neo4j.
- Fully documented.
- Proven in production on large clusters and real datasets.
- Fully configurable graph visualizations and statistical plots. Experimental 3D and ray-traced graph renderings.
LynxKite is under active development. Check out our Roadmap to see what we have planned for future releases.
Getting started
Quick try:
docker run --rm -p2200:2200 lynxkite/lynxkite
Setup with persistent data:
docker run \
-p 2200:2200 \
-v ~/lynxkite/meta:/metadata -v ~/lynxkite/data:/data \
-e KITE_MASTER_MEMORY_MB=1024 \
--name lynxkite lynxkite/lynxkite
Contributing
If you find any bugs, have any questions, feature requests or comments, please file an issue or email us at [email protected].
You can install LynxKite's dependencies (Scala, Node.js, Go) with Conda.
Before the first build:
tools/git/setup.sh # Sets up pre-commit hooks.
conda env create --name lk --file conda-env.yml
conda activate lk
cp conf/kiterc_template ~/.kiterc
We use make
for building the whole project.
make
target/universal/stage/bin/lynxkite interactive
Tests
We have test suites for the different parts of the system:
Backend tests are unit tests for the Scala code. They can also be executed with Sphynx as the backend. If you run
make backend-test
it will do both. Or you can startsbt
and runtestOnly *SomethingTest
to run just one test. Run./test_backend.sh -si
to startsbt
with Sphynx as the backend.Frontend tests use Protractor to simulate a user's actions on the UI.
make frontend-test
will build everything, start a temporary LynxKite instance and run the tests against that. Usexvfb-run
for headless execution. If you already have a running LynxKite instance and you don't mind erasing all data from it, runnpx gulp test
in theweb
directory. You can start up a dev proxy that watches the frontend source code for changes withnpx gulp serve
. Run the test suite against the dev proxy withnpx gulp test:serve
.Python API tests are started with
make remote_api-test
. If you already have a running LynxKite that is okay to test on, runpython/remote_api/test.sh
. This script can also run a subset of the test suite:python/remote_api/test.sh -p *something*
License
*Note that all licence references and agreements mentioned in the LynxKite README section above
are relevant to that project's source code only.