Data &amp; Analytics

Comprehensive Python library for astronomy and astrophysics. This skill should be used when working with astronomical data including celestial coordinates, physical units, FITS files, cosmological calculations, time systems, tables, world coordinate systems (WCS), and astronomical data analysis. Use when tasks involve coordinate transformations, unit conversions, FITS file manipulation, cosmological distance calculations, time scale conversions, or astronomical data processing.

5.2K

Jan 9, 2026, 04:57 PM

cosmic-database

Data & Analytics

Access COSMIC cancer mutation database. Query somatic mutations, Cancer Gene Census, mutational signatures, gene fusions, for cancer research and precision oncology. Requires authentication.

5.2K

Jan 9, 2026, 04:57 PM

dask

Data & Analytics

Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.

5.2K

Jan 9, 2026, 04:57 PM

drugbank-database

Data & Analytics

Access and analyze comprehensive drug information from the DrugBank database including drug properties, interactions, targets, pathways, chemical structures, and pharmacology data. This skill should be used when working with pharmaceutical data, drug discovery research, pharmacology studies, drug-drug interaction analysis, target identification, chemical similarity searches, ADMET predictions, or any task requiring detailed drug and drug target information from DrugBank.

5.2K

Jan 9, 2026, 04:57 PM

exploratory-data-analysis

Data & Analytics

Perform comprehensive exploratory data analysis on scientific data files across 200+ file formats. This skill should be used when analyzing any scientific data file to understand its structure, content, quality, and characteristics. Automatically detects file type and generates detailed markdown reports with format-specific analysis, quality metrics, and downstream analysis recommendations. Covers chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and general scientific data formats.

5.2K

Jan 9, 2026, 04:57 PM

geniml

Data & Analytics

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

5.2K

Jan 9, 2026, 04:57 PM

geo-database

Data & Analytics

Access NCBI GEO for gene expression/genomics data. Search/download microarray and RNA-seq datasets (GSE, GSM, GPL), retrieve SOFT/Matrix files, for transcriptomics and expression analysis.

5.2K

Jan 9, 2026, 04:57 PM

geopandas

Data & Analytics

Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between datasets, dissolving boundaries, clipping data, calculating areas/distances, reprojecting coordinate systems, creating maps, or converting between spatial file formats.

5.2K

Jan 9, 2026, 04:57 PM

hmdb-database

Data & Analytics

Access Human Metabolome Database (220K+ metabolites). Search by name/ID/structure, retrieve chemical properties, biomarker data, NMR/MS spectra, pathways, for metabolomics and identification.

5.2K

Jan 9, 2026, 04:57 PM

lamindb

Data & Analytics

This skill should be used when working with LaminDB, an open-source data framework for biology that makes data queryable, traceable, reproducible, and FAIR. Use when managing biological datasets (scRNA-seq, spatial, flow cytometry, etc.), tracking computational workflows, curating and validating data with biological ontologies, building data lakehouses, or ensuring data lineage and reproducibility in biological research. Covers data management, annotation, ontologies (genes, cell types, diseases, tissues), schema validation, integrations with workflow managers (Nextflow, Snakemake) and MLOps platforms (W&B, MLflow), and deployment strategies.

5.2K

Jan 9, 2026, 04:57 PM

matlab

Data & Analytics

MATLAB and GNU Octave numerical computing for matrix operations, data analysis, visualization, and scientific computing. Use when writing MATLAB/Octave scripts for linear algebra, signal processing, image processing, differential equations, optimization, statistics, or creating scientific visualizations. Also use when the user needs help with MATLAB syntax, functions, or wants to convert between MATLAB and Python code. Scripts can be executed with MATLAB or the open-source GNU Octave interpreter.

5.2K

Jan 9, 2026, 04:57 PM

neurokit2

Data & Analytics

Comprehensive biosignal processing toolkit for analyzing physiological data including ECG, EEG, EDA, RSP, PPG, EMG, and EOG signals. Use this skill when processing cardiovascular signals, brain activity, electrodermal responses, respiratory patterns, muscle activity, or eye movements. Applicable for heart rate variability analysis, event-related potentials, complexity measures, autonomic nervous system assessment, psychophysiology research, and multi-modal physiological signal integration.

5.2K

Jan 9, 2026, 04:57 PM

pyhealth

Data & Analytics

Comprehensive healthcare AI toolkit for developing, testing, and deploying machine learning models with clinical data. This skill should be used when working with electronic health records (EHR), clinical prediction tasks (mortality, readmission, drug recommendation), medical coding systems (ICD, NDC, ATC), physiological signals (EEG, ECG), healthcare datasets (MIMIC-III/IV, eICU, OMOP), or implementing deep learning models for healthcare applications (RETAIN, SafeDrug, Transformer, GNN).

5.2K

Jan 9, 2026, 04:57 PM

pyopenms

Data & Analytics

Complete mass spectrometry analysis platform. Use for proteomics workflows feature detection, peptide identification, protein quantification, and complex LC-MS/MS pipelines. Supports extensive file formats and algorithms. Best for proteomics, comprehensive MS data processing. For simple spectral comparison and metabolite ID use matchms.

5.2K

Jan 9, 2026, 04:57 PM

research-lookup

Data & Analytics

Look up current research information using Perplexity Sonar Pro Search or Sonar Reasoning Pro models through OpenRouter. Automatically selects the best model based on query complexity. Search academic papers, recent studies, technical documentation, and general research information with citations.

5.2K

Jan 9, 2026, 04:57 PM

scientific-brainstorming

Data & Analytics

Creative research ideation and exploration. Use for open-ended brainstorming sessions, exploring interdisciplinary connections, challenging assumptions, or identifying research gaps. Best for early-stage research planning when you do not have specific observations yet. For formulating testable hypotheses from data use hypothesis-generation.

5.2K

Jan 9, 2026, 04:57 PM

scientific-critical-thinking

Data & Analytics

Evaluate scientific claims and evidence quality. Use for assessing experimental design validity, identifying biases and confounders, applying evidence grading frameworks (GRADE, Cochrane Risk of Bias), or teaching critical analysis. Best for understanding evidence quality, identifying flaws. For formal peer review writing use peer-review.

5.2K

Jan 9, 2026, 04:57 PM

scikit-bio

Data & Analytics

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

5.2K

Jan 9, 2026, 04:57 PM

scvi-tools

Data & Analytics

Deep generative models for single-cell omics. Use when you need probabilistic batch correction (scVI), transfer learning, differential expression with uncertainty, or multi-modal integration (TOTALVI, MultiVI). Best for advanced modeling, batch effects, multimodal data. For standard analysis pipelines use scanpy.

5.2K

Jan 9, 2026, 04:57 PM

statistical-analysis

Data & Analytics

Guided statistical analysis with test selection and reporting. Use when you need help choosing appropriate tests for your data, assumption checking, power analysis, and APA-formatted results. Best for academic research reporting, test selection guidance. For implementing specific models programmatically use statsmodels.

5.2K

Jan 9, 2026, 04:57 PM

torch-geometric

Data & Analytics

Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for geometric deep learning.

5.2K

Jan 9, 2026, 04:57 PM

umap-learn

Data & Analytics

UMAP dimensionality reduction. Fast nonlinear manifold learning for 2D/3D visualization, clustering preprocessing (HDBSCAN), supervised/parametric UMAP, for high-dimensional data.

5.2K

Jan 9, 2026, 04:57 PM

vaex

Data & Analytics

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that do not fit in memory.

5.2K

Jan 9, 2026, 04:57 PM

sql-injection-testing

Data & Analytics

by Ed1s0nZ

SQL注入测试的专业技能和方法论

5.2K

Jan 15, 2026, 02:00 PM

query-builder

Data & Analytics

by clidey

Convert natural language questions into SQL queries. Activates when users ask data questions in plain English like "show me users who signed up last week" or "find orders over $100".

4.4K

Jan 8, 2026, 10:17 PM

schema-designer

Data & Analytics

by clidey

Help design database schemas, create tables, and plan data models. Activates when users ask to create tables, design schemas, or model data relationships.

4.4K

Jan 8, 2026, 10:17 PM

whodb

Data & Analytics

by clidey

Database operations including querying, schema exploration, and data analysis. Activates for tasks involving PostgreSQL, MySQL, MariaDB, SQLite, MongoDB, Redis, Elasticsearch, or ClickHouse databases.

4.4K

Jan 8, 2026, 10:17 PM

hunt-analytics-generation

Data & Analytics

by OTRF

Generate query-agnostic analytics that model adversary behavior by translating hunt investigative intent into analytic definitions grounded in schema semantics. This skill is used to define how behavior should manifest in data before query execution or validation, and works best when informed by system internals, adversary tradecraft, a structured hunt focus, and suggested data sources.

4.4K

Jan 12, 2026, 12:17 AM

hunt-blueprint-generation

Data & Analytics

by OTRF

Assemble a complete hunt blueprint by consolidating outputs from prior hunt planning skills into a single, structured plan for execution. Use this skill after system and tradecraft research, hunt focus definition, data source identification, and analytics generation have been completed. This skill is synthesis and packaging only and must not introduce new research, assumptions, or analytics.

4.4K

Jan 12, 2026, 12:17 AM

hunt-data-source-identification

Data & Analytics

by OTRF

Identify relevant security data sources that could capture the behavior defined in a structured hunt hypothesis. Use this skill after the hunt focus has been defined to translate investigative intent into candidate telemetry sources using existing platform catalogs. This skill supports hunt planning by reasoning over available schemas and metadata before analytics development or query execution.

4.4K

Jan 12, 2026, 12:17 AM

hunt-focus-definition

Data & Analytics

by OTRF

Define a focused hunt hypothesis by synthesizing completed system internals and adversary tradecraft research. Use this skill after research has been completed to narrow a high-level hunt topic into a single, concrete attack pattern with clear investigative intent. This skill produces a structured, testable hypothesis and should be used before selecting data sources, defining environment scope, or developing analytics.

4.4K

Jan 12, 2026, 12:17 AM

thealgorithm

Data & Analytics

by danielmiessler

Universal execution engine using scientific method to achieve ideal state. USE WHEN complex tasks, multi-step work, "run the algorithm", "use the algorithm", OR any non-trivial request that benefits from structured execution with ISC (Ideal State Criteria) tracking.

4.4K

Jan 11, 2026, 11:51 PM

creating-financial-models

Data & Analytics

by modelscope

This skill provides an advanced financial modeling suite with DCF analysis, sensitivity testing, Monte Carlo simulations, and scenario planning for investment decisions

3.8K

Jan 9, 2026, 10:01 AM

infographic-creator

Data & Analytics

by antvis

Create beautiful infographics based on the given text content. Use this when users request creating infographics.

3.7K

Jan 12, 2026, 03:40 AM

docetl

Data & Analytics

by ucbepic

Build and run LLM-powered data processing pipelines with DocETL. Use when users say "docetl", want to analyze unstructured data, process documents, extract information, or run ETL tasks on text. Helps with data collection, pipeline creation, execution, and optimization.

3.4K

Dec 30, 2025, 01:47 AM

linux-production-shell-scripts

Data & Analytics

This skill should be used when the user asks to "create bash scripts", "automate Linux tasks", "monitor system resources", "backup files", "manage users", or "write production shell scripts". It provides ready-to-use shell script templates for system administration.

3.0K

Jan 12, 2026, 12:44 AM

sql-injection-testing

Data & Analytics

This skill should be used when the user asks to "test for SQL injection vulnerabilities", "perform SQLi attacks", "bypass authentication using SQL injection", "extract database information through injection", "detect SQL injection flaws", or "exploit database query vulnerabilities". It provides comprehensive techniques for identifying, exploiting, and understanding SQL injection attack vectors across different database systems.

3.0K

Jan 12, 2026, 12:44 AM

sqlmap-database-penetration-testing

Data & Analytics

This skill should be used when the user asks to "automate SQL injection testing," "enumerate database structure," "extract database credentials using sqlmap," "dump tables and columns from a vulnerable database," or "perform automated database penetration testing." It provides comprehensive guidance for using SQLMap to detect and exploit SQL injection vulnerabilities.

3.0K

Jan 12, 2026, 12:44 AM

wireshark-network-traffic-analysis

Data & Analytics

This skill should be used when the user asks to "analyze network traffic with Wireshark", "capture packets for troubleshooting", "filter PCAP files", "follow TCP/UDP streams", "detect network anomalies", "investigate suspicious traffic", or "perform protocol analysis". It provides comprehensive techniques for network packet capture, filtering, and analysis using Wireshark.

3.0K

Jan 12, 2026, 12:44 AM

index-at-creation

Data & Analytics

by parcadei

Index at Creation Time

2.8K

Jan 11, 2026, 08:18 PM

leann-search

Data & Analytics

by parcadei

Semantic search across codebase using LEANN vector index

2.8K

Jan 11, 2026, 08:18 PM

rudin-real-complex-analysis

Data & Analytics

by parcadei

Problem-solving with Rudin's Real and Complex Analysis textbook

2.8K

Jan 11, 2026, 08:18 PM

langchain-data-handling

Data & Analytics

by jeremylongshore

Implement LangChain data privacy and handling best practices.Use when handling sensitive data, implementing PII protection,or ensuring data compliance in LLM applications.Trigger with phrases like "langchain data privacy", "langchain PII","langchain GDPR", "langchain data handling", "langchain compliance".

2.5K

May 17, 2026, 06:48 AM

06-database-script-management

Data & Analytics

数据库脚本管理规范，涵盖 DDL/DML 脚本编写、版本命名规则、增量更新策略、数据迁移、回滚方案。当用户编写数据库脚本、新增表结构、修改字段、进行数据迁移或管理 SQL 版本时使用。

2.5K

Jan 9, 2026, 09:52 AM

23-database-sharding

Data & Analytics

数据库分片指南，涵盖分片策略设计、分片键选择、跨分片查询、数据迁移、分片路由规则。当用户设计数据库分片、选择分片键、处理跨分片查询或进行分片数据迁移时使用。

2.5K

Jan 9, 2026, 09:52 AM

29-4-process-dao-database

Data & Analytics

Process 模块 DAO 层与数据库表结构详细分析，涵盖 JOOQ 使用、表结构设计、索引优化、数据分片。当用户开发 Process 数据访问、设计表结构、优化查询性能或处理数据存储时使用。

2.5K

Jan 9, 2026, 09:52 AM

44-database-design

Data & Analytics

BK-CI 数据库设计规范与表结构指南，涵盖命名规范、字段类型选择、索引设计、分表策略、数据归档。当用户设计数据库表、优化索引、规划分表策略或进行数据库架构设计时使用。

2.5K

Jan 9, 2026, 09:52 AM

data-fetching

Data & Analytics

by expo

Use when implementing or debugging ANY network request, API call, or data fetching. Covers fetch API, axios, React Query, SWR, error handling, caching strategies, offline support.

2.2K

Jan 20, 2026, 09:11 PM

obsidian-bases

Data & Analytics

by davepoon

2.2K

Jan 12, 2026, 04:25 AM

server-components

Data & Analytics

by davepoon

This skill should be used when the user asks about "Server Components", "Client Components", "'use client' directive", "when to use server vs client", "RSC patterns", "component composition", "data fetching in components", or needs guidance on React Server Components architecture in Next.js.

2.2K

Jan 12, 2026, 04:25 AM

research-lookup

Data & Analytics