Skip to content

End-to-end financial text-analysis using Bigdata API and the Bigdata-Research-Tools library. Ready-to-use notebooks with RAG & GenAI enabling thematic and risk screening, trend tracking, and automated report generation, extracting insights at scale.

License

Notifications You must be signed in to change notification settings

Bigdata-com/bigdata-cookbook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bigdata Cookbook

A comprehensive collection of financial analysis tools and report generators built on the Bigdata API and research tools. This repository contains ready-to-use notebooks for thematic screening, narrative mining, and various sector-specific analyses including pricing power, AI disruption risks, and regulatory issues in the technology sector.

Features

  • Client-Ready: Each project is self-contained with its own dependencies and documentation
  • Easy Setup: Uses Docker for containerized deployment or uv for fast, reliable dependency management
  • Comprehensive Analysis: Combines multiple data sources for robust insights
  • Professional Output: Generates Excel reports, HTML visualizations, and structured data
  • Modular Design: Each project can be run independently

Projects

Automated Thematic Analysis and Screening Tool

  • Thematic identification and categorization across multiple sectors
  • Automated screening based on thematic criteria
  • Theme tracking and evolution analysis
  • Investment opportunity identification through thematic lenses

Automated Analysis of Pricing Power Narratives and Competitive Positioning

  • Assesses competitive positioning across company watchlists
  • Provides sector-wide comparative analysis
  • Tracks temporal evolution of pricing narratives
  • Implements confidence scoring system for pricing power signals

Automated Analysis of AI Threats and Opportunities in Technology Companies

  • Evaluates AI disruption risks and proactive AI adoption
  • Provides standardized scoring for cross-company comparison
  • Generates investment intelligence from AI transformation narratives
  • Creates structured reports ranking companies by AI resilience

Automated Analysis of Regulatory Risks and Company Mitigation Strategies

  • Maps sector-wide regulatory issues across technology domains
  • Quantifies company-specific regulatory risks
  • Extracts mitigation strategies from corporate communications
  • Provides structured reporting on regulatory intensity and business impact

Automated Risk Analysis and Assessment Tool

  • Comprehensive risk assessment across multiple risk dimensions
  • Quantitative risk modeling with statistical analysis
  • Risk visualization and reporting capabilities
  • Automated risk scoring and ranking systems

Automated Narrative Analysis and Mining Tool

  • Narrative extraction and pattern recognition from unstructured data
  • Sentiment analysis and narrative sentiment tracking
  • Narrative evolution and temporal analysis
  • Automated narrative scoring and ranking systems

Automated Analysis of Board Member and Management Activity Exposure

  • Comprehensive person tracking across multiple name variations and contexts
  • Company-specific filtering ensuring relevance to monitored organizations
  • Multi-mode search precision from strict entity matching to broader coverage
  • Temporal analysis showing how coverage patterns evolve over time
  • Entity-specific monitoring using bigdata's entity tracking capabilities

Automated Analysis of Liquid Cooling Technology Providers and Adopters

  • Dual-role classification distinguishing technology providers from adopters
  • Network analysis mapping provider-customer relationships in the cooling ecosystem
  • Temporal tracking of adoption patterns and market evolution
  • Market positioning analysis with confidence scoring for investment decisions
  • Comprehensive ecosystem mapping for infrastructure investment intelligence

Automated Analysis of Corporate Perspectives on Electoral Outcomes

  • Positive vs. negative impact assessment distinguishing companies that expect benefits from those anticipating challenges under new elected officials' policies
  • Sector-wide political exposure mapping revealing industry patterns in positioning toward electoral results
  • Temporal positioning tracking showing how political expectations evolve over time
  • Corporate-political topic networks identifying key policy themes and company concerns through relationship analysis

Automated Analysis of AI Cost Cutting Providers and Users

  • Dual-role classification distinguishing companies developing AI cost cutting solutions from those implementing them
  • Technology ecosystem mapping revealing relationships between solution providers and corporate users
  • Adoption timeline tracking showing how AI cost cutting implementation evolves across different sectors
  • Market positioning analysis quantifying each company's role and exposure in the AI cost cutting ecosystem

Automated Analysis of AI Revenue Generation Providers and Users

  • Dual-role classification distinguishing companies developing AI revenue generation solutions from those implementing them
  • Technology ecosystem mapping revealing relationships between solution providers and corporate users
  • Adoption timeline tracking showing how AI revenue generation implementation evolves across different companies
  • Market positioning analysis quantifying each company's role and exposure in the AI revenue generation ecosystem

Automated Macroeconomic Inflation Analysis Tool

  • Automated theme breakdown into specific inflation components and drivers
  • Systematic document analysis using embeddings-based search and classification
  • Economic categorization that turns narrative signals into structured insights
  • Comprehensive reporting with analytical summaries for each inflation driver covering demand-pull, cost-push, wage increases, global factors, and monetary policy impacts

Automated Central Bank Announcements Monitoring and Analysis Tool

  • Lexicon generation of monetary policy and central bank-specific terminology
  • Real-time content retrieval via Bigdata API with parallelized keyword searches
  • Topic clustering and selection with AI-powered verification and ranking
  • Custom report generation with configurable ranking systems for trending topics
  • Market impact assessment scoring topics for trendiness, novelty, and magnitude

Automated Crude Oil Market Monitoring and Analysis Tool

  • Lexicon generation of crude oil industry-specific terminology and jargon
  • Real-time content retrieval via Bigdata API with parallelized keyword searches
  • Topic clustering and selection with AI-powered verification and ranking
  • Custom report generation with configurable ranking systems for trending topics
  • Market impact assessment scoring topics for trendiness, novelty, and magnitude

Quick Start

Prerequisites

For Docker Installation

  • Docker installed on your system
  • Bigdata API access
  • OpenAI API key (for advanced features)

For Local Installation

  • Python 3.8 or higher
  • uv package manager
  • Bigdata API access
  • OpenAI API key (for advanced features)

Clone repository

Clone the repository to your local computer. Please follow the below steps:

  • Navigate your local computer to the folder where you want to clone the repo and run the following command:
git clone https://github.com/Bigdata-com/bigdata-cookbook.git

Installation

Each project supports both Docker and local installation methods:

  • Docker Installation: Each project includes a Dockerfile for containerized deployment
  • Local Installation: Traditional installation using Python and uv package manager

Each project has its own detailed README with specific installation and usage instructions for both methods.

Project Structure

bigdata-cookbook/
├── Pricing_Power_Analysis/                          # Pricing power analysis
│   ├── Pricing Power.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Report_Generator_AI_Threats/                      # AI risk analysis
│   ├── Report Generator_ AI Disruption Risk.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Report_Generator_Regulatory_Isses_in_Tech/        # Regulatory analysis
│   ├── Report Generator_ Regulatory Issues.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Risk_Analyzer/                                    # Risk analysis tool
│   ├── Risk_Analyzer.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Thematic_Screener/                                # Thematic analysis tool
│   ├── ThematicScreener.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Narrative_Miners/                                 # Narrative analysis tool
│   ├── NarrativeMiner.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Board_Management_Monitoring/                      # Board monitoring tool
│   ├── Board_Management_Monitoring.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Liquid_Cooling_Market_Watch/                      # Liquid cooling analysis
│   ├── Liquid_Cooling_Market_Watch.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Election_Monitor/                               # Elecion Monitoring tool
│   ├── Trump_Reelection_Impact_Analisys.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── AI_Cost_Cutting_Market_Analysis/                # AI cost cutting analysis
│   ├── AI_Cost_Cutting_Market_Analysis.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── AI_Revenue_Generation_Market_Analysis/          # AI revenue generation analysis
│   ├── AI_Revenue_Generation_Market_Analysis.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Tracking_Inflation_Drivers/                     # Inflation analysis tool
│   ├── Tracking_Inflation_Drivers.ipynb
│   ├── src/
│   ├── requirements.txt
│   └── README.md
├── Daily_Digest_Central_Banks/                      # Central bank monitoring
│   ├── Daily_Digest_Central_Banks.ipynb
│   ├── src/
│   ├── assets/
│   ├── report/
│   ├── requirements.txt
│   ├── Dockerfile
│   └── README.md
├── Daily_Digest_Crude_Oil/                          # Crude oil market analysis
│   ├── Daily_Digest_Crude_Oil.ipynb
│   ├── src/
│   ├── assets/
│   ├── report/
│   ├── requirements.txt
│   ├── Dockerfile
└── README.md                                        # This file

Requirements

Core Dependencies

  • bigdata-client>=2.17.0 - Bigdata API client
  • bigdata-research-tools==0.17.2 - Research analysis tools
  • nest-asyncio>=1.6.0 - Async compatibility
  • matplotlib>=3.0.0 - Data visualization
  • numpy>=1.20.0 - Numerical computing
  • pandas>=1.3.0 - Data manipulation
  • jupyter>=1.0.0 - Notebook environment

Optional Dependencies

  • seaborn>=0.11.0 - Statistical visualizations
  • plotly>=5.0.0 - Interactive plots
  • ipython>=7.0.0 - Enhanced Python shell

Usage

Each project follows a similar workflow:

  1. Setup: Install dependencies and configure credentials
  2. Data Collection: Fetch relevant data from Bigdata platform
  3. Analysis: Run the analysis pipeline
  4. Reporting: Generate Excel and HTML reports
  5. Visualization: Create charts and insights

Support

  • Each project has its own detailed README with specific instructions
  • Check the individual project documentation for troubleshooting
  • Ensure you have valid Bigdata API credentials before running analyses

License

This project is licensed under the terms specified in the LICENSE file.


Note: This repository contains financial analysis tools. Please ensure compliance with relevant regulations and use appropriate risk management practices when making investment decisions based on these analyses.

About

End-to-end financial text-analysis using Bigdata API and the Bigdata-Research-Tools library. Ready-to-use notebooks with RAG & GenAI enabling thematic and risk screening, trend tracking, and automated report generation, extracting insights at scale.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •