Earnings calls play a pivotal role in shaping investor perceptions. The quality of communication between executives and analysts can significantly influence company performance. On-topic and proactive executives, who deliver proactive presentations, anticipate market queries, and provide clear, on-topic answers to analysts' questions—consistently outperform their peers. Conversely, off-topic and reactive executives, who fail to address analysts' key inquiries during presentations, and provide off-topic responses—significantly underperform.

Executives' ability to anticipate investor concerns and maintain a focused dialogue fosters confidence and strategic communication. In contrast, failing to provide clarity when analysts seek additional information can lead to misalignment and breakdowns in transparency. A long (short) portfolio of on-topic and proactive (off-topic and reactive) generates +515bps of annualized alpha.

This QuickStart with its notebook serves as a introduction for the research detailed in Quantitative Research & Solutions' recent publication, "Questioning the Answers: LLM's enter the Boardroom." It analyses executive on-topicness and proactiveness using the analysts questions, executives answers and LLM answers. This research harnesses alpha using LLM tools, including vector embeddings, vector cosine similarity, and the LLM question answering. There is a longer version available upon request that also covers how to create the input data from the datasets described in section 2, please reach out to QRS@spglobal.com for access to the longer version.

Through this QuickStart, you will learn how to use Snowflake Notebooks and Snowflake Cortex LLM functions on earnings call using the Machine Readable Transcripts dataset from S&P Global Market Intelligence.

What You'll Learn

What is Snowflake Cortex?

Snowflake Cortex is an intelligent, fully managed service that offers machine learning and AI solutions to Snowflake users. Snowflake Cortex capabilities include:

LLM Functions: SQL and Python functions that leverage large language models (LLMs) for understanding, querying, translating, summarizing, and generating free-form text.

ML Functions: SQL functions that perform predictive analysis such as forecasting and anomaly detection using machine learning to help you gain insights into your structured data and accelerate everyday analytics.

Learn more about Snowflake Cortex.

S&P Global Market Intelligence datasets

The "Questioning the Answers: LLM's enter the Boardroom." research is using the datasets below from the Snowflake Marketplace. Access to those are not necessary for running this QuickStart, where we are using a sample dataset.

To reproduce the full research using the complete datasets then request access to those below using the links or contact SnowflakeMarketplace@spglobal.com.

Name

Description

S&P Capital IQ Financials

S&P Capital IQ Financials provides global standardized financial statement data for over 180,000 companies, including over 95,000 active and inactive public companies, and As Reported data for over 150,000 companies. S&P Capital IQ Standardized Financials allows you to extend the scope of your historical analysis and back-testing models with consistent data from all filings of a company's historical financial periods including press releases, original filings, and all restatements.

Global Events

The Global Events dataset provides details on upcoming and past corporate events such as earnings calls, shareholder/analyst meetings, expected earnings release dates and more. With deep history back to 2003, clients can leverage this dataset to derive signals and support trading models across asset classes, trading styles and frequencies. This dataset also helps in research & analysis, risk management & compliance, and trade surveillance workflows.

Machine Readable Transcripts

The Machine Readable Transcripts dataset aggregates data from earnings calls delivered in a machine-readable format for Natural Language Processing (NLP) applications with metadata tagging. Leverage Machine Readable Transcripts to keep track of event information for specific companies including dates, times, dial-in and replay numbers and investor relations contact information. Easily combine data from earnings, M&A, guidance, shareholder, company conference presentations and special calls with traditional datasets to develop proprietary analytics.

Prerequisites

What You'll Build

In this QuickStart we will analyse executive on-topicness and proactiveness using the analysts' questions, executives' answers and LLM answers with the help of Snowflake Cortex AI functions.

Overview

This section covers cloning of the GitHub repository, creating the needed Snowflake objects (i.e role, warehouse, database, schema, etc..) and importing the notebook to be used.

Clone The Git Repository

The very first step is to clone the GitHub repository. This repository contains all the code you will need to successfully complete this QuickStart Guide.

Using HTTPS:

git clone https://github.com/Snowflake-Labs/sfguide-s-and-p-market-intelligence-analyze-earnings-transcripts-in-cortex-ai.git

OR, using SSH:

git clone git@https://github.com/Snowflake-Labs/sfguide-s-and-p-market-intelligence-analyze-earnings-transcripts-in-cortex-ai.git

You can also use the Git integration feature of Snowflake Notebooks, in order to do that you need to fork the GitHub repository to be allowed to commit changes. For instructions how to set up Git integration for your Snowflake account see here and for using it with Snowflake Notebooks see this page. You can refer to this video for a walkthrough on how you can set up Git integration and use it with Snowflake Notebooks.

Create Database, Schema, Table And Warehouse To Be Used

Run the following SQL commands in a SQL worksheet to create the objects needed for running this QuickStart. You can also find the code in the setup.sql file.

USE ROLE ACCOUNTADMIN;

CREATE DATABASE IF NOT EXISTS SP_LLM_QS;

CREATE WAREHOUSE IF NOT EXISTS SP_LLM_QS_WH;

USE DATABASE SP_LLM_QS;

-- Create a file format to be used when loading the sample data
CREATE or REPLACE file format csvformat
	TYPE=CSV
    PARSE_HEADER = TRUE
    FIELD_DELIMITER='|'
    TRIM_SPACE=TRUE
    FIELD_OPTIONALLY_ENCLOSED_BY='"'
    REPLACE_INVALID_CHARACTERS=TRUE
    DATE_FORMAT=AUTO
    TIME_FORMAT=AUTO
    TIMESTAMP_FORMAT=AUTO;

-- Create a stage that reference a AWS s3 bucket that has the sample data file
CREATE or REPLACE stage sp_data_stage
  file_format = csvformat
  url = 's3://sfquickstarts/sfguide_s_and_p_market_intelligence_analyze_earnings_transcripts_in_cortex_ai/';

-- Verify that we can read the S3 bucket from snowflake
ls @sp_data_stage;

-- Create the table to load the data into
CREATE OR REPLACE TABLE SAMPLE_TRANSCRIPT ( 
    CALLDATE DATE , 
    ENTEREDDATE DATE , 
    FISCALYEARQUARTER VARCHAR , 
    CALENDARYEARQUARTER VARCHAR , 
    TRADINGITEMID NUMBER(38, 0) , 
    COMPANYID NUMBER(38, 0) , 
    COMPANYNAME VARCHAR , 
    HEADLINE VARCHAR , 
    TRANSCRIPTID NUMBER(38, 0) , 
    SPEAKERTYPENAME VARCHAR , 
    TRANSCRIPTPERSONNAME VARCHAR , 
    TRANSCRIPTPERSONID NUMBER(38, 0) , 
    PROID NUMBER(38, 1) , 
    TRANSCRIPTCOMPONENTTYPEID NUMBER(38, 0) , 
    TRANSCRIPTCOMPONENTTYPENAME VARCHAR , 
    TRANSCRIPTCOMPONENTID NUMBER(38, 0) , 
    COMPONENTORDER NUMBER(38, 0) , 
    SENTENCEORDER NUMBER(38, 0) , 
    COMPONENTTEXT VARCHAR , 
    PROCESSEDTEXT VARCHAR 
); 

-- Load the data
COPY INTO SAMPLE_TRANSCRIPT
  FROM @sp_data_stage
    MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE;

-- Verify that we have data in our table
SELECT * FROM SAMPLE_TRANSCRIPT LIMIT 10;

Create Snowflake Notebook

Navigate To Snowflake Notebooks

  1. Navigate to the Notebooks section by clicking Projects and then NotebooksNavigate to Notebooks
  2. Click on the *down arrow next to + NotebookNew notebook drop down
  3. If you have set up a git integration, choose Create from repository. If not, then choose import .ipynb file.New notebook from menu

Import .ipynb File

  1. Navigate to where you have cloned the GitHub repository and select 0_start_here.ipynb and click Open
    Select Notebook File
  2. Give the notebook a name, then select SP_LLM_QS and PUBLIC for Notebook location, SP_LLM_QS_WH for Notebook warehouse and click Create
    Select Notebook File

Create From Repository

If you have forked the GitHub repository and created a Git integration to it in Snowflake you can open the notebook directly from the repository. See here for instructions on how to set up Git integration.

  1. In the Create Notebook from Repository dialog, click on Select .ipynb file
    Create Notebook from Repository Dialog
  2. In the Select file from Git Repository dialog, click on the repository integration you are using and select the notebook file and click Select File, if you do not see the file press Fetch to refresh with the latest changes from the repository
  3. Back to the Create Notebook from Repository dialog, give the notebook a name, then select SP_LLM_QS, PUBLIC for Notebook location and SP_LLM_QS_WH for Notebook warehouse and click Create

During this step you will learn how to use Snowflake Cortex to analyse executive on-topicness and proactiveness using the analysts' questions, executives' answers and LLM answers.

This includes:

Follow along and run each of the cells in the Notebook.

Congratulations, you have successfully completed this QuickStart! Through this QuickStart, we were able to showcase how you can use Snowflake Notebooks and Snowflake Cortex LLM functions on earnings call using the Machine Readable Transcripts and additional dataset from S&P Global Market Intelligence.

What You Learned

In this QuickStart you have learned how to use Snowflake Cortex to analyse executive on-topicness and proactiveness using the analysts' questions, executives' answers and LLM answers by:

Related Resources