LogoLogo
TournamentsHedge FundDiscordGitHubForumDocs
  • Numerai Tournament
    • Overview
    • Data
    • Models
    • Submissions
      • Model Uploads
      • Compute Heavy
      • NGROK Example
      • CRON Example
    • Scoring
      • Definitions
      • Correlation (CORR)
      • Meta Model Contribution (MMC)
      • Feature Neutral Correlation (FNC)
      • Grandmasters & Seasons
    • Staking
    • Bounties
  • Numerai Signals
    • Overview
    • Data
    • Models
    • Submissions
    • Scoring
      • Definitions
    • Staking
    • Signals + QuantConnect
  • Numerai Crypto
    • Overview
    • Data
    • Submissions
    • Scoring
      • Definitions
    • Staking
  • Community
    • Discord
    • Forum
    • Twitter
    • Youtube
    • Apps & Content
      • Office Hours with Arbitrage
        • Office Hours Recaps: Season 1
          • OHwA S01E01
          • OHwA S01E02
          • OHwA S01E03
          • OHwA S01E04
          • OHwA S01E05
          • OHwA S01E06
          • OHwA S01E07
          • OHwA S01E08
          • OHwA S01E09
          • OHwA S01E10
          • OHwA S01E11
          • OHwA S01E12
        • Office Hours Recaps: Season 2
          • OHwA S02E01
          • OHwA S02E02
          • OHwA S02E03
          • OHwA S02E04
          • OHwA S02E05
          • OHwA S02E06
          • OHwA S02E07
          • OHwA S02E08
          • OHwA S02E09
          • OHwA S02E10
          • OHwA S02E11
        • Office Hours Recaps: Season 3
          • OHwA S03E01
          • OHwA S03E02
          • OHwA S03E03
          • OHwA S03E04
        • Office Hours Season 4
        • FAQ in Office Hours
        • Cited resources
      • Structure of Numerai by Wigglemuse
  • NMR
    • Coinbase
    • Uniswap
    • Etherscan
    • Github
  • Connect
    • Index
Powered by GitBook
On this page
  • Introduction
  • Data
  • Modeling
  • Submissions
  • Scoring
  • Staking
  • FAQ
  • Support
  1. Numerai Signals

Overview

Everything you need to know about Numerai Signals.

PreviousBountiesNextData

Last updated 29 days ago

Introduction

While the provides free data, asks you to bring your own data - your own unique signal. A signal in the stock market is a feed of information. It is numerical data about stocks that can be used by quantitative hedge funds like Numerai. These feeds of information are used to model the stock market and construct portfolios (collections of stock holdings).

Examples of stock market signals include:

  • (, , )

  • (, , )

  • (, , )

  • (, )

If you're a data provider you can submit unique features directly as signals. If you're a data scientist, you can model unique data to submit predictions as signals. Signals are then scored against our targets and other submitted signals. Signals can be staked with the NMR cryptocurrency to earn (or burn) NMR based on performance.

Numerai Signals is one part of the Numerai master plan to build the world's last hedge fund. Read the and watch to learn more about how it all fits together.

Data

You can get started with the Signals V1 Data:

from numerapi import NumerAPI
import pandas as pd

napi = NumerAPI()
# Use int8 to save on storage and memory
napi.download_dataset("signals/v2.0/train.parquet")
training_data = pd.read_parquet("signals/v2.0/train.parquet")

Modeling

Here is a an example of a Signals model:

import lightgbm as lgb

features = [
      f for f in training_data.columns
      # there are two non-numerical feature cols
      if "feature" in f and f not in ("feature_country", "feature_exchange_code")
]

model = lgb.LGBMRegressor(
      n_estimators=2000,
      learning_rate=0.01,
      max_depth=5,
      num_leaves=2 ** 5,
      colsample_bytree=0.1
)
model.fit(
      training_data[features],
      training_data["target"]
)

Note: This model is incredibly basic and will likely have no performance as it's trained against the basic data we already have.

Submissions

The list of stock tickers in your submission are defined by the Numerai Signals stock market universe. And the numerical values should be between 0 and 1.

Here is an example of how you generate and upload live predictions in Python:

# Use API keys to authenticate
sapi = SignalsAPI("[your api public id]", "[your api secret key]")

# Download latest live features
sapi.download_dataset(f"signals/v2.0/live.parquet")
live_data = pd.read_parquet(f"signals/v2.0/live.parquet")

# Generate live predictions
live_predictions = model.predict(live_data[features])

# Format and save submission
submission = pd.Series(
    live_predictions, index=live_features.index
).to_frame("prediction")
submission.to_csv(f"submission.csv")

# Upload submission
sapi.upload_predictions(f"submission.csv", model_id="your-model-id")

Scoring

The primary difference is that Numerai first neutralizes each signal to our own existing signals - extracting the unique component of each signal. The targets used to evaluate signals are also neutralized to these signals. This highlights the key unique aspect of Signals: Numerai Signals is not about predicting stock returns, it is about finding original signals that Numerai doesn't already have.

Furthermore, Signals Grandmasters has different tier qualifications.

Staking

It is important to note that the opportunity to stake your signal is not an offer by Numerai to participate in an investment contract, a security, a swap based on the return of any financial assets, an interest in Numerai’s hedge fund, or in Numerai itself or any fees we earn. Payouts will be made at our discretion, based on a blackbox target that will not be disclosed to users. Fundamentally, Numerai Signals is a service offered by Numerai that allows users to assess the value of their signals, using NMR staking as a way to validate “real” signals. In return, Numerai uses the staked signals and related data in the Numerai hedge fund. Users with different expectations should not stake signals.

FAQ

What is a "Ticker" or "Symbol"?

A stock ticker / symbol is an arrangement of characters—usually letters—representing publicly-traded securities on an exchange.

What is a stock universe?

What is the difference between Numerai Tournament and Numerai Signals?

Bring your own data. For Signals, users have to use their own features. Numerai does provide a small set of features to guide feature development, but Signals users must collect and process their own features to train models for success in the tournament.

Where can I find stock price data and data in general?

Is my code / data protected from Numerai?

Yes. Numerai does not view the code that builds your data or predictions. Numerai only receives the predictions themselves. Reverse-engineering your code is nearly impossible and Numerai is uninterested in doing this.

Do I have to stake NMR in order to participate?

No. You can submit your prediction file and receive performance without staking.

Why shouldn't I just trade on my own?

Can you tell me if my diagnostics are good?

No. It's trivial to overfit a signal on validation and it would be impossible to tell purely based on diagnostics. We recommend that you submit your predictions without stake and view the live out-of-sample performance over time. This is often called "paper trading" and frequent timely submissions will give you plenty of feedback beyond diagnostics.

Do I need know coding / finance in order to participate?

Support

This data is primarily meant to be used to either add to your own data or your signal, because it's data we already have and know how to use. You will need to acquire distinct and unique stock market data to generate a high quality signal. There are a number of other data providers you can also use to get started such as , and . There are also platforms that make it easy to create signals such as , and . Check out this for a list of data sources, platforms, and tools.

If you don't have stock market data, try the instead.

If you're not sure where to start with modeling your data, there are a list of tutorial models . We also highly recommend learning how to do over on the Numerai Tournament. Once you're confident in your modeling skills, they should transfer to Signals.

Signals submissions are very similar to the Numerai Tournament (see ), except you use stock tickers instead of ids. Your signals should be a list of stock tickers each with a numerical value:

Numerai has a number of requirements for the structure of your Signals Submissions. To understand all of these requirements, read more about submissions .

Scoring signals is also very similar to the Numerai Tournament (see ).

See the section for more details.

Just as with the Numerai Tournament, you can optionally stake on your model to earn or burn based on performance. For more information please read .

Please read our for further information.

A set of stock tickers / symbols that can be traded. Numerai wants predictions for only these symbols. See for details.

Check out

You can and we can't stop. However, Numerai is equipped to tell you if your signal is already known and being used by everyone else - it's not very valuable to trade a signal that everyone else is already trading. .

While example models and exploratory notebooks are available to get started, people new to coding and finance are encouraged to try rather than Signals. Signals requires a high-level of coding and finance knowledge to be successful.

Find us on for questions, support, and feedback!

Numerai Tournament
Numerai Signals
Fundamental signals
P/E ratio
dividend yield
analyst ratings
Technical signals
MACD
RSI
MFI
Alternative data signals
credit card transactions
satellite images
social media sentiment
Blended signals
Barra risk factors
Fama French factors
Medium Post
the short film
Numerai Tournament
here
basic modeling
here
here
Scoring
NMR
the Staking docs
Terms of Service
here
this discussion at the forum
Learn more in this Medium post
Numerai Tournament
Discord
Yahoo Finance
Quandl
Koyfin
QuantConnect
Alpaca
forum thread
neutralize
here
An example stock market signal