Last Updated On: August 19, 2025

This article explains how community administrators can connect to the community data that Gainsight exposes in Amazon S3.

Overview

Community managers and administrators can enhance their data analysis capabilities by copying data from Amazon S3 to local warehouses. While they can query data directly from the Gainsight Customer Communities (CC) data lake using Business Intelligence (BI) tools such as Tableau or Power BI, copying data offers the added opportunity to join community data with information from various other sources. This capability creates more opportunities for generating comprehensive reports on community usage in the context of company activities, analyzing engagement, and developing strategies for improvement.

To achieve this, community administrators must connect to Amazon S3 and copy the relevant data into their local data warehouse. This process allows for seamless data integration and provides richer insights, facilitating more informed decision-making.

Create an S3 Connection in Community

Gainsight CC provides a one-click functionality to simplify and speed up the process of creating an S3 connection. To connect with S3:

  1. Log in to Control.
  2. Navigate to Integrations > Apps. The Apps page appears, displaying a list of third-party applications available for integration.
  3. Click S3. The S3 Connector screen appears.
  4. From the Installation tab, click Install. Alternatively, click Install Now.

The S3 Connector screen updates to show the Installed status, and a green check mark appears on the S3 connection on the Apps page.

IMPORTANT:

  • When the connection is successfully established, Gainsight generates credentials to access the S3 bucket that holds your community data.
  • When the connection is created for the first time, Gainsight generates a unique Secret Access Key. We recommend saving this key for future reference, as Gainsight does not store it. If you lose this key, you need to reinstall the connection to get a new pair of credentials.

Note: When you install the S3 connector, all other data lake connectors, such as Power BI, Tableau, and Looker, are automatically installed. You can configure each data lake connector using the same Secret Access Key.

Integrate Data with S3

Once you have your Identity and Access Management (IAM) user credentials to access Amazon S3, you can directly connect to your data or use an external tool for data integration.

To use an external tool, create a new connection and select Amazon S3 as the data source. Once the connection is created, configure the connection with your bucket name and access credentials generated in the integration application.

For more information on how to integrate data using an external tool, visit the official documentation on the tool's website.
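
For a direct connection, the following is a minimal Python sketch that lists the objects under a bucket path. It assumes the third-party boto3 library is installed. The bucket name, path, region, and credential values are placeholders rather than values supplied by Gainsight, and the helper names are hypothetical.

```python
from typing import Iterator, Optional


def build_client_kwargs(access_key_id: str, secret_access_key: str,
                        region: Optional[str] = None) -> dict:
    """Collect the IAM user credentials in the form boto3.client() accepts."""
    kwargs = {
        "aws_access_key_id": access_key_id,
        "aws_secret_access_key": secret_access_key,
    }
    if region:  # the region is an assumption; pass it only if you know it
        kwargs["region_name"] = region
    return kwargs


def normalize_prefix(path: str) -> str:
    """Turn a bucket path into an S3 key prefix: no leading slash, one trailing slash."""
    cleaned = path.strip("/")
    return f"{cleaned}/" if cleaned else ""


def list_object_keys(bucket: str, path: str, client_kwargs: dict) -> Iterator[str]:
    """Yield every object key under the given path, following pagination."""
    import boto3  # third-party; imported here so the helpers above work without it

    s3 = boto3.client("s3", **client_kwargs)
    paginator = s3.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=normalize_prefix(path)):
        for obj in page.get("Contents", []):
            yield obj["Key"]


# Example usage with placeholder values:
# kwargs = build_client_kwargs("YOUR_ACCESS_KEY_ID", "YOUR_SECRET_ACCESS_KEY")
# for key in list_object_keys("your-bucket-name", "your/bucket/path", kwargs):
#     print(key)
```

Listing keys first is a low-risk way to confirm that the credentials and path work before copying any data into a warehouse.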

Notifications in S3

Once your S3 Connector is installed, you can configure real-time notifications to help community managers and admins stay updated on data changes.

Gainsight offers three notification options:

  • SNS Topics
  • SQS Queue
  • HTTP/HTTPS

Note: You can enable one, two, or all three options based on your organization’s needs.

To enable the notifications:

  1. Navigate to Integrations > Apps > S3 Connector.
  2. Click to expand any of the notification options.
  3. Enter the details required for the notifications.
  4. Click Save.
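
If you choose the HTTP/HTTPS option, you need an endpoint that accepts the notifications. The sketch below is a minimal receiver built on the Python standard library. The article does not describe the notification payload, so the code assumes a POST request whose body may or may not be JSON and simply logs it. The function and class names and the port are hypothetical.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def decode_notification(raw: bytes) -> dict:
    """Decode a request body into a dict, without assuming a payload schema."""
    try:
        payload = json.loads(raw.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError):
        # Not JSON: keep the text so it can still be logged and inspected.
        return {"unparsed": raw.decode("utf-8", errors="replace")}
    return payload if isinstance(payload, dict) else {"payload": payload}


class NotificationHandler(BaseHTTPRequestHandler):
    def do_POST(self) -> None:
        length = int(self.headers.get("Content-Length", 0))
        event = decode_notification(self.rfile.read(length))
        print("notification received:", event)
        # Acknowledge receipt so the sender does not treat the delivery as failed.
        self.send_response(200)
        self.end_headers()


def serve(port: int = 8080) -> None:
    """Run the receiver until interrupted (the port is an arbitrary choice)."""
    HTTPServer(("", port), NotificationHandler).serve_forever()
```

A production endpoint would sit behind HTTPS and verify that requests really come from the expected sender. If delivery turns out to be backed by Amazon SNS, the endpoint would also have to confirm the subscription, which this sketch does not handle.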

Use Community Data

Your community data in the CC data lake is stored as Parquet files in two kinds of objects: fact and dimension. Analyzing data in the CC data lake is crucial for gaining insights and making strategic decisions. For more information, refer to the Getting Started with Data Analytics in CC Data Lake article.

  • Fact Object - A fact object represents a collection of quantitative information that typically refers to actions, events, or process results. These facts are organized in folders by date, allowing for effective historical analysis. The fact data is refreshed every 30 minutes.
  • Dimension Object - A dimension object contains qualitative data that provides context to the facts by describing who or what performed or was affected by the action. Dimension objects store the latest state of the data. Any updates to these objects will overwrite the existing data, ensuring access to the most current information. The dimension data is refreshed every 4 hours.

You can also refer to the Data Catalog and Connectors category to understand the different objects that exist in the CC data lake and their fields.
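
The refresh intervals above (30 minutes for fact data, 4 hours for dimension data) can guide how often a copy job runs. The following Python sketch, with hypothetical names, checks whether a local copy is old enough that the source may have been refreshed since the last sync. It is a scheduling aid under those stated intervals, not part of any Gainsight tooling.

```python
from datetime import datetime, timedelta, timezone

# Refresh intervals as stated in this article.
REFRESH_INTERVALS = {
    "fact": timedelta(minutes=30),
    "dimension": timedelta(hours=4),
}


def needs_sync(object_kind: str, last_synced: datetime, now: datetime) -> bool:
    """Return True once at least one full refresh interval has passed since the last sync."""
    return now - last_synced >= REFRESH_INTERVALS[object_kind]


# Example: a fact copy synced 45 minutes ago is due for another sync.
last = datetime(2025, 8, 19, 12, 0, tzinfo=timezone.utc)
print(needs_sync("fact", last, last + timedelta(minutes=45)))
```

Because dimension objects are overwritten with the latest state, a sync for them can replace the local copy wholesale, whereas fact data, organized in folders by date, lends itself to incremental loads.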

If you have any queries or feedback, please drop an email to docs@gainsight.com or post a reply to this article.

Hi there, 

I am trying to understand a bit more about this s3 connection. Is there an s3 ARN, url, or something similar that I can find for my data? 

Without a location for source data, I am struggling to understand how it is possible to pull the data.  


Thank you!


Hi ​@HannahStrava - I had to check with our engineers on this one as I’m not so familiar with this integration myself yet.  It turns out that we deployed some changes just a few hours ago that allow you to directly see your S3 bucket name and path in the CC control environment.  You’ll find it under Integrations > Apps > S3 Connector > Installation (tab) > Show AWS Parameters. Hope this helps.


Thanks for the helpful guide!

I’ve successfully configured the S3 connection using the provided IAM credentials, and I can browse the data structure. However, I’m running into issues when trying to query the data using Amazon Athena.

Specifically:

  • I’m receiving an AccessDeniedException when trying to run queries via Athena (athena:StartQueryExecution).

  • I also don’t appear to have permissions to specify or write to an output S3 location for query results.

  • I don’t have documentation on the schema, model names, or partition structure of the Parquet files.

Could you please clarify:

  1. Is Athena supported as a querying method for community data when using the S3 connector?

  2. If yes, what permissions or setup steps are required to make it work?

  3. Is there a public or internal schema/data catalog available for the Parquet files in the S3 data lake?

Appreciate your help!
—Julia


Hi ​@juliafmorgado 

I’ll ping you to get some details so our eng team can take a look at it from our end.

Meanwhile, to answer your information-related questions:

Is there a public or internal schema/data catalog available for the Parquet files in the S3 data lake?

Every setup step has been documented and made available on the CC Data Lake connector pages.

FYI: You can find everything there, including the connection steps and the data catalog: https://communities.gainsight.com/data-catalog-connectors-313


Hey Community! 👋🚀

We’ve rolled out some exciting new features to supercharge your experience! Here’s what’s new:

Notifications in the S3 Connector

Stay instantly informed with notification support for the S3 Connector!
Now, Community Managers and Admins can get real-time alerts when new data lands in your Community Data Lake (Amazon S3). No third-party automation tools required.

🔔 Supported notification methods:

  • SNS Topic

  • SQS Queue

  • HTTP/HTTPS Endpoints

