[Home](https://www.selecthub.com/) \> [ETL](https://www.selecthub.com/category/etl/) \> [ETL Tools](https://www.selecthub.com/c/etl-tools/) \> AWS Glue 

Categories:

* [ETL Tools](https://www.selecthub.com/c/etl-tools/)
* [Data Integration Tools](https://www.selecthub.com/c/data-integration-tools/)
* [...](#)

## What Is AWS Glue?

**Industry Specialties:** Serves all industries

AWS Glue is a fully managed, event-driven serverless computing platform that extracts, cleanses and organizes data for insights. Automatic code generation ensures citizen data scientists and power users can create and schedule integration workflows. An event-driven architecture enables setting triggers to launch data integration processes.

  
A common data catalog with automatic schema generation ensures data is unique and easily accessible. With streaming data integration, it catalogs assets from datastores like Amazon S3, making it available for querying with Amazon Athena and Redshift Spectrum. Developers can access readymade endpoints to edit and test code.

PRICE

$

$

$

$

$

COMPANY SIZE

S

M

L

DEPLOYMENT

PLATFORM

[ Try Before You Buy. Request a Free Demo Today! Request Demo It's completely free! ](https://pmo.selecthub.com/get-product-demo/?category=ETL+Tools&product%5Fname=AWS%2BGlue&origin%5Furl=https%3A%2F%2Fwww.selecthub.com%2Fp%2Fetl-tools%2Faws-glue%2F&product%5Flogo=https%3A%2F%2Fcdn.selecthub.com%2Fproducts%2F3a077e8acfc4a2b463c47f2125fdfac5-d34e433e93d77f5433219e5f1602055c%2Fresources%2Fnormal%2Flogo.png%3F1693319122) 

 User Sentiment i 

![User satisfaction level icon: great]() 

Based on 165 reviews:

 Add your rating:

![Screenshots]()![Screenshots]()![Screenshots]()![Screenshots]() 

 Product Screenshots and Videos

## #3

 AWS Glue is ranked #3 in the ETL Tools product directory based on the latest available data collected by SelectHub. Compare the leaders with our In-Depth Report.

[ Get the Report Now](https://pmo.selecthub.com/request-custom-scorecard?category%5Fslug=etl-tools&product%5Fslug=aws-glue&slug=aws-glue&product%5Fname=AWS+Glue&category=ETL+Tools&origin%5Furl=https%3A%2F%2Fwww.selecthub.com%2Fp%2Fetl-tools%2Faws-glue%2F) 

## AWS Glue Pricing

Based on our most recent analysis, AWS Glue pricing starts at $0 (Per M-DPU-Hour, Usage-Based).

[Get Price Quote](https://pmo.selecthub.com/get-product-pricing/?category=ETL+Tools&product%5Fname=AWS%2BGlue&origin%5Furl=https%3A%2F%2Fwww.selecthub.com%2Fp%2Fetl-tools%2Faws-glue%2F&product%5Flogo=https%3A%2F%2Fcdn.selecthub.com%2Fproducts%2F3a077e8acfc4a2b463c47f2125fdfac5-d34e433e93d77f5433219e5f1602055c%2Fresources%2Fnormal%2Flogo.png%3F1693319122&price=1) 

Price

$

$

$

$

$

 i

Starting From

$0.44

Pricing Model

Per M-DPU-Hour, Usage-Based

Free Trial

No

## Training Resources

 AWS Glue is supported with the following types of training:

Documentation

In Person

Live Online

Videos

Webinars

## Support

 The following support services are available for AWS Glue:

Email

Phone

Chat

FAQ

Forum

Help Desk

Knowledge Base

Tickets

Training

24/7 Live Support

## AWS Glue Benefits and Insights

Why use AWS Glue?

### Key differentiators & advantages of AWS Glue

* **Effortless Data Integration:** Streamline data movement across diverse sources like databases, applications, and cloud storage with pre-built connectors and automated schema discovery.
* **Simplified Data Preparation:** Clean, transform, and enrich data with a visual drag-and-drop interface and built-in transformations, eliminating the need for complex coding.
* **Serverless Scalability:** Forget infrastructure management! Glue seamlessly scales to handle massive data volumes without upfront provisioning or ongoing maintenance.
* **Cost-Effective Flexibility:** Pay-per-use pricing based on actual resource consumption makes Glue ideal for both small and large data pipelines, optimizing your costs.
* **Seamless AWS Integration:** Leverage the power of the AWS ecosystem! Glue effortlessly integrates with S3, Redshift, and other AWS services, creating a unified data pipeline within your existing infrastructure.
* **Improved Data Accessibility:** Deliver prepared data to data lakes, data warehouses, and analytics platforms, democratizing access for data scientists, analysts, and business users.
* **Enhanced Collaboration:** Share data pipelines and workflows with other users and teams, fostering collaboration and streamlining data-driven workflows.
* **Centralized Data Catalog:** Maintain a single source of truth for your data assets with Glue Data Catalog, ensuring data consistency and discoverability.
* **Continuous Monitoring and Optimization:** Track job performance, identify bottlenecks, and optimize your pipelines for efficiency with built-in monitoring and logging tools.
* **Future-Proof Data Infrastructure:** Stay ahead of the curve with Glue's serverless architecture and cloud-native approach, adapting to your evolving data needs with ease.

### Industry Expertise

AWS Glue provides data integration to multiple clients in enterprises globally. Some of these are healthcare, travel and hospitality, catering, agriculture and life sciences.

## AWS Glue Reviews

Based on our most recent analysis, AWS Glue reviews indicate a 'great' User Satisfaction Rating of 85% based on 165 user reviews from 3 recognized software review sites.

![User satisfaction level icon: great]() 

165 reviews

85%

of users would recommend this product

###  Synopsis of User Ratings and Reviews

Based on an aggregate of AWS Glue reviews taken from the sources above, the following pros & cons have been curated by a SelectHub Market Analyst.

#### Pros

* **Cost-Effective & Serverless:** Pay only for resources used, eliminates server provisioning and maintenance
* **Simplified ETL workflows:** Drag-and-drop UI & auto-generated code for easy job creation, even for non-programmers
* **Data Catalog:** Unified metadata repository for seamless discovery & access across various data sources
* **Flexible Data Integration:** Connects to diverse data sources & destinations (S3, Redshift, RDS, etc.)
* **Built-in Data Transformations:** Apply pre-built & custom transformations within workflows for efficient data cleaning & shaping
* **Visual Data Cleaning (Glue DataBrew):** Code-free data cleansing & normalization for analysts & data scientists
* **Scalability & Performance:** Auto-scaling resources based on job needs, efficient Apache Spark engine for fast data processing
* **Community & Support:** Active user community & helpful AWS support resources for problem-solving & best practices

#### Cons

* **Limited Customization & Control:** Visual interface and pre-built transformations may not be flexible enough for complex ETL needs, requiring manual coding or custom Spark jobs.
* **Debugging Challenges:** Troubleshooting Glue jobs can be complex due to limited visibility into underlying Spark code and distributed execution, making error resolution time-consuming.
* **Performance Limitations for Certain Workloads:** Serverless architecture may not be optimal for latency-sensitive workloads or large-scale data processing, potentially leading to bottlenecks.
* **Vendor Lock-in & Portability:** Migrating ETL workflows from Glue to other platforms can be challenging due to its proprietary nature and lack of open-source compatibility.
* **Pricing Concerns for Certain Use Cases:** Pay-per-use model can be expensive for long-running ETL jobs or processing massive datasets, potentially exceeding budget constraints.

#### Researcher's Summary:

User reviews of AWS Glue paint a picture of a powerful and user-friendly ETL tool for the cloud, but one with limitations. Praise often centers around its intuitive visual interface, making complex data pipelines accessible even to non-programmers. Pre-built connectors and automated schema discovery further simplify setup, saving users time and effort. Glue's serverless nature and tight integration with the broader AWS ecosystem are also major draws, offering seamless scalability and data flow within a familiar environment. However, some users find Glue's strength in simplicity a double-edged sword. For complex transformations beyond basic filtering and aggregation, custom scripting in Python or Scala is required, limiting flexibility for those unfamiliar with these languages. On-premise data integration is another pain point, with Glue primarily catering to cloud-based sources. This leaves users seeking hybrid deployments or integration with legacy systems feeling somewhat stranded. Cost also arises as a concern. Glue's pay-per-use model can lead to unexpected bills for large data volumes or intricate pipelines, unlike some competitors offering fixed monthly subscriptions. Additionally, Glue's deep integration with AWS can create lock-in anxieties for users worried about switching cloud providers in the future. Overall, user reviews suggest Glue shines in cloud-based ETL for users comfortable with its visual interface and scripting limitations. Its scalability, ease of use, and AWS integration are undeniable strengths. However, for complex transformations, on-premise data needs, or cost-conscious users, alternative tools may offer a better fit.

## Key Features

* **Console:** Discover, transform and make available data assets for querying and analysis. Builds complex data integration pipelines; handles dependencies, filters bad data and retries jobs after failures. Monitor jobs and get task status alerts via Amazon Cloudwatch.
* **Data Catalog:** Gleans and stores metadata in the catalog for workflow authoring, with full version history. Search and discover desired datasets from the data catalog, irrespective of where they are located. Saves time and money – automatically computes statistics and registers partitions with a central metadata repository.
* **Automatic Schema Discovery:** Creates metadata automatically by gleaning schema, quality and data types through built-in datastore crawlers and stores it in the Data Catalog. Ensure up-to-date assets – run crawlers on a schedule, on-demand or based on event triggers. Manage streaming data schemas with the Schema Registry.
* **Event-driven Architecture:** Move data automatically into data lakes and warehouses by setting triggers based on a schedule or event. Extract, transform and load jobs with a Lambda function as soon as new data becomes available.
* **Visual Data Prep:** Prepare assets for analytics and machine learning through Glue DataBrew. Automate anomaly filtering, convert data to standard formats and rectify invalid values with more than 250 pre-designed transformations – no need to write code.
* **Materialized Views:** Create a virtual table from multiple different data sources by using SQL. Copies data from each source data store and creates a replica in the target datastore as a materialized view. Ensures data is always up-to-date by monitoring data in source stores continuously and updating target stores in real time.

  
## Limitations

At the time of this review, these are the limitations according to user feedback:

  
* Isn’t beginner-friendly; requires technical knowledge.
* Supports limited data sources.
* Doesn’t support conventional RDBMS systems.
* Isn’t easy to use with products other than the AWS suite.
* Can’t show real-time data for complex operations since it lacks incremental data sync with the source.

  
## Suite Support

Go through technical documentation, the knowledge center, support forums, user guides and tutorials available on the vendor’s website for self-service issue resolution and answers to queries.

  
Avail Basic support that includes access to the resource center, service health dashboard, product FAQs, discussion forums and support for health checks for no additional cost. Sign up for the Developer Support plan for 24/7 access to customer service, with email support during business hours.

  
Subscribe to the Business Support plan or above for 24/7 access to support via email, phone and chat, with the Personal Health Dashboard, Trusted Advisor and third-party software support. Get Enterprise Support for access to a dedicated Technical Account Manager (TAM) for onboarding and subject matter experts.

  
_mail\_outline_Email: Not specified.

_phone_Phone: Not available. Phone callbacks are available to licensed subscribers only.

_school_Training: Register for free, instructor-led courses, either virtually or in-person, on the vendor’s website. Or sign up for self-paced paid online courses from digital training providers.

_local\_offer_Tickets: Create a case from the Support Center through a subscriber account.

  
## Compare ETL Tools

These are the top products most often compared.

 Generating Scorecard...

Compare to AWS Glue

You can choose 4 products to compare

[ IDMC ](https://www.selecthub.com/p/data-management-tools/informatica-idmc/) 

[ InfoSphere Information Server ](https://www.selecthub.com/p/data-integration-tools/infosphere-information-server/) 

[ Talend ](https://www.selecthub.com/p/data-management-tools/talend/) 

[ Informatica PowerCenter ](https://www.selecthub.com/p/etl-tools/informatica-powercenter/) 

[ SAP Data Services ](https://www.selecthub.com/p/etl-tools/sap-data-services/) 

[ Oracle Data Integrator ](https://www.selecthub.com/p/data-integration-tools/oracle-data-integrator/) 

[ Pentaho ](https://www.selecthub.com/p/data-management-tools/pentaho/) 

[ Dataflow ](https://www.selecthub.com/p//dataflow/) 

[ Azure Data Factory ](https://www.selecthub.com/p/data-integration-tools/azure-data-factory/) 

[ SAS Data Management ](https://www.selecthub.com/p/data-management-tools/sas-data-management/) 

 Generating Scorecard...

Compare to AWS Glue

## Head-to-Head  
 Comparison

![AWS Glue Software Tool]() 

vs

* [Cloud Data Fusion](https://www.selecthub.com/etl-tools/aws-glue-vs-cloud-data-fusion/)
* [DataStage](https://www.selecthub.com/etl-tools/aws-glue-vs-datastage/)
* [Daton](https://www.selecthub.com/etl-tools/aws-glue-vs-daton/)
* [Dexi](https://www.selecthub.com/etl-tools/aws-glue-vs-dexi/)
* [Fivetran](https://www.selecthub.com/etl-tools/fivetran-vs-aws-glue/)
* [Hevo](https://www.selecthub.com/etl-tools/aws-glue-vs-hevo-data/)
* [Informatica PowerCenter](https://www.selecthub.com/etl-tools/aws-glue-vs-informatica-powercenter/)
* [Integrate.io](https://www.selecthub.com/etl-tools/aws-glue-vs-integrate-io/)
* [Mozart Data](https://www.selecthub.com/etl-tools/aws-glue-vs-mozart-data/)
* [Pipestream](https://www.selecthub.com/etl-tools/aws-glue-vs-pipestream/)
* [Qlik Replicate](https://www.selecthub.com/etl-tools/aws-glue-vs-qlik-replicate/)
* [SAP Data Services](https://www.selecthub.com/etl-tools/aws-glue-vs-sap-data-services/)
* [SQL Server Integration Services](https://www.selecthub.com/etl-tools/sql-server-integration-services-vs-aws-glue/)
* [Task Factory](https://www.selecthub.com/etl-tools/aws-glue-vs-task-factory/)

## Awards

SelectHub research analysts have evaluated AWS Glue and concluded it earns best-in-class honors for Workflow Management. 

![Workflow Management Award]()

## Similar Products

Here are the most similar products to AWS Glue.

[ Cloud Data Fusion ](https://www.selecthub.com/p/etl-tools/cloud-data-fusion/) 

[ DataStage ](https://www.selecthub.com/p/etl-tools/datastage/) 

[ Etleap ](https://www.selecthub.com/p/etl-tools/etleap/) 

[ Pipestream ](https://www.selecthub.com/p/etl-tools/pipestream/) 

[ Task Factory ](https://www.selecthub.com/p/etl-tools/task-factory/) 

[ Hevo ](https://www.selecthub.com/p/etl-tools/hevo-data/) 

[ Daton ](https://www.selecthub.com/p/etl-tools/daton/) 

[ SAP Data Services ](https://www.selecthub.com/p/etl-tools/sap-data-services/) 

[ Dexi ](https://www.selecthub.com/p/etl-tools/dexi/) 

[ Qlik Replicate ](https://www.selecthub.com/p/etl-tools/qlik-replicate/) 

 Your review has been submitted  
and should be visible within 24 hours.

Review Title 

Pros 

Cons 

Overall feedback 

Your name 

Your job title 

Industry

[ Choose your main industry](javascript:void%28%29) 

* [Accounting / CPA](javascript:void%28%29)
* [Advertising](javascript:void%28%29)
* [Aerospace & Defense](javascript:void%28%29)
* [Agriculture](javascript:void%28%29)
* [Apparel](javascript:void%28%29)
* [Architecture](javascript:void%28%29)
* [Auto Dealership](javascript:void%28%29)
* [Automotive](javascript:void%28%29)
* [Banking & Financial Services](javascript:void%28%29)
* [Banking & Mortgage](javascript:void%28%29)
* [Chemicals](javascript:void%28%29)
* [Construction & Engineering](javascript:void%28%29)
* [Construction / Contracting](javascript:void%28%29)
* [Consulting](javascript:void%28%29)
* [Consumer Products](javascript:void%28%29)
* [Distribution](javascript:void%28%29)
* [E-commerce](javascript:void%28%29)
* [Education](javascript:void%28%29)
* [Electronics](javascript:void%28%29)
* [Energy & Utilities](javascript:void%28%29)
* [Federal Government](javascript:void%28%29)
* [Field Maintenance](javascript:void%28%29)
* [Food & Beverage](javascript:void%28%29)
* [Healthcare / Social Services](javascript:void%28%29)
* [Hospitality / Gaming / Travel](javascript:void%28%29)
* [Human Resources](javascript:void%28%29)
* [Industrial Machinery](javascript:void%28%29)
* [Information Technology & High Tech](javascript:void%28%29)
* [Insurance](javascript:void%28%29)
* [Legal](javascript:void%28%29)
* [Maintenance / Field Service](javascript:void%28%29)
* [Manufacturing](javascript:void%28%29)
* [Marketing Services](javascript:void%28%29)
* [Media & Communications / Entertainment](javascript:void%28%29)
* [Mill Products](javascript:void%28%29)
* [Mining / Metals](javascript:void%28%29)
* [Mortgage](javascript:void%28%29)
* [Non-Profit](javascript:void%28%29)
* [Not Available](javascript:void%28%29)
* [Oil & Gas](javascript:void%28%29)
* [Other](javascript:void%28%29)
* [Other Services](javascript:void%28%29)
* [Payroll Provider](javascript:void%28%29)
* [Pharmaceuticals](javascript:void%28%29)
* [Professional Employer Organization](javascript:void%28%29)
* [Professional Services](javascript:void%28%29)
* [Property Management](javascript:void%28%29)
* [Public Sector](javascript:void%28%29)
* [Real Estate](javascript:void%28%29)
* [Recruiting Agency](javascript:void%28%29)
* [Religious Institutions](javascript:void%28%29)
* [Retail](javascript:void%28%29)
* [Sales & Marketing](javascript:void%28%29)
* [Semiconductors](javascript:void%28%29)
* [Software / IT](javascript:void%28%29)
* [Sports and Recreation](javascript:void%28%29)
* [Staffing Agency](javascript:void%28%29)
* [State & Local Government](javascript:void%28%29)
* [Telecommunications](javascript:void%28%29)
* [Third-Party Administrator](javascript:void%28%29)
* [Transportation & Logistics](javascript:void%28%29)
* [Wholesale Distribution](javascript:void%28%29)

Company Size

[ Choose your company size](javascript:void%28%29) 

* [1 employee](javascript:void%28%29)
* [2 to 9 employees](javascript:void%28%29)
* [10 - 19 employees](javascript:void%28%29)
* [20 - 49 employees](javascript:void%28%29)
* [50 - 99 employees](javascript:void%28%29)
* [100 - 499 employee](javascript:void%28%29)
* [500 - 999 employees](javascript:void%28%29)
* [1,000 - 2,499 employees](javascript:void%28%29)
* [2,500 - 4,999 employees](javascript:void%28%29)
* [5,000 - 9,999 employees](javascript:void%28%29)
* [10,000 - 24,999 employees](javascript:void%28%29)
* [25,000 - 49,999 employees](javascript:void%28%29)
* [50,000 + employees](javascript:void%28%29)

```json
{
              "@context": "https://schema.org",
              "@type": "BreadcrumbList",
              "itemListElement": [
              {
                "@type": "ListItem",
                "position": 1,
                "name": "Home",
                "item": "https://www.selecthub.com/"
              }, 
              {
                "@type": "ListItem",
                "position": 2,
                "name": "ETL",
                "item": "https://www.selecthub.com/category/etl/"
              }, 
              {
                "@type": "ListItem",
                "position": 3,
                "name": "ETL Tools",
                "item": "https://www.selecthub.com/c/etl-tools/"
              }, 
              {
                "@type": "ListItem",
                "position": 4,
                "name": "AWS Glue"
              }
            ]
          }
{
          "@context": "http://schema.org",
          "@type": "SoftwareApplication",
          "name": "AWS Glue",
          "description": "AWS Glue is a fully managed, event-driven serverless computing platform that extracts, cleanses and organizes data for insights. Automatic code generation ensures citizen data scientists and power users can create and schedule integration workflows. An event-driven architecture enables setting triggers to launch data integration processes.



A common data catalog with automatic schema generation ensures data is unique and easily accessible. With streaming data integration, it catalogs assets from datastores like Amazon S3, making it available for querying with Amazon Athena and Redshift Spectrum. Developers can access readymade endpoints to edit and test code.", 
          "review": {
            "@type": "Review","reviewRating": {
            "@type": "Rating",
            "ratingValue": 88,
            "bestRating": 100
          },
            "author": {
              "@type": "Person",
              "name": "SelectHub",
              "reviewBody": "User reviews of AWS Glue paint a picture of a powerful and user-friendly ETL tool for the cloud, but one with limitations. Praise often centers around its intuitive visual interface, making complex data pipelines accessible even to non-programmers. Pre-built connectors and automated schema discovery further simplify setup, saving users time and effort. Glue's serverless nature and tight integration with the broader AWS ecosystem are also major draws, offering seamless scalability and data flow within a familiar environment.

However, some users find Glue's strength in simplicity a double-edged sword. For complex transformations beyond basic filtering and aggregation, custom scripting in Python or Scala is required, limiting flexibility for those unfamiliar with these languages. On-premise data integration is another pain point, with Glue primarily catering to cloud-based sources. This leaves users seeking hybrid deployments or integration with legacy systems feeling somewhat stranded.

Cost also arises as a concern. Glue's pay-per-use model can lead to unexpected bills for large data volumes or intricate pipelines, unlike some competitors offering fixed monthly subscriptions. Additionally, Glue's deep integration with AWS can create lock-in anxieties for users worried about switching cloud providers in the future.

Overall, user reviews suggest Glue shines in cloud-based ETL for users comfortable with its visual interface and scripting limitations. Its scalability, ease of use, and AWS integration are undeniable strengths. However, for complex transformations, on-premise data needs, or cost-conscious users, alternative tools may offer a better fit."
            }
          },
              
            "image": "https://cdn.selecthub.com/products/3a077e8acfc4a2b463c47f2125fdfac5-d34e433e93d77f5433219e5f1602055c/resources/normal/logo.png?1693319122",
            "aggregateRating": {
              "@type": "AggregateRating",
              "ratingValue": "85",
              "bestRating": "100",
              "worstRating": "1",
              "ratingCount": "165"
            }, 
            "offers": {
              "@type": "Offer",
              "priceSpecification": {
                "@type": "priceSpecification",
                "price": "0.44",
                "priceCurrency": "USD"
              }
            },
              "positiveNotes": {
                "@type": "ItemList",
                "itemListElement": [  
                  {
                      "@type": "ListItem",
                      "position": 1,
                      "name": "Cost-Effective &amp; Serverless: Pay only for resources used, eliminates server provisioning and maintenance"
                    },
                     
                  {
                      "@type": "ListItem",
                      "position": 2,
                      "name": "Simplified ETL workflows: Drag-and-drop UI &amp; auto-generated code for easy job creation, even for non-programmers"
                    },
                     
                  {
                      "@type": "ListItem",
                      "position": 3,
                      "name": "Data Catalog: Unified metadata repository for seamless discovery &amp; access across various data sources"
                    }
                ]
              },
              "negativeNotes": {
                "@type": "ItemList",
                "itemListElement": [  
                  {
                    "@type": "ListItem",
                    "position": 1,
                    "name": "Limited Customization &amp; Control: Visual interface and pre-built transformations may not be flexible enough for complex ETL needs, requiring manual coding or custom Spark jobs."
                    },
                     
                  {
                    "@type": "ListItem",
                    "position": 2,
                    "name": "Debugging Challenges: Troubleshooting Glue jobs can be complex due to limited visibility into underlying Spark code and distributed execution, making error resolution time-consuming."
                    },
                     
                  {
                    "@type": "ListItem",
                    "position": 3,
                    "name": "Performance Limitations for Certain Workloads: Serverless architecture may not be optimal for latency-sensitive workloads or large-scale data processing, potentially leading to bottlenecks."
                    }
                ]
              },
          "applicationCategory": "ETL Tools"
        }
```
