ECS 3.6.2 Data Access Guide

Secure the ECS bucket using metadata

To ensure that the ECS bucket can work with a secure Hadoop cluster, the bucket must have access to information about the cluster.

In a secure Hadoop cluster, the Kerberos principal must be mapped to an HDFS username. In addition, the user must be mapped to a UNIX group. Within the Hadoop cluster, the NameNode gathers this information from the Hadoop nodes themselves and from the configuration files (core-site.xml and hdfs-site.xml).

To enable the ECS nodes to determine this information and to validate client requests, the following data must be made available to the ECS nodes:

Kerberos user to UNIX user and group mapping
Superuser group
Proxy user settings

The data is made available to the ECS nodes as a set of name-value pairs held as metadata.

Kerberos users

Information about every Kerberos user (not AD users) that requires Hadoop access to a bucket must be uploaded to ECS. The following data is required:

Principal name
Principal shortname (mapped name)
Principal groups

If there are 10 Kerberos principals on a Hadoop node, you must create 30 name-value pairs in the JSON input file. Every name must be unique, so you must assign a unique name to every principal name, principal shortname, and principal group entry. ECS expects a constant prefix and suffix for the JSON entry names.

The required prefix for every Kerberos user entry is internal.kerberos.user, and the three possible suffixes are name, shortname, and groups, as shown in the following example.

{
    "name": "internal.kerberos.user.hdfs.name",
    "value": "hdfs-cluster999@EXAMPLE_HDFS.EMC.COM"
},
{
    "name": "internal.kerberos.user.hdfs.shortname",
    "value": "hdfs"
},
{
    "name": "internal.kerberos.user.hdfs.groups",
    "value": "hadoop,hdfs"
},

The value between the prefix and suffix can be anything, as long as it uniquely identifies the entry. For example, you could use:

"name": "internal.kerberos.user.1.name",
"name": "internal.kerberos.user.1.shortname",
"name": "internal.kerberos.user.1.groups",
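
The naming scheme above lends itself to being generated rather than typed by hand. The following is a minimal Python sketch, not part of ECS. The helper name kerberos_user_entries and its parameters are hypothetical; only the internal.kerberos.user prefix and the name, shortname, and groups suffixes come from the text above.

```python
import json

PREFIX = "internal.kerberos.user"  # constant prefix described in the text


def kerberos_user_entries(key, principal, shortname, groups):
    """Return the three name-value pairs for one Kerberos principal.

    key is the free-form identifier placed between the prefix and the
    suffix; it only has to be unique per principal.
    Hypothetical helper for illustration, not an ECS API.
    """
    return [
        {"name": f"{PREFIX}.{key}.name", "value": principal},
        {"name": f"{PREFIX}.{key}.shortname", "value": shortname},
        {"name": f"{PREFIX}.{key}.groups", "value": ",".join(groups)},
    ]


entries = kerberos_user_entries(
    "hdfs", "hdfs-cluster999@EXAMPLE_HDFS.EMC.COM", "hdfs", ["hadoop", "hdfs"]
)
print(json.dumps(entries, indent=4))
```

Calling the helper once per principal yields three entries each, which is where the figure of 30 pairs for 10 principals comes from.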

Principals can map to different users. For example, the rm principal is usually mapped to the yarn user by the auth_to_local setting for the Hadoop cluster, like this:

RULE:[2:$1@$0](rm@EXAMPLE_HDFS.EMC.COM)s/.*/yarn/

For any principal that maps to a different principal (for example, the rm principal maps to the yarn principal), you must use the mapped principal in the shortname value. The entry for the rm principal would be:

{
"name": "internal.kerberos.user.rm.name",
"value": "rm@EXAMPLE_HDFS.EMC.COM"
},
{
"name": "internal.kerberos.user.yarn.shortname",
"value": "yarn@EXAMPLE_HDFS.EMC.COM"
},
{
"name": "internal.kerberos.user.yarn.groups",
"value": "hadoop"
},
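
To see how a rule like the one above arrives at the mapped name, the following Python sketch mimics a single rule of that shape. It is a simplified stand-in, not Hadoop's implementation: it assumes two-component principals only, and the function name apply_rule and the host name node1.example.com are hypothetical.

```python
import re


def apply_rule(principal, pattern, replacement):
    """Mimic one rule of the form RULE:[2:$1@$0](pattern)s/.*/replacement/.

    Assumption: only two-component principals (service/host@REALM) are
    handled. Returns None when the rule does not apply.
    """
    name, _, realm = principal.partition("@")
    components = name.split("/")
    if len(components) != 2:
        return None
    candidate = f"{components[0]}@{realm}"  # the "$1@$0" format
    if re.fullmatch(pattern, candidate) is None:
        return None
    # sed-style substitution without the g flag: replace the first match only
    return re.sub(r".*", replacement, candidate, count=1)


mapped = apply_rule(
    "rm/node1.example.com@EXAMPLE_HDFS.EMC.COM",
    r"rm@EXAMPLE_HDFS.EMC.COM",
    "yarn",
)
print(mapped)
```

The result of the mapping is the value that the shortname entry is expected to carry.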

Supergroup

You must tell ECS which Linux group on the Hadoop nodes gives its members superuser privileges. Only one entry in the JSON input file is expected for the supergroup designation. It must look like the following:

{
    "name": "dfs.permissions.supergroup",
    "value": "hdfs"
}
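
Because only one supergroup entry is expected, a quick check before upload can catch a missing or repeated entry. This is a minimal Python sketch under that assumption; check_supergroup is a hypothetical helper, and ECS may handle such input differently.

```python
SUPERGROUP_KEY = "dfs.permissions.supergroup"


def check_supergroup(metadata):
    """Return the supergroup value, or raise if there is not exactly one.

    metadata is a list of {"name": ..., "value": ...} dictionaries.
    Hypothetical pre-upload check, not an ECS API.
    """
    found = [entry for entry in metadata if entry["name"] == SUPERGROUP_KEY]
    if len(found) != 1:
        raise ValueError(
            f"expected exactly one {SUPERGROUP_KEY} entry, found {len(found)}"
        )
    return found[0]["value"]


supergroup = check_supergroup([{"name": SUPERGROUP_KEY, "value": "hdfs"}])
print(supergroup)
```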

Proxy settings

For proxy support, you must identify all proxy settings that are allowed for each Hadoop application, where an application is one of the Hadoop-supported applications, such as hive.

In the following example, proxy support for the hive application is granted to users who are members of the s3users group (AD or Linux group). These users can run hive on any of the hosts in the Hadoop cluster. The JSON for this consists of two name/value pairs: one for the hosts setting and one for the groups setting.

{
    "name": "hadoop.proxyuser.hive.hosts",
    "value": "*"
},
{
    "name": "hadoop.proxyuser.hive.groups",
    "value": "s3users"
}
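
The hosts and groups pair for an application follows a fixed naming pattern, so it can be produced by a small helper. The following Python sketch is illustrative only; proxy_user_entries is a hypothetical name, and joining multiple values with commas is an assumption based on the usual Hadoop convention for these properties.

```python
def proxy_user_entries(application, hosts, groups):
    """Return the hosts and groups name-value pairs for one application.

    hosts and groups are lists of strings; use ["*"] to allow any value.
    Hypothetical helper for illustration, not an ECS API.
    """
    base = f"hadoop.proxyuser.{application}"
    return [
        {"name": f"{base}.hosts", "value": ",".join(hosts)},
        {"name": f"{base}.groups", "value": ",".join(groups)},
    ]


hive_proxy = proxy_user_entries("hive", ["*"], ["s3users"])
for entry in hive_proxy:
    print(entry)
```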

The complete file

The three types of metadata must be combined into a single JSON file. The JSON file format is as shown in the following example.

{
    "head_type": "hdfs",
    "metadata": [
    {
        "name": "METADATANAME_1",
        "value": "METADATAVALUE_1"
    },
    {
        "name": "METADATANAME_2",
        "value": "METADATAVALUE_2"
    },

        :

    {
        "name": "METADATANAME_N",
        "value": "METADATAVALUE_N"
    }
    ]
}

NOTE:

The last name/value pair does not have a trailing "," character.

An example of a JSON file is shown in: Secure bucket metadata.
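
One way to avoid a stray trailing comma or a duplicated name is to assemble the file programmatically and let a JSON library serialize it. The following Python sketch assumes the file layout shown above; build_metadata_file is a hypothetical helper, and the sample values are taken from the earlier examples in this section.

```python
import json


def build_metadata_file(*entry_groups):
    """Combine lists of name-value pairs into the single document layout.

    Raises ValueError if any name occurs more than once, because every
    name in the file must be unique. Hypothetical helper, not an ECS API.
    """
    metadata = [entry for group in entry_groups for entry in group]
    names = [entry["name"] for entry in metadata]
    duplicates = sorted({name for name in names if names.count(name) > 1})
    if duplicates:
        raise ValueError(f"duplicate metadata names: {duplicates}")
    return {"head_type": "hdfs", "metadata": metadata}


users = [
    {"name": "internal.kerberos.user.hdfs.name",
     "value": "hdfs-cluster999@EXAMPLE_HDFS.EMC.COM"},
    {"name": "internal.kerberos.user.hdfs.shortname", "value": "hdfs"},
    {"name": "internal.kerberos.user.hdfs.groups", "value": "hadoop,hdfs"},
]
supergroup = [{"name": "dfs.permissions.supergroup", "value": "hdfs"}]
proxy = [
    {"name": "hadoop.proxyuser.hive.hosts", "value": "*"},
    {"name": "hadoop.proxyuser.hive.groups", "value": "s3users"},
]

document = build_metadata_file(users, supergroup, proxy)
text = json.dumps(document, indent=4)  # serializer emits no trailing comma
print(text)
```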

Secure and non-secure buckets

Once metadata is loaded into a bucket, the bucket is referred to as a secure bucket, and you must have Kerberos principals to access it. A request from a non-secure Hadoop node is rejected. If metadata is not loaded, the bucket is not secure and a request from a secure Hadoop node is rejected.

The following error is seen if you try to access a secure bucket from a non-secure cluster. A similar message is seen if you try to access a non-secure bucket from a secure cluster.

[hdfs@sandbox ~]$ hadoop fs -ls -R viprfs://hdfsBucket3.s3.site1/
ls: ViPRFS internal error (ERROR_FAILED_TO_PROCESS_REQUEST).
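
The matching rule described in this section can be summarized as a small truth table. The Python sketch below only models the two rejection cases stated above and is not ECS code; modes_match is a hypothetical name, and a matching pair is still subject to the normal Kerberos and permission checks.

```python
def modes_match(bucket_has_metadata, cluster_is_secure):
    """Return True when the bucket and cluster security modes agree.

    A secure bucket (metadata loaded) rejects non-secure clusters, and a
    non-secure bucket rejects secure clusters. Illustrative model only.
    """
    return bucket_has_metadata == cluster_is_secure


for bucket_secure in (True, False):
    for cluster_secure in (True, False):
        outcome = "proceeds" if modes_match(bucket_secure, cluster_secure) else "rejected"
        print(f"secure bucket={bucket_secure}, secure cluster={cluster_secure}: {outcome}")
```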
