DataDiscover - File Analysis & Metadata Management

Quickly see, analyze, and act on all your file data

Start your free trial

See DataDiscover in Action

Join us weekly for a LIVE Igneous demo

Weekly Live Demo

Simple

Easy to install software, no hardware or system management overhead

Fast

Speedy time to knowledge with continuously fresh analytics in near real-time

Scalable

A global view across multiple locations, multiple NAS systems, exabytes of data, and billions of files

Spend less on NAS storage

  • Discover the 60% of your data that is cold
  • Reclaim NAS capacity and save costs
  • Manage end-user storage consumption without the fire drills
DataDiscover Datasheet 

Fact-Based Actions with DataDiscover

Fact Based Conversations

with Project & Data Owners to optimize storage utilization

Reclaim NAS Capacity and Costs

and more accurately plan future storage spend

dd-product-screenshot-zoomed2

Right Size Refreshes and Clouds

using accurate capacity, aging, and change rates

Enable Your End Users

to manage their own storage consumption without the firedrills

Before DataDiscover

I don’t know how much active file data we really have.

screen1-1

After DataDiscover

Global file analysis of all your data

Before DataDiscover

I can’t see which projects, directories and files are really consuming our NAS capacity.

screen1-1

After DataDiscover

Explore usage at any directory level or project in real time.

Before DataDiscover

I don’t have the information to determine what to archive and why.

screen1-1

After DataDiscover

Surgically take action on all NAS data with facts

Before DataDiscover

I can’t justify spending less  - or more- on primary NAS spend.

screen1-1

After DataDiscover

Active and Cold data facts lead to smart spend decisions.

Metadata Management
Shouldn’t Require a Long and Expensive Deployment

pointer

Ridiculously easy configuration

DataDiscover installs a lightweight VM, imports your NAS systems in seconds, and you’re ready to get started. The file analysis process begins within minutes. No further configuration is needed.

graph-1

Ridiculously simple to use

The interactive dashboard shows capacity, # of files, and aging of files globally across your entire enterprise.

setting-1

Ridiculously easy to manage

You’ll never have to update software, debug issues, or manage licenses – unless you really want to. Igneous remotely monitors and manages all aspects of DataDiscover for you, dramatically reducing administrative overhead for your team.

Visibility for Billions of Files a Day

How is file analysis with DataDiscover so fast?

We combine latency aware scanning—that won't interfere with your NAS performance—with scalable indexing to handle billions of files every day.


Also, we’re not in the datapath, don’t mount the file systems and don’t use agents – all things that can slow down how quickly you get your information.

panel3

DataDiscover scans at a rate of 17 billion files per day across any NAS system. Your time to knowledge is hours and your analytics continuously stay fresh.

DataDiscover Whitepaper

Metadata Management Optimized for Massive Scale and Depth

✓  DataDiscover gives you a global view across multiple locations and multiple NAS systems including NetApp, Isilon, Pure FlashBlade and Qumulo

✓  Gather the facts at any level of your directories and projects

✓  Continuous discovery and indexing whether you have 10’s of millions or 100’s of millions of files
 
Proactively Manage System Utilization-photo

You get

Without all this

You get without all this

Visibility for all your file data where it lives
yellow-arrow
Costly capacity additions to keep up with the growth rate of data
Four click configuration 
yellow-arrow
Time consuming deployments with complex configuration
Time to visibilty in hours
yellow-arrow
State of visibility of data that often takes weeks to see
Continuous visibility with AdaptiveScan ™
yellow-arrow
Visibilty of data that can take weeks to months
Zero administrative overhead
yellow-arrow
Expensive on-premises hardware and software

Getting Started is Easy

Step 1

Deploy DataDiscover virtual machine and start scanning

Step 2

Explore your Results

Step 3

Take Fact-based Action

Start your free trial

DataDiscover FAQs

What does DataDiscover show me?

A unified view of all file data across your enterprise. It answers the “what do I have”, ”where is it”, and “how old is it” questions.  DataDiscover provides an interactive experience to browse and find data project by project allowing you to take surgical action.

What can I do with better the visibility?

Our customers typically take many actions based on the visibility a metadata management tool provides them with.  At the most basic level, it empowers them to have a fact based conversation with their end users who are responsible for the data footprint.  Some use this insight to decide what data to continue to retain and what to delete. Others leverage our DataProtect solution to archive aged data onto our data protection platform.

How is DataDiscover deployed?

DataDiscover is deployed as-a-service, meaning a small VM on-premises that talks to the Igneous DataDiscover cloud service where all metadata is processed into the interactive view of your data. There is no infrastructure needed on-premises other than the virtual machine making deploying DataDiscover easy and frictionless.

How are you securing my metadata?

All metadata scanned by the Igneous DataDiscover virtual machine is compressed and encoded into a proprietary binary format. Data in transit is encrypted as it is uploaded to the Igneous cloud instance via HTTP over TLS (HTTPS). This uploaded data is sent to a customer specific endpoint where all metadata is secured and isolated. Each provisioned Igneous cloud instance is single-tenant and customer-specific.

What do I need to deploy?

A virtual machine with 4 to 8 cores, 16 to 32GB of RAM, and 100GB of disk space. Outbound access to cloud.igneous.io over https (port 443) and a customer specific endpoint over https (also port 443).  One time administrative access to NetApp, Qumulo, Pure Flashblade, or Isilon simplifies system import through APIs but is optional. All other NAS (Lustre, LInux, Gluster, ZFS, Stornext, GPFS) can be imported and scanned.

How is DataDiscover different than other file analytics tools like DataFrameworks and Komprise?

The biggest differences between Igneous DataDiscover and any other packaged file analysis or metadata management tool is our simplicity, speed and scale.  Igneous is designed for billions of files, petabytes of data and deep directory structures across all NAS systems. We scan at a rate of 200,000 files/sec (that’s 17 billion files a day/per job) without impacting the performance of your NAS systems.  And to keep things simple, we deploy in a single VMWare VM, have you up and running in minutes, and proactively monitor and update DataDiscover software.  There is no software to update, databases or elastic search clusters to manage, we do all the heavy lifting using our InfiniteIndex technology in the cloud so that you dont have to worry about it.

What types of systems can be scanned?

Isilon NFS/SMB, NetApp C-mode NFS/SMB, NetApp 7-mode NFS/SMB, Pure Flashblade NFS/SMB, Qumulo NFS, any other NAS NFS.

How much does DataDiscover cost?

You can use DataDiscover for as little as $2000 a month.  You'll get unlimited scanning across all of your NAS systems in a single location using one DataDiscover OVA.  If you'd like to extend out to more locations, generate faster scans in a single location, export results data, or 1-click archive give us a call for pricing info.

Is DataDiscover available on AWS Marketplace?

Yes. If you are already an AWS customer you can quickly purchase DataDiscover through AWS Marketplace. You can consolidate your cloud spend by including DataDiscover on your AWS bill and use committed AWS spend or credits.  We offer a free 30 day trial - just contact us at awssales@igneous.io and we’ll get you started.

Is there a performance impact on my NAS filers when DataDiscover is running?

Little to none.  We monitor the latency of file system operations and dynamically throttle down scan operations if latency starts creeping up.  You can give it a try on your systems using the DataDiscover Test Drive.

What are the requirements for deploying DataDiscover?

Click here to learn more about the requirements and answers to frequently asked questions about deploying DataDiscover.