Tuesday, 19 May 2015

Ask Data Anything

Ask Data Anything is Cognitum's approach to exploring data by using a subset of natural language which articulates concepts and instances modeled in ontologies to provide a meaningful quering experience. Ask Data Anything seizes on regularities of language to provide a natural interpretation of queries being asked; its semantics are provided via R and rOntorion (alternatively  F# and Ontorion).

Technically, Ask Data Anything is capable of performing projection, sub-setting, grouping and aggregation operations, providing answers for queries involving the following information:
  • What? Any of the columns of your data table are considered a quantitative field over which to perform queries,
  • How? How the output is to be shown. The results of the query can be retrieved on either a table, histogram or a map,
  • Where? (Optional) The "in" preposition allows to restrict the search to an specific named group of items  as happens for instance with continents which can be seeing as a group of countries,
  • Of? (Optional) The "of" preposition allows to dive into the data, restricting the desired results to a certain set of types (concepts in the Fluent Editor sense) by searching the data in a certain column for instances (in Fluent Editor sense) of those types; we call this material sub-setting,
  • By? (Optional) By which type (in Fluent Editor sense) you would like to group the results for aggregation purposes.
  • When? (Optional) Queries can contain time constraints.

Inside Ask Data Anything

Ask Data Anything consists of the following blocks:

Data and Models are tightly coupled, for models provide an interface to query the data and everything aimed to be queried needs to be modeled in order to provide an appropriate interpretation.

The ontology modeling adds additional semantic layers on top of your data, expanding the search dimensionality and providing in turn an insightful querying experience.

Operational Semantics

Next we are going to briefly describe the 4 types of exploring modes available: projection, aggregation, sub-setting (Circumstantial) and material sub-setting (Conceptual).

For demonstration purposes, let us take the data represented in the following table as our sample dataset:

Transaction Item Price Quantity City Date Trademark
T-001 Sleeve Shirt 543 11 Warsaw 03/07/2013 Lacoste
T-002 Men's Dark One Button Suit 1395 15 Krakow 07/12/2013 Armani
T-003 Solid Polo Shirt 580 18 Krakow 17/08/2012 Gucci
T-004 Men's Mallow Graphic Tee 163 9 Warsaw 19/03/2013 Nike
T-005 I'm Bob Graphic Tee 73 5 Berlin 01/02/2014 Zara
T-006 21 Years Old Women's Dark T-Shirt 386 7 Alicante 22/12/2012 Armani
T-007 Hanes Men's Comfortblend EcoSmart Jersey Polo, 2 Pack 425 11 Berlin 14/03/2013 Chanel
T-008 Men's Short Sleeve Stripe Polo 820 7 Boston 05/09/2014 Tommy Hilfiger
T-009 Women's Button Down Roll Tab Shirt 244 12 Munchen 29/08/2014 Lacoste
T-010 Men's jeans 184 17 Munchen 06/02/2012 Nike
T-011 Men's Geometric Print Short Sleeve Shirt 975 12 Alicante 23/08/2014 Armani
T-012 Men's Sasquatch Hunter Graphic Tee 180 10 Boston 19/11/2012 Gucci
T-013 Women's Essential V-Neck Tee 147 3 Madrid 21/07/2013 Nike
T-014 Men's Bass Guitar Guy Graphic Tee 86 2 Krakow 22/03/2012 Zara
T-015 Men's Essential shirt 754 2 Boston 01/04/2013 Zara
T-016 Women's Scoopneck Tee 2-Pack 448 6 Munchen 26/07/2012 Chanel

 What's inside?

To start exploring the possibilities, it is always useful to know what is inside:

The dimensions are the columns of the data (quantitative fields), the possible operations are sum and averaging and the outputs are histogram, table and map.


This identity operation allows for projections over the data, retrieving subsets of it meeting certain requirements expressed through mathematical expressions.

Example query:

Item with price > 700


Aggregation is performed over hierarchical data, modeled in ontologies through typed instances (instances of concepts) related by either an "making part of" property or the ordinary time embedding  i.e., days as part of months and months as part of years.  This way instances of the concept country are related to the concept continent as "Every country is part of a continent".

Example query:

Quantity summed by month on histogram:

Sub-setting (Circumstantial sub-setting)

Sub-setting allows to retrieve a subset of the data by a (circumstantial) belonging relation. This means, we can ask about the specific results in a given country or modeled group of instances: in this latter case we can constraint the result for groups of brands categorized by origin, i.e, Spanish, American, etc.

Quantity summed in Europe by country on map:

 Quantity summed in Spanish-Brands:

Material sub-setting (Conceptual sub-setting)

Material sub-setting allows sub-setting by diving into the data properties as modeled in the provided ontology. This features allows us to make pretty expressive queries as:

Quantity summed in Europe for item of (type) t-shirt:

By default the aggregation is performed in the target type that is marked here by the query sub-part "in Europe", which subsets the data using as discriminant a continent, so it returns the result of quantity for cities (which is the type in the data and is a part of continent). This behavior can be modified by adding the "by" part, as in "by brand", which would retrieve the aggregated sum of  the quantitative field Quantity by brand (Lacoste, Armani, etc.).

You can go further and make a consistency check by retrieving all t-shirts from the Item column:

Semantic Modeling

The key feature offered by Ask Data Anything is adding additional semantic layers on top of the data (which are not explicit in the data itself) implying an increment of its dimensionality, which enhances the possibilities for data exploration.

This way we are capable of asking queries as Price averaged in French-Brands, with French-Brands being a modeled instance of some "brands-by-origin" concept, which adds a grouping abstraction over the (modeled) brand instances that is not otherwise present in the data. Hereby we have models for the brand instances "Chanel" and "Lacoste" and therefore the averaging would be performed over all occurrences of this 2 brands in the data.

In summary Ask Data Anything can handle queries involving:
    1. Location
    2. Time
    3. User-defined concepts
    4. Instances of predefined concepts
Extracting this information from queries to perform the appropriate action chain: any (mix) of projection, sub-setting, grouping or/and aggregation.

Watch quick overview of Ask Data Anything: https://goo.gl/XnaIq3
To learn more about Semantic Technologies visit: http://goo.gl/7pkWIQ


  1. Replies
    1. https://goodstyle.tech/ You can go further and make a consistency check by retrieving all t-shirts from the Item.

  2. Hello,
    I have a question related to the data loading process and the data type recognition.
    In the example above you have the column City which contains the cities' name, with ADA your able to aggregate over them. I would like to understand if ADA recognize those values as cities by the column name (City) or it is able to recognize by the column content or there is some config file which contains some configuration.
    Thanks a lot in advance.

  3. Sometimes technology amazes me; how far have we come and how more are we going to progress! I’m an IT student and I have recently been given to write an assignment upon a similar topic for which I’ve taken Cheap Assignment Help to prepare a thoroughly researched assignment. I hope that it would cover all the important points.

  4. Bookkeepers are sanctioned bookkeepers in London, offering help to huge number of specialists, workers for hire, online business and Sme's. Accountants London

  5. This behaviour can be changed by including the "by" part, as in "by brand," which will return the aggregated sum of the quantitative field. Brand quantity it is truly amazing how Cheap Assignment Help works in a highly systematic order to provide the premium quality paper writing service to students.

  6. This comment has been removed by the author.

  7. These are the expert assistance provided by the professionals possessing the knowledge of a particular subject. These professionals renders their services and knowledge for the creation of quality content in the dissertation. These dissertations are prepared by a group of peoples and for the purpose of the students from all fields of educations be it science, commerce, any other. Even these professionals are hired for their content either in educational institute, company, or an organisations. These dissertations are written by the professionals as others possessing other subject’s cannot prepare it.

  8. Thanks for sharing the information. Data can be easily explored if we ask for the specific brand or word. For example, I want to know the best dissertation writing services, it is easy and efficient way to ask the data to interpret the query with the best solution.

  9. Science assignments deal with the investigation of the cosmos and its minuscule elements. When choosing a scientific major in college or university, students find that they have a lot to learn about the science subject and its various subfields. We at British Dissertation Help offer online Science Assignment Help and make an effort to cover the many subfields of the general subject. On a variety of subjects, including formal science, social science, and natural science, top academics and researchers are accessible to answer your questions. We provide writing services for scientific assignments in all of these fields at the most affordable rates because of our sector-leading competence.

  10. AWS was formally sent off in 2002 and began picking up speed on the double: north of 1 million clients, around 42% offer in the public cloud market (winning the greatest players - Microsoft, Rackspace, IBM, and Google). In 2018, AWS administrations produced a lot of $26 billion in incomes. As of the Q1 of 2020, AWS has previously figured out how to make $10 million which demonstrates its authority among the contenders.

    AWS administrations cover various enterprises: fabricating, IT programming advancement, medical services, energy and power, creation. Both tech monsters like General Electric, Intuit, and Streams Siblings and new businesses, like Airbnb, Mapbox, and Lyft enlist AWS specialists to upgrade their client administrations while zeroing in on developments acquired from AWS>> aws developers

  11. Prepositions can do things other than locating in space or time but almost all of them modify in one way or another. Most of the time prepositions are combined with other words to form prepositional phrases. Use preposition finder for this. A prepositional English prepositional phrase always begins with a preposition and ends with a noun or pronoun that is the preposition. is the purpose of. The object is the word to which the preposition belongs.

  12. It's always exciting to see how technology can be used to streamline and improve data analysis processes. Same , students can get online assistance from Dissertation Writing Services UK to improve grades.

  13. I found it quite fascinating. It's amazing how technology has come so far that we can now rely on data for answers to our burning questions. Speaking of questions, I have one for you. Can data help to do my university assignment for me? See, I'm struggling to keep up with my coursework, and I'm wondering if there is any way that data can assist me in completing my assignments.

  14. Cognitum uses sk Data Anything, a subset of natural language that articulates concepts and instances that are modeled in ontologies, to provide a meaningful querying experience for data exploration. In order to provide a natural interpretation of the questions being asked, Ask Data Anything capitalizes on language patterns; R and contortion provide its semantics.

  15. Ask Data Anything refers to the concept of engaging in a conversation or interaction with an artificial intelligence (AI) system, like ChatGPT, where users can pose questions or request information on a wide range of topics. The phrase implies that users have the freedom to inquire about any subject matter and seek answers or insights from the AI, leveraging its knowledge and capabilities to provide responses. Whether it's about factual information, advice, explanations, or opinions, users can ask the AI anything within the limits of its programming and training. The AI model attempts to understand the queries and generate relevant and coherent responses based on its training data and understanding of language patterns.If any one want to buy dissertation service at a cheap rate.You can easily make a call.