Wednesday 21 October 2015

Ask Data Anything - NYPD Motor vehicle accidents

In modern organizations, data management is a major issue and at the same time a major resource. In our experience, the first challenge a business that wants to use its data is facing how to have a unified view of their data. Generally data inside organizations is stored in different databases that have often proprietary API making it difficult to move from one database to the other. Furthermore, also when the technology used to store data is the same, there are still semantic problems like different terminologies, languages etc.


The bigger the company is, the lower the possibility to standardize the procedures are, so that these kind of situations will not happen. This happens because we are human and we naturally tend to interpret data using our own experience and knowledge. Thus we cannot expect the technical team to call all pieces of a car using the exact same terminology as the logistic department. This is why, our solution aims at giving the possibility to standardize the way in which the end user interact with the data without actually changing the source of the data.

Ask your Data Anything (ADA), allows companies to add a semantical layer on top of the data without the need of copying data. The product is managing term disambiguation, aggregation of data using hierarchies defined in ontologies, data integration between different data sources.


This tutorial shows ADA's capabilities and it is based on New York City Police Department's data on Motor Vehicle Collisions. It reveals how easily a person without specialist training can access exact data such as number of injured in accidents on every street, or a statistical comparison for number of killed between different types of vehicles. Results aggregated by locations can be shown on map to provide a complex overview of road safety in New York City. All necessary information for police officers, journalists and interested citizens is at hand.

 Many useful and free datasheets containing statistical data from New York City can be accessed and downloaded from this (https://data.cityofnewyork.us/website). The data we used for making tutorial is slightly different from the original one. Amount of data was largely reduced and column names were changed. A small ontology was used to describe crucial concepts.

Data structure

After importing data to ADA we can check what it contains by clicking "What's inside?".



Our data is a table in which every row represents information about particular accident. ADA's Dimensions tab contains names of columns, treated as ontological concepts.The Operations tab informs about possible operations to be used in query. The Output tab shows possible methods to visualize query results.
As one can see, the data is consisted of temporal information (date, time), location (borough, street, longitude, latitude, (zip)code), type of vehicle that had an accident, factor which was the main cause, number of killed and injured people (with distinction to injured pedestrians cyclists and motorists).

To view all data stored in particular column simply write its name and press ENTER.


Getting the data

One of the simplest questions one may ask is about getting the number of accidents in a particular area. It can be done by executing the following query:

Example query:
count by borough in Brooklyn



The count operation counts number of instances of a particular concept, in this case borough. By adding in Brooklyn results are restricted to only those that have Brooklyn as its borough. So the returned value is the number of accidents(rows) that happened in Brooklyn.

The number of accidents in a borough may be rated as high or low only when it is compared to similar rates in other regions of city. To compare some data, one can use summarize operation.

Example query:
summarize borough by city



After executing the query a table will be shown with number of accidents in each borough. Adding by city specifies the aggregation to city level. Raw data contains no information about city, it is added on top of it by using the ontology file.

A table is not always the best way to represent data. ADA provides additional output methods, for instance pie charts.

Example query:
summarize vehicle by borough on piechart 





Output mode can be changed by adding on followed by the name of output method. The above query returns a set of pie charts, each of them represents factors that causes accidents of one particular vehicle type.

Numerical data

ADA automatically recognizes date and numerical data, so a user does not have to worry about it. It also enables performing basic statistics.

Example query:
sum injured by borough on histogram


Sum injured sums all values from injured column. Typing by borough on histogram presents data divided by boroughs on a histogram.

Another way to present data is by a map. ADA supports Google Maps and can use it to visualize data if a concept, that a user is using to group results, is a location.

Example query:
sum killed by city on map




The size of a green circle represents the value compared to other data points on map. Exact value can be seen in a balloon.

The 'time' column contains hour time of accidents as decimal number (9:30=9.5). It enables using mathematical operations on its values.

Example query:
average time by city



Above query returns an average time value for New York City (because it is the only city in our example). It is approximately 16.7 which suggest that an average road accident in NYC takes place about 4:40 PM, during afternoon rush hours.

ADA allows user to do projections over the data and retrieve subsets of it that match certain mathematical expressions.

Example query:
street with killed > 1

The result shows all streets where accidents with fatalities occurred.

Summary 

Ask Data Anything enables swift access to needed data in a trivial way. A user can check names of streets where accidents with fatalities occurred, compare number of accidents in different boroughs on histogram, check how many percent of accidents involved sport cars on pie chart, see number of injured in every borough on map or simply ask for a whole column from data set. Imposing more restrictions can provide very specific and precise results which would be hard and time-consuming to obtain using traditional applications. ADA can speed up writing reports and articles, it can be also used for educational purposes. By developing the ontology on top of the data set, more advanced constraining and detailed results can be get.  For example, inserting knowledge about street names and neighborhoods would enable asking about accidents in particular region. Such ontology could be used for other data sets that contain data by street names, like a list of nurseries or Chinese restaurants.  

Ask Data Anything is an excellent tool to quickly analize large amount of data in a smart semantic way, with only basic ontology created in Fluent Editor. Cognitum's best specialists work on improving it to provide higher efficiency and more advanced features.

References

A quick overview on Ask Data Anything and its features:



New York Police Department's official website:
http://www.nyc.gov/html/nypd/html/home/home.shtml

Cognitum's official webite:
http://www.cognitum.eu/ 


26 comments:

  1. It's very important that Government agencies adopt to the latest innovation and technology that we have. It will be easy to access information such as the NYPD and the ones from only professional essay writers for hire.

    ReplyDelete
  2. This post is really write-my-essay-for-me.com astounding one! I was delighted to read this, very much useful. Many thanks

    ReplyDelete
  3. Hi guys! Do you like read article about surrealist costume? Avant-garde is a fashion style developed through innovation, and it is a form of revolution in the fashion industry. The kinds of clothes designed through the avant-garde are full of creativity and can only be worn in the theatre. It is a style that was developed by some olden day’s artists, some of which were surrealists and Dadaists. Some avant-garde artists like Jean Piere were the collaborators with Elsa Schiaparelli to develop film costumes.

    ReplyDelete
  4. very nice post. It is an great information about Motor vehicle. you gave some very useful information here. Loved reading this article. Thank so much for sharing with us. Drivill

    ReplyDelete
  5. Great post, really this is awesome informtaion about ask data anything. before I didn't hear about it. but after visiting this post I get some amazing information. thanks for sharing. we giving the best Ride Sharing App in Bangladesh

    ReplyDelete
  6. Hello! If you see that you have no time for writing papers then it will be good idea to use additional help. You can find a lot of useful information at the https://elitewritings.com/buy-article-critique.html and save your time.

    ReplyDelete
  7. Acadecraft hires professional who identify and fix accessibility issues. We are specialized in providing document Accessibility Remediation Solution. We have the capacity to produce high volume Doc remediation capacity to fast transform digital assets, and reach compliance.

    ReplyDelete
  8. Escalators are incredibly convenient. After all, who wants to hike up all those stairs when you can do so effortlessly on an escalator. While escalators don’t exactly strike fear in most of us, escalator accidents do happen, and they can be remarkably dangerous. Often such accidents are caused by faulty design, faulty parts, or inadequate maintenance. If an escalator leaves you injured, it’s time to consult with an experienced Los Angeles personal injury attorney.

    ReplyDelete
  9. Thanks for sharing this article. It's to information and knowledge. The most important point is injurylawyersgroupla Many useful and free datasheets containing statistical data from New York City can be accessed and downloaded from this (https://data.cityofnewyork.us/website). The data we used for making tutorial is slightly different from the original one. Amount of data was largely reduced and column names were changed. A small ontology was used to describe crucial concepts.

    ReplyDelete
  10. Hello there. I create amazing idea after reading this article. As well, it can be also exciting for somebody of you. Likewise, I need to do one more ting. Who can help me write my letter?

    ReplyDelete
  11. We should control accidents because through this, we are losing our lives which are very important to us. So, it is vehicle manufacturing company's duty to provide us more features, and through this, we will save our lives. Assignment help UK.

    ReplyDelete
  12. A cheap 3.5-in-1 ford edge titanium for sale
    A cheap 3.5-in-1 ford edge titanium for sale. The cheapest 3.5-in-1 at titanium easy flux 125 amp welder T-Tech China online, and in used ford edge titanium the Asia. Shop today titanium tv apk from our implant grade titanium earrings wide range of suppliers titanium jewelry for piercings

    ReplyDelete
  13. I appreciate you sharing your wisdom and amazing thoughts about NYPD Motor vehicle in this post; I found your information to be quite educational. This is incredibly useful because most people aren't familiar with it. This page now has some really intriguing content that you contributed. If you ever need assignment help, assignment help services in Manchester is incredibly affordable, so please make use of this. I'm going to tell all of my friends about this website after utilising it today.

    ReplyDelete
  14. Whether you want to observe a fantastic example of how to expand your team or obtain support for your talent from the industry, this knowledgeable group of IT experts is available to you. This business can provide guidance on where to get a respectable consultant and Choosing the Best Medical IoT Services | Hire IoT Developers. It also has many years of experience and stellar evaluations. If there is such a deal, don't waste your time.

    ReplyDelete
  15. This comment has been removed by the author.

    ReplyDelete
  16. Amazing, Your blogs are really good and informative. I got a lots of useful information in your blogs. Sum injured sums all values from injured column. Typing by borough on histogram presents data divided by boroughs on a histogram breach of contract dispute. It is very great and useful to all. Keeps sharing more useful blogs...

    ReplyDelete
  17. Your blog about NYPD Motor vehicle accidents is a consistently provides valuable insights and engaging content. I appreciate the dedication and expertise you bring to your writing. Your well-researched articles are a source of inspiration and information, making your blog a go-to destination for knowledge and enjoyment. Thank you for the quality you consistently deliver. New York State No Fault Divorce Dismiss Order Of Protection New Jersey

    ReplyDelete
  18. "Unleashing the power of data, this initiative opens the floodgates for insightful inquiries into NYPD motor vehicle accidents, fostering a data-driven dialogue."
    By inviting questions from all angles, 'Ask Data Anything' catalyzes a collaborative exploration, unraveling patterns and promoting a deeper understanding of road safety dynamics in the city.||Monmouth County Reckless Driving Attorney ||New Jersey Careless Driving Statute

    ReplyDelete
  19. "Ask Data Anything's exploration of NYPD motor vehicle accidents offers a comprehensive dive into the crucial data surrounding road incidents. The platform's user-friendly interface and insightful queries enable a nuanced understanding of patterns, contributing factors, and potential preventive measures. Unveiling the layers of information within NYPD's accident records, this resource proves invaluable for anyone seeking to grasp the dynamics of motor vehicle incidents in the city. From analyzing hotspots to deciphering the time-based trends, Ask Data Anything empowers users to make informed decisions and advocates for safer roadways. A commendable effort in leveraging data for public awareness and fostering a culture of safety on the streets." Most students are drawn to these types of articles and information, but they are unable to prepare for their exams, If you have been struggling with your exams and want assistance, students can do my class - do my online class and get higher grades on their examinations by providing them with the best available resources, including quality academic services.

    ReplyDelete
  20. I recently delved into the comprehensive dataset on motor vehicle accidents provided by the NYPD, and the insights are truly eye-opening. The data not only sheds light on the frequency and locations of accidents but also offers a deeper understanding of contributing factors. It's a goldmine for anyone keen on enhancing road safety or analyzing traffic patterns. Kudos to the NYPD for making such valuable information accessible and paving the way for informed decisions in our community. Most students are drawn to these types of articles and information, but they are unable to prepare for their exams, If you have been struggling with your exams and want assistance, students can pay to do my online class - pay someone to do my online class and get higher grades on their examinations by providing them with the best available resources, including quality academic services.

    ReplyDelete
  21. In the realm of information and data accessibility, the concept of "Ask Data Anything" takes center stage, creating a bridge between individuals and a wealth of knowledge. This innovative approach empowers users to pose questions about various subjects, including complex datasets such as NYPD motor vehicle accidents. As we navigate the intricacies of understanding accident trends and patterns, it becomes evident that a similar thirst for knowledge extends to individuals contemplating the query "do my online course-do my courses." Just as one seeks answers about motor vehicle accidents through data, individuals seek educational empowerment through online courses. This parallel underscores the importance of embracing the digital era for both information and education, demonstrating that the desire to learn is as diverse as the questions we ask of our data.

    ReplyDelete

  22. A tech blog is an online platform that focuses on publishing articles, reviews, and updates related to technology, gadgets, and digital advancements. These blogs often cover topics such as software, hardware, cybersecurity, and emerging tech trends. Tech blogs serve as informative resources for tech enthusiasts, professionals, and the general public interested in staying updated on the latest developments in the tech world.
    virginia statute of limitations personal injury
    northern virginia personal injury attorney
    abogado de accidentes de semi camiones
    bufete de abogados de accidentes de motocicleta






    ReplyDelete