
Researchers develop new open-source system to unlock and leverage the potential of complex datasets



Apache Hive is an open-source project built on top of Apache Hadoop. It supports reading, writing, and managing large datasets stored in a variety of repositories, and it lets users plug in their own functions for custom analysis.




Researchers develop new open-source system to manage and share complex datasets



Because National Center for Education Statistics (NCES) datasets are large and complex, researchers often need several different software tools to access, clean, and analyze the data. In response, AIR developed specialized tools to streamline the process. These tools take advantage of advances in computing and reflect the shift in higher education away from commercial statistical software packages and toward open-source ones.


Director: Daniel DeLaurentis

The Center seeks to address challenges encountered in managing, integrating, and developing complex systems (vehicles, platforms, humans) and in information management for collaboration in systems-of-systems (e.g., missile defense, transportation, network-enabled architectures).

