Do You Miss Statista?

Last Fall, UC Libraries made the tough decision to break up with our Statista subscription.  The decision came down to two reasons: 

  1. Statista’s price rose over 200% in the past two years
  2. More importantly, Statista’s data is accessible from other vendors and resources.  The Statista  platform serves as a portal to finding statistics. Basically, subscribers pay for ease of access to a  lot of statistics. Most of these statistics come from government resources, industry associations, and other vendors of statistical information.  

A few examples:  

  1. Data Planet
    Touted as the “largest repository of standardized and structured statistical data”. It has 13.5 billion datasets, pulls from over 90 data providers and covers 16 major subject categories. It also provides users with tutorials on how to analyze data and incorporate it into their research. 
  2. US Census
    One of the largest producers of social and economic data and statistics. The economic census includes Trade, Manufacturing, Construction, Financial, Insurance and Real Estate statistics. 
  3. FRED (Federal Reserve Economic Data)
    Economic data out of the St. Louis Federal Reserve Bank including inflation rate, consumer price index, unemployment rate, and more. FRED also provides a lot of teaching resources for economic data.  
  4. Market Research Reports from IBIS, Mintel, Euromonitor and many more!
    UC Libraries subscribes to many market research databases.  These reports provide domestic and international industry level data as well as consumer demographics and statistics. 

These databases and resources are accessible through UC Libraries and UC’s research guides. These resources are being updated these through the Summer, so please check back to see what’s new. 

If you are looking for something specific, please think of Maggie Patel (UC’s Business and Data Analytics Librarian) as your statistical matchmaker and she will work with you to find the appropriate resource.  You may contact her through email.

Please don’t despair if you loved Statista, there are plenty of other statistical databases in the sea.  You will improve your research skills and discover new statistical resources. Fall in love again with something new! 

UC Libraries Mark Chalmers named Drexel University LEADING Fellow

mark chalmersMark Chalmers, science and engineering librarian, has been accepted into Drexel University’s highly selective Library Information Science (LIS) Education and Data Science Integrated Network Group (LEADING) program as a 2021 LEADING Fellow.

The LEADING program is a Laura Bush 21st Century Librarian (LB21) National Digital Infrastructures and Initiatives project, supported by the Institute of Museum and Library Services (IMLS) and hosted at Drexel University’s College of Computing & Informatics (CCI). The LEADING program scales-up the highly successful LEADS-4-NDP initiative and will prepare a diverse, nation-wide cohort of 50 LIS doctoral students and early to mid-career librarians for data science endeavors.

LEADING Fellows will complete the following:

  • self-paced online preparatory curriculum,
  • an online data science bootcamp with Drexel and other LEADING fellows,
  • six-month virtual data science internship at a LEADING hub or node,
  • development of a communication plan to connect with mentor/s on a regular basis, and
  • development of research output (e.g., a paper, poster, presentation, published data, etc.).

Mark’s LEADING Fellow research project will focus on “Assessing Overlap and Aggregation Potential of Open-Source Software Platforms and Their Data.” LYRASIS, a consortium that provides access to the world’s shared academic, scientific and cultural heritage, currently offers six hosted services on open-source software. Five of the six of which involve academic libraries and their cultural heritage assets. As a result, LYRASIS seeks expertise in cultural heritage metadata and data science to assist in assessing this overlap between platforms. In addition, they seek recommendations on how machine learning models can be leveraged to connect data in different platforms and to assess how researchers can be served by access to data across the different platforms.

“I am eager to work with Drexel and LYRASIS to grow my competencies in the domains of data science, data integration, large scale machine learning and exploratory feasibility projects,” said Mark upon learning of his acceptance into the LEADING program.

“The experience and skills Mark will gain as a LEADING Fellow will not only contribute to the completion of the LYRASIS research project, but he will bring new knowledge back to his role in the Science and Engineering Libraries while at the same time contributing to the university’s digital future and open access agenda,” said Xuemao Wang, vice provost of digital scholarship and dean and university librarian.

Love Data Week, Feb 8-12, 2021.

Love Data Week 2021

Love Data Week 2021

Love Data Week takes place the week of Valentine’s Day. Created to raise awareness of research data management, sharing, reuse, and preservation of data, it has been promoted by library and data professionals since 2016 in both online and in-person events.  You can see some of the events taking place around the world at the International Love Data Week 2021 schedule.

One of the ways we at RDS/UCL celebrate is to participate in the Adopt a Dataset program that ICPSR hosts every year.  ICPSR choses to highlight interesting data sets from their collection for individuals to explore to learn more about data. Once a dataset is chosen, you may fill out the Dataset adoption form and have your name added to the wall of adopters.  Adopters are encouraged to delve into the data by reading about the research, look at the variables, try out the analysis tools online, and read related publications.

ICPSR is a member consortium that UC belongs to that provides access to datasets from over 15,000 studies, over 5.6 million variables, which have had over 95,000 publications that cited those studies. It also provides curated data repository services for researchers, including secure data enclaves, which fulfill finding requirements for data management plans. Although most studies are quantitative in the social and behavioral sciences, there are also themed collections in the arts, humanities, and some health sciences. Teaching resources, online analysis tools, and pre-made exercise modules also are available to members.

There are a wide variety of datasets to choose from this year, ranging from education, to social media, social justice, to health. I chose to adopt one on music, Study of Jazz Artists, 2001 (ICPSR 35593). I started out by looking at the description of the study and how it was conducted. I then jumped into browsing the variables, one of the tabs found on the data set page. One of the great features of ICPSR is that you can search by variable if you are looking for specific studies to replicate or want to see if your own survey questions fit what others have asked in the past.

One variable jumped out at me – Q32- Age Began Playing First Instrument. My own children play instruments and I always wonder if we’ve started them at the right age or not. Looking at the result for this variable, I can see the unweighted results, including summary statistics and a variables chart. The median age was 9, the mode was 10, the maximum was 35, and the minimum was 1! I guess there’s still hope if you’re in your 30s to pick up an instrument to become a professional jazz musician!   The sweet spot to start your child appears to be the 8-10 year old range.

If you have questions about Love Data Week,  ICPSR, Data, Data Management Plans, Cleaning, Storing, Finding, or Using Data, contact us at Research and Data Services here at UC Libraries. We would love to help you with your projects, offer a workshop to your department or class, or discuss your data needs.

3C GIS Day Nov. 18 to explore power of geospatial analysis and visualization to answer research questions and solve real-world problems

GIS graphicJoin us Wednesday, November 18, 9:00am – 4:00pm for GIS Day, an annual event for students, staff, faculty, and the broader community to learn more about Geographic Information Systems (GIS) and to celebrate the power of geospatial analysis and visualization in answering research questions and solving real-world problems.

The 2020 GIS Day program, called 3C GIS Day, is a collaboration between The Ohio State UniversityCase Western Reserve University, and the University of Cincinnati.

This exciting virtual program will include four tracks of lightning talks highlighting diverse applications of geospatial methods and technologies, an ArcGIS StoryMaps Showcase featuring contributions from the three sponsoring institutions, as well as provide opportunities for professional networking and development.

Registration is now open for the 3C GIS Day and the 3C StoryMaps Showcase

The event is free, online, and open to all.  Questions can be sent to ASKData@UC.Edu.

Join us for UC DATA DAY, Oct. 23 to examine World Changing Data: How Digital Data Will Change Our Future

Sponsored by UC Libraries, IT@UC the Office of Research and the Office of the Provost, the virtual UC DATA Day will include a trainings, an interactive panel and keynote speaker Glenn Ricart, founder and CTO, US Ignite, who will present “A Day In Our Digital Future – The Intersection between Data and Humans.”

Researchers producing and using data face similar, but unique, challenges in data management, data sharing, reproducible research and preservation. This event highlights these challenges and showcases solutions and opportunities available to the broad research and education community.  UC Data Day 2020 focuses on the role and impact of the world-changing data generated by the explosion in Artificial Intelligence, Machine Learning and VR/AR, including how we work, live and educate in our urban digital present and future.

Event details, including registration and panel information, is available at https://libapps.libraries.uc.edu/blogs/dataday/.

data day graphic

Offering Workshops on the Publishing Life Cycle of Data

In partnership with the Faculty Enrichment Center, UC Libraries RDS and IT@UC Advance Research Computing are pleased to offer the next three sessions in our DCSS series focused on the Publishing Life Cycle of Data.  Visit the Faculty One Stop Professional Development site for more information and to register for any of these events.

Next up in the series:

July 13, 2-3 p.m. Getting Published – A Journal Editors’ Panel  will feature UC faculty and staff who serve as editors on journals discussing how to identify the right journal, avoid predatory journals and how to make the biggest impact with your research and scholarship by publishing more than just the article including data publishing and taking an active role in the publishing workflow.  Jeff Blevins, Theresa Culley, Steve Lang and Victoria Carr are the featured editors. Visit the Faculty One Stop Professional Development to register for this event.

Jeff Blevins, Theresa Culley, Steve Lang and Victoria Carr

Jeff Blevins, Theresa Culley, Steve Lang and Victoria Carr

Save the dates for these upcoming events as registration information will be available soon:

July 27, 2-3 p.m. “Understanding Open Access and Open Data” will feature Claudio Aspesi, Senior Research consultant and advisor to SPARC, coauthor of “The Changing Academic Publishing Industry – Implications for Academic Institutions.”

August 10, 2-3 p.m. “Publishing Data in Repositories” – more details to come.

 

Dorcas Washington takes on new role of data analytics specialist on the Research and Data Services Team

dorcas washingtonDorcas Washington, formerly content analyst on the Content Services Team, has transitioned to the new role of data analytics specialist on the Research and Data Services (RDS) Team.

In her new role, Dorcas will provide leadership and expertise in the areas of quantitative and qualitative data analysis, as well as play a leading role in developing and executing a research reproducibility support program across disciplines. Dorcas holds a BA in mathematics, an MS in applied statistics with a concentration in bio-statistics and has recently been accepted into UC’s PhD program for environmental health: bio-statistics. In her new role, she will be able to utilize her knowledge and skill set to its fullest and to build upon her experiences gained as a member of the Content Services Team.

As data analytics specialist, Dorcas will be responsible for:

  • Leading library services related to the evaluation, manipulation and visualization of data and the use of statistical tools
  • Developing and delivering scalable research and data-related services and resources in collaboration with RDS, with the Digital Scholarship Center, and others as appropriate
  • Leading the development and implementation of consultation and instruction services for statistical and analytical methods
  • Promoting best practices for ethical and reproducible data production, analysis and dissemination
  • Developing and coding executable programs to automate processes to perform computational inquiries
  • Co-managing Informatics Lab operations within the Donald C. Harrison Health Sciences Library.

“I look forward to getting back to my roots of mathematics and statistics by utilizing those skills to help people further research at UC,” said Dorcas. “This position interested me because it’s doing work I enjoy with a diverse group of people (students, faculty, staff and more).”

Those interested in services, should contact RDS via e-mail askdata@uc.edu.

CANCELLED: Got Data?! Join Us for the Data Visualization Showcase Oct. 25

PLEASE NOTE: THIS EVENT HAS BEEN CANCELLED.

The University of Cincinnati Libraries’ Research & Data Services is calling for virtual submissions that best demonstrate the power of visualization to present complex data.

Event

The Data Visualization Showcase will be held from 1-3 pm on Friday, Oct. 25, 2019 in the Visualization Laboratory (240H Braunstein Hall, Geology-Mathematics-Physics Library). Coffee and refreshments will be served. All are welcome.

Eligibility & Deadlines

Submissions for the showcase are open to all University of Cincinnati affiliates, but must be submitted to AskData@uc.edu by Oct. 11 to be considered for the awards. All submissions will be evaluated by a panel of judges and should follow submission guidelines.

Judging

The showcase will be juried by a panel of interdisciplinary judges scoring each submission on the following four tenants of data visualization: Impact, Storytelling, Technical Aptitude and Creativity. See the rubric for more details.

Continue reading

Research Labs @ GMP Library News update – Zhiyuan Yao Attends the AAG-UIUC Summer School

Zhiyuan Yao is one of two GIS support students working in the Research & Data Service research labs at the Geology Math and Physics Library.  The Data & GIS collab is open to students, staff and faculty seeking help with their geospatial data needs, and the Visualization lab is open for data visualization consultations and collaborative work.  Email us at ASKData@ucmail.uc.edu for more information.  

Great learning and collaboration experience in AAG-UIUC Summer School

This summer in July, I was honored to be offered the opportunity to attend AAG-UIUC 2019 Summer School, which focused on Reproducible Problem Solving with Cyber GIS and Geospatial Data Science. During the one-week summer camp, I met many scholars, got access to the supercomputer Virtual Roger through CyberGIS-Jupter, learned the cutting-edge advances regarding geospatial data science, and got a deeper understanding about reproducibility and replicability. I absolutely had a wonderful time there, and this experience provoked me to think more about how we could develop novel solutions to complex problems.

 

Participants in the AAG-UIUC summer school with mentor Diana Sinton (Ex Director of UCGIS in the  green shirt) in the middle.

Continue reading

XSEDE High Performance Computing (HPC) Boot Camp

Posted on behalf of Amy Latessa,  Research Coordinator for IT@UC Research & Development

The Data & Computational Science Series presents:
XSEDE High Performance Computing (HPC) Boot Camp

Free Registration (required)

Description: This 4-day event will include MPI, OpenMP, OpenACC and accelerators. This event will be presented using the Wide Area Classroom(WAC) training platform and will conclude with a special hybrid exercise contest that will challenge the students to apply their skills over the following 3 weeks and be awarded the Fifth Annual XSEDE Summer Boot Camp Championship Trophy.  In addition, an XSEDE Badge will be available to those who complete the Challenge.

Tentative Agenda

When: June 3-6, 2019

Location: Langsam Library room 462

2911 Woodside Drive
Cincinnati, OH 45221

 

Note: You need an XSEDE account to register Create Account

Parking: Woodside Avenue Garage

Special Instructions: Participants should bring their own laptop, lunch will be provided.

Questions? Contact Amy Latessa 

XSEDE (eXtreme Science and Engineering Discovery Environment) is a virtual system that provides compute resources for scientists and researchers from all over the country. Its mission is to facilitate research collaboration among institutions, enhance research productivity, provide remote data transfer, and enable remote instrumentation. XSEDE is funded by National Science Foundation (NSF).  Getting Started Guide for XSEDE.