Making the Most out of ICPSR Membership


Welcome again to Making the Most out of
ICPSR Membership and thank you everyone for joining us. We are going to get right
to the agenda here and that is an overview of ICPSR data and education
services. We’re going to talk through our data, our training and education services,
as well as our data management and deposit options. As we walk through those
three areas with data being, kind of, the the biggest or the longest, if you will,
of those areas we’ll examine some of the unique elements of these services. I
call them sometimes uniquely ICPSR because a number of these things we do a
little bit differently than what you’ll see of other services that are
out there. Then after that we will end with a tour of the tools and
training available to ICPSR, particularly tools and training that are
aimed at Official and Designated Representatives or those who are
managing ICPSR with their organizations from an administrative front. We’ll get
started with What is ICPSR? I’m making an assumption that some of you are
quite new to ICPSR and that’s why you’re here. We are one of the world’s oldest
and largest social science data archives. We were founded in 1962 and then the Summer
Program in Quantitative Methods was founded a year later and we’ll talk more
about that program throughout… after we get done with data, I guess. Our
current holdings are around 10,000 studies. There are oftentimes
many more data sets or many data sets affiliated with a study so it’s about…
getting close to 78,000 data sets that we hold in the repository. We have a
Bibliography of over 72,000 data related publications. These are publications that,
somehow those data sets were part and parcel to generating those research
works. So we’ve connected them to each of the data sets where possible. It’s not, we
never claim that it is global in perspective. We’re always
finding new things and we actually rely on a lot of you out there to tell us “Hey
I found this research work. Could you check? I think it’s related to this
data set.” So that’s always a continuing evolving database. We are a membership
that started out with 22 institutions and now we have over 770. The portion of
the data collection, about half of it is what we call member funded. It exists
because we have institutions who are paying this annual membership fee.
That allows us to curate, to collect, and keep that data safe. It also helps us to
produce teaching materials for our member institutions. And for those that
attend our Summer Program, whether it’s a four week course, or whether it’s a three
day, or a two day, or a five day workshop those fees to members and
institutions are at about half of those to non-member institutions. A large
portion of the data collection is freely available and that is because it’s
externally funded. It’s funded by government agencies and foundations who
are currently expending money and giving it to ICPSR to curate, and sustain, and
conduct outreach and training on the data that they are most interested in.
We’ll look at those a little bit as well. As I mentioned, we’re
a member organization. We are governed by a governing board
that we call the Council. Those reports, they’re coming in June. Those
reports were just due so I just did the official count and we will be talking
with them about having, right now, 778 consortium members. We are
represented in every state except for Alaska, Hawaii, and South Dakota.
Those are universities across the US and there’s 411 of those right now. We’re
working on those three states. You can imagine I like it… we have had
times when we’ve had every state in the US represented and I’m actively working on getting us to that condition again. We
have several types of institution types. Extensives are traditionally known as R1s,
they represent the largest portion of our members with 140. But we also have,
as you can see, Masters, and Bachelors, and Specialized Institutions, a few Community
Colleges. You will see too that we have Associates. These are institutions that
might be think tanks or some type of NGO, if you will. Canadian institutions are at their highest as they’ve ever been and then of
course institutions outside of North America. So we are global and very
diverse. What we do, this is a little bit of our elevator story, our main
goal is to seek research data and the documents and documentation surrounding
that data from data producers. Data producers might be individual PIs,
they might be research agencies, they may be the federal government, they may be
foundations. Anyone who is collecting data, we head out there and lobby, if you
will, to get that data from them so that we can share it. We process the data that
we bring in, which means essentially cleaning it, describe it with lots of
metadata. Some people call that tagging. And then we preserve the data and
documents, a copy for the public as well as a preservation copy. This is
called data curation and that’s a little bit of a preview of things I’ll talk
about in future slides. But that’s data curation and that’s what we do.
We share that data out with the public and with our members. We provide, of
course, education and training and instructional resources so that we can
get data into the classroom early and often. We’re also assisting grant
writers with data management plans. Often it’s that two-page plan that you know
about if you have anything to do with grants. And then also, of course,
fulfilling those data sharing plans once they’ve won their grant and they are
then required to share that data. And lastly, we are in
a great position to demonstrate the impact of research data. Because when
people come to us and they search, and they view, and they download, and then
perhaps they write a paper on it and we find that paper and connect it,
we’re a great spot for PIs or agencies to come back and say “Look at how many
downloads I’ve had. Look at how many research works have originated,
evolved, or come to being because of the data that we’ve collected.”
And we’re going to show you all of those things shortly. So our mission is that we
advance and expand social and behavioral research, acting as a global leader in data stewardship and providing data resources
and responsive educational opportunities for present and future
generations. And that really does sum it up. It’s simple. It’s get data,
preserve that data, share that data, and train about the data. That’s what our
mission is really talking about. So Let’s Talk Data. I’m often asked “What kinds of
data do you have?” Which a question like that is, of course, something where
it can be taken from different kinds of perspectives.
We’ll combine those first two bullets. So first of all, when we look
through our data catalog and the types of data that were in there, what we
found were over 40 fields or 40 specialties of data that we’re
actually in there or that people who are using the data. So we talk of
ourselves as a social and behavioral data archive, but we define that as very
very… it’s very very diverse and very very broad. And of course, as a lot of
interdisciplinary work is coming in there it’s getting wider and wider all
the time. Kinds of data data might also be “Well do you have survey data?” and
indeed ICPSR started as a lot of survey data in the political science arena.
Shortly thereafter a lot of administrative records came in.
And today we have a growing qualitative data. We have a project that
was very significant that utilized video data. We have some geographic data
coming in. We’re actually in the mode right now of totally replacing our
technological infrastructure because, you know, there’s what they call
“data in the wild” or undesigned data, and also the desire to connect certain data
sets in order for analysis. And so we are redesigning our technological
infrastructure. We’re about, I’d say, halfway through or, maybe on a good day,
two thirds of the way through getting that up and running and redone so that
we can be more flexible as the data that people need and demand, particularly our
members, changes and evolves. So we’re going to go out and we’re going to talk
about how you discover data, what does it look like, and how do you access it? The best way to do that is not to talk on but it’s actually to go out and
take a look at our site. So this is our front page and I’m going to head to
Start Sharing Data. Oh I’m sorry, to Find Data and don’t just say what you’re
reading there, what’s in front of you, I guess, Linda right? So this is the
opening, or the how you open up what we refer to as the ICPSR data catalog.
There are a number of ways that will help you if you’re looking for
restricted use data or wanting to understand more of it, you can click in
that area and we’ll be back there a little bit later. You can use some of
these browse capabilities, it’ll help narrow you in to some things. But for the
purposes of today, I’m going to click on View all here because I want to show you
the totality of the catalog and of our collection. So the first thing
you’ll notice, you see studies here. Studies were today, and this changes
daily as new things are released, are at 10,512. You can see, as you scroll down there, that they’re from all over, all of those
various fields that we noted on the earlier slides. There is also this thing
called publications, which again is what we call the ICPSR Bibliography or
up here it’s called Data Related Publications. What these are, they are research works. They might be, if you float over
this, you’ll see it’s a report or they might be a journal article or
they might be a dissertation. And what we we’ve determined that this
particular report or dissertation is connected to a data set or a study that
we have. So if we look at this 10-year analysis of head and neck injuries
involving nonpowder firearms, we can see that this journal article was based on these studies here that range from 2005 to
2013, so this is a series study. So all of these in our collection, our Bibliography
and our collection it’s a work that is connected to a data set that we
have in these collections. And we have a librarian and a number of student
assistants that help build this collection year after year. The next
thing that you’ll notice is Variables. Variables are what’s inside a… maybe a
questionnaire or, I guess, a data set. So, kind of, the easiest way is if I type
on gender usually many many or sometimes most of our data have a gender
code in there. So we’ll see if this one is from Drug Consumption Collected
Online on March 2011-2012 and so if you click in here, what a variable it’s
just simply the question that was asked or maybe if it wasn’t asked it was noted,
if it was observed. And so in this one, you can see oh this one has 942 females
in it compared to 943 males, which was incredible to get a 50/50
split like that. And it was taken from this particular
study. So that’s what a variable is it’s all of the things that are within
that data set or within sometimes if it’s a survey, the questionnaire. And
you can look, so for example let’s say you wanted, how is the best way to either
observe or ask about gender you have all kinds of samples there that you can then
go in and compare. Gender is probably an easier one to find out but, you know,
maybe you’re looking for appropriate income categories, or perhaps you’re
teaching about the best income categories to use and you want to if
you’re teaching a research methods course and you’re talking about how to
design a survey, this is an excellent getting into the variables is an
excellent way to go. So I’m getting ahead of myself on the teaching piece. The
other thing is Series. So a lot of people want to look at information based on how,
look at data based on how it’s changed or evolved or maybe not changed or
evolved over time. So the Series tab will show you, we’re listing 229 series
type studies, and that will show you sets that we have that, as you can see
here, have been here over time. So the American Community Survey, the ACS. So
this one you can see, let’s see, earliest date 1996 up to 2007 and actually there are
more entries there after. And this one actually has some combined. So
sometimes you’ll see a data set that you don’t have to do the combining yourself
that it is already done. So that is Series. So I’m going to head back to, yeah
we’ll just start all over from the beginning and go nice and clear.
Because the other thing I want you to make sure that you do is look at the
left side, the filters. These are geography, we get that. Subject, subject
terms, we get that. There are restriction types, restricted-use. You can see there’s
almost 1,500 studies that are listed under restricted-use.
We’ll talk again about that in a moment. Data availability, and what you’re
looking at here is, you’ve got member funded and public data. So a little over
half of the data that we hold has been, the funds from the members have been
used to curate these data sets and so these all here, again 5342 today, are
available to individuals who are at member institutions only. What you’ll
see is that you can, they default, all of the things default to search by
alphabetically. You can look at those that are most recently updated. So our
most recent, the most recently just set out there was the Survey of
Russian Elites from Moscow, Russia. You might want to look and say like, “You know
I’m interested in this bibliography, the most cited”. And what you’ll see here is
what we call the Panel Study of Income Dynamics is one of them, the National
Longitudinal Survey of Labor Market, again, is our most most cited. But we’re
going to look at the, it’s known as the PSID, because we all love acronyms
right? And so let’s take a look into the PSID, very cited. In fact, over
the last three years we’ve found, or over its time actually 2,329 related publications to the PSID. So what you are
seeing in front of you, of course the study, when this one was published,
the PIs, that it is indeed a series, you’re seeing its DOI, this is the
persistent URL. And what you are seeing in front of you is the project
description, the data citation that should be used, if you use this data it
should be cited in your bibliography, who funded it, subject terms, geographic
coverage, it goes on to the scope of the project, the time periods, the dates it
was collected, weights that were used, methodology, how was it sampled, who is in
what population was surveyed? Which is critical critical to understand as
you’re interpreting it and any kind of versions. When you see versions sometimes
what will happen is we’ll be given the data, we’ll process, we’ll put it out
there, and sometimes maybe they’ll find that there was a nuance in the data that
is critical enough that it needs to be corrected. And here is when we re-release,
and correct it, and put it out. There’s the version history that will be
provided to you. But all of this goodness in terms of understanding what
the survey is, and how it was conceived, and delivered is called Metadata.
Metadata is painstaking, there’s a person doing this. There is, to date,
not a computer that actually can type in all this information and put it together
in a nice story for you, a data story for you. So this is metadata and this is what
our folks up in the data curation team are doing on a daily basis. They are
putting it in the tab called Data and Documentation so that you can take a
look at things, you can preview them now to see what’s in that codebook,
for example. This is something that’s new, is to take a real quick look at what’s
in there with a preview. My navigation pane here for the webinar is getting kind of
in my space here. You can then go ahead and proceed to download those
files, you can download the documentation, which is a PDF file. And of course that
documentation is going to tell you more about the variables that are in
there, it’ll tell you more about how the study was put together, and it probably
has a questionnaire, for example. Here are the variables related to the PSID. And we talked about those variables,
that they have that information for you, you know, gender, income, this one is 18 or
older in 1983, for example. There’s many many variables,
you can see there’s actually over 31,000 variables in this series.
There is publications and again this is one of those that was heavily
heavily cited. And then you can export the metadata if you so desire. So this is
a sample of when you say, “What does it look like, what are the data,
how do I download it?” I’m going to show you really quickly too the usage report.
So if you’re interested in how it’s been used or who’s using it, this gives you
three years. I’m going to just look at the past year for kicks and you can see that
will change dynamically for you. If you want to print this off you can hit the
print button. This is something new, there are some, there are a few errors on here
that we’re working on. So it’ll continue to evolve as we kind of
get those nuances out like the label isn’t quite fitting into the box. So this
is new and we’re working on those little buggy things. Again, you can see with
this one that over the past year we had over 7,000 or 7,670 downloads. This
is the number that we’re just for documentation and these included some
data with it as well. And this gets very detailed into, because this has lots of
datasets affiliated with it. You can look at who the users were, you can see on
this one that you have a number of graduate students. They by far
outweigh the other groups. Oh I guess by undergrads and faculty users. We have
guest users. These are people that just view. They took a look at the
documentation. They may have come back later in order to download that data. And
you can also take a look at what colleges that currently
downloaded the PSID in the last year. So imagine if you’re a PI or an agency
and you need to give reporting to someone to show that you have great
research impact, you want to understand all the publications, they’re listed here,
so that you can come back again. Even if you’re an individual researcher, or
whether you’re an agency that’s looking for continued funding, or researcher
that’s writing for a grant, you can come back here and get that kind of
information. So it really shows the impact of usage of these data. All right,
and what I want to do is go back to the Find Data, which again is the beginning
of the data catalog, and talk about Online Analysis. Online Analysis is, well
actually I’m sorry about that. Let’s go back one more piece here and talk
about how do you how do you download. Let’s say I’m an SPSS user. What happens
is that I click and I want to download all the files, which will
include the documentation, I’m going to click on Agree here and what’s going to
happen is is it’s going to ask me to log in to my MyData account. Now I am
a member, I’m at a member institution, and so what it’s going to deliver to me
is this zipped file. It’s going to send it to where I tend to put my
files, and then I can open that up, and it’ll have my files that will work
nicely with my SPSS file, and then it has all of the documentation as well. So
that’s it. If I was not a member, it would say, “hey you need to be a member in order
to download these data” or you can contact User Support to talk about how
you might get it as a non-member. We will, we do sell date à la carte. It does get
fairly expensive fairly quickly because right now the going rate for a data
set, we downloaded it, package it up nicely, and deliver it electronically, is
$500 dollar or $550 dollars per data set. But because I
logged in, and I agreed to the Terms of Use, which you saw me read very as I
went through the agreement, we were able to access that without any
impediments. So let’s say that you don’t have an SPSS license or a SAS license and you would like to take a look at some data.
There is what we call Online Analysis. So what I did was I, did that very quickly, I
just clicked on “List the studies for which online analysis is available”. So these
studies have been marked up so that you can go out and take a look at them
without needing to have special software. Let’s look at this 2012 Chicago
Council Survey. What will happen is you take a look at this, you read through all
this wonderful metadata, and you decide “yeah I’d like to take a look in
this”. I’m going to click on Analyze Online. So what happens is you’ll
click here: Survey Data Analysis, you’re going to very carefully read the Terms
of Use and agree to them, It’s saying you are a member so you have
access to this. And what it brings you into is an area where you can use
your variables and run cross tabs. You can run, actually a number of different
types of analyses, you can see up there: correlation matrix, comparison
and means, multiple regression, etc. And you can run these in here. We have a
lot of people that use the online analysis to conduct their analysis and a
lot of them that use it to explore the data to make sure it’s the data that
they want before they proceed on to downloading everything and actually
proceeding with their analysis and reporting, and report
writing. So that’s the Online Analysis and that’s the mechanism that you get to
accessing it. Alright so I think what we will do is move on to just a quick
summary of the ICPSR data catalog. As you saw over 8,300 studies available on
demand. The 1,600 studies that are restricted use. In just a moment
we’ll talk a little bit more about that. We looked at the Online Analysis capabilities. The Catalog. The Catalog does point to a few more studies. I want to say it’s
about a thousand maybe. Maybe closer to 1,200 studies that are… so about
1,200 studies that we don’t have on site. So for example, when things went from…
when they went to on-demand downloading via Internet, what happened was is we no longer grabbed a copy, let’s say from the
UK Data Archive. What we used to do is have them send us their latest stuff and
then we would distribute from here. Kind of like that hub-and-spoke type of
operations, so we make copies and then distribute it out to our members. Because
then you could get it more quickly. But now with the internet, what we will
do is we have metadata on certain studies that are, what we call, the data
set of record. And we point you out there. So if it’s a really nice data set we
point you out to the UK Data Archive and then you work with them in terms of
access because they have the data set of a record. That way if
they make changes or updates [and] they have a second version, we’re not trying to keep
up with their version here. So we send you out, again where we can, to
the data set of record if we don’t physically have it here in the ICPSR
catalog or in the data, I’m sorry, in the data archive. And we talked
about the 40 disciplines. So a note on restricted use, starting with
delivering data. So on delivering data we, what you just saw was on demand. You sign
in, you select your data format, you download it or you analyze it, and
you go. You’re good to go and the majority of our data are,
what we call, delivery on demand. There is a group of these that’s called
restricted use data. And what’s required there is an application, and might
require a data security plan, a data use plan, a project plan, we don’t
judge whether we think the project or the analysis that you’re doing is a
good one or not. What we’re doing is we’re
looking at: do you have the right security in place, what are you going to
do with the data, we need… There are certain things you might have to do with
the data. One of them is typically destroy it when you’re done. If it’s a
situation where you’re actually going to have the micro data in your
possession. There are three different types of access that you might see:
secure download, where we securely download it. It does come over the
internet but it’s in a secure different way than we do publicly. A Virtual
enclave, which means that you’re coming in to our servers to do your analysis
and you can’t take anything out with you. We have to release it, take a look at it,
make sure there are no disclosure risk, and then we send it back out to
you. Or you may have to come here if the data are so sensitive and the
confidentiality concerns are so high, you may need to come to Ann Arbor and visit
our Physical enclave which is in the basement here of our Perry
building. I’m going to quickly go out and show you where you find… Typically
what happens is you’re not going to go out and search for restricted data. What
happens is that you may end up landing on it. So if we go to the catalog and
over to the right, click on the More Information for Restricted-use Data. It’s
going to give you all kinds of information on what that’s about, and how
you apply, and kind of what are the requirements. You can also take a
quick look into our Restricted Holdings. So if I click here, these here are
all restricted-use data. And again, usually you’re not going to go out and
go “hey I want to I want to go try for restricted data because I like
more paperwork and documentation.” What will happen is you’re looking at
something, you come into it and you see kind of this red piece here where it
says “One or more files of this data collection have special restrictions.”
There may be a public data set that you can use, they’ve taken out maybe some of
the fine geographical instances in there so that you can use it in a public
on-demand, click it and go mechanism. Or it may be the case that all of the
the data sets are restricted. It’s going to talk to you about
which one of these are available and are not available, and if
you click on one that is restricted and you decide “yep this is what I really
need” you’re going to be click, gone. You’ll be ported over into our
application area. I’m not going to go into here but this is going to tell
you basically how to get into that, how to apply. It’s going to send you into a
system that will give you the application, the requirements, and give
you the to do’s. You will also ultimately, when we’re dealing with
restricted use, you’re going to be dealing… you’re going to have to talk on the
phone quite probably, and email with personnel over here at ICPSR. And the
folks that are doing that type of user support enjoy that and they will
guide you through to make it as easy a process as it possibly can be. So why are
the data restricted? Once again, if there’s disclosure risk, which is
basically the potential to identify a research subject, somebody in
there, if there’s a lot of that disclosure risk or a fair amount of that, that
risk is higher or medium-high and it’s combined with highly sensitive personal
information. So it’s things that would be… things would be more sensitive,
and more embarrassing, or more dangerous to you than, I don’t know,
tripping out in front of somebody and people kind of pointing and laughing. So
we look at data sensitivity. On the left side you’ll see Potentially
Sensitive, you know illegal behavior that you may be admitting to in a survey, or
use of alcohol and drugs that you may be admitting to. These are
types of things that could impede your insurance, it can impede your
ability to work, things like that. Things that are Probably Not Sensitive
are on the right. The type of car you drive, your voting behavior, your real age,
social media rants for sure, because those are public by definition. They are
not sensitive and people can figure out who you are. They’re really good at the
that these days. So if the things on the left are present
in the survey data, and then there are other things like maybe a combination of
geographical markers in addition to how much you’re earning and your job title
these things could have adverse effects on your potential to earn a
living, for example. Then the data are made restricted-use. We need to know
that this isn’t some fishing expedition, for example. That it is
actually a project by which you need that information in order to
conduct your analysis and that you will be sure that you protect that
information while the data or while that information is in your care. So we’re
heading beyond why the data are unique? Instead of talking to you about
that I always find that the best way is to talk to other folks that have
downloaded our data and ask them what is it, why are our data unique? And these are
the words that they gave us and it’s always best to hear what they have to
say. So we hear a lot about the second bullet, you come here you, get data, it’s
been checked for errors, it’s well marked up so you understand it what it is, and
it has to, I believe, accompany that first bullet. The documentation helps the
interpretation, so you can run it and you can interpret it. And again that’s why we
have the hook here because someone told them it’s not a fishing expedition. Our
data are unique because of the curation. ICPSR approaches data as if it were
an artifact, you know, just like any other museum would. That we organize it, we
describe it, we clean it, we enhance it where we need to, and we preserve it for
public use and also for those now in the future. It’s not an inexpensive
process to do that. But it is, we believe, that’s why we’re here. An
imperative, it’s a mission that we believe here in ICPSR. We
believe that we have to spend the money in order to curate that data for the
future and make sure it’s here for future analysis. A quick comparison, let’s see did I tab out of the, I’m going to head over… So we
saw the, I’m going to bring up that PSID, and that’s a shortcut because I know the
number. I’m going to bring up that PSID, with all of its yummy metadata. And then I’m
going to head over to, what we call, our self- deposit repository. That’s called openICPSR. So when you look at PSID, I’m going to
head down and you’ll see, I think it was economic change. I’m going to go over here
and we’ll find something that was marked by the depositor to be economic
change. And we found ten results in the self-publishing archive. We’re
going to look at, let’s look at China’s Domestic Trade During the Treaty Port
Era. So this is actually nicely marked up. The person who deposited it,
which may or may not be the PI, deposited it themselves, they described it
themselves, they have lots of data in here. They have a Readme PDF. So we’ll
click on that, and we can preview this file, I can preview the PDFs. And
they’ve done, you know, a nice job really. This is actually much more
marked up or gives us much more metadata than most. So they’ve done a good
job. It’s still one of those things that because they’re looking at what
they’re looking at now and this is going to be a bit of a challenge for
someone to come in here and just read through it and run it and go. There will
be a lot of work on it, you know, if all is good, these files… I’m heading
back here, these files will give some type of… This is audio in here, you will
be able to run that in some mechanism, if you have the right
types of technology on your computers. So again, this is one that’s done a good job. The data curation in our world, again is much more
on this, kind of, juicier longer-term preservation. What’s arguable
here is how long we’ll be able to preserve this and, again, whether or
not this audio with the MPEG type of application is going to stay current. So
that is the difference when we talk about the value of curation, is that the
self-deposit it is really what you get. The downloader or the user is it’s you get
what has been deposited. Our collections are unique. We have a number of
collections, I want to say it’s about 20, where they’re funded by
a certain, maybe the OJDP, or the National Institutes on Aging with the NACDA.
So we have a lot of collections where if you’re looking for aging data, data
on the aged, that that’s the place to go. And we all serve, we all exist on the
same infrastructure. So the data come in, we clean them, we conduct curation, and
then they set out there for the data user to come in. So they’re operating on
the same infrastructure, the same search apparatus. But they are just like a
library, we have them segmented into collections. And if you are looking for
that and how to get into these thematic collections, you go to the Find Data, you
click on Thematic Data Collections, and you can find them right there.
So here’s the one where I talked about NACDA, the one on aging. These data are
all about the aging population from various areas and this is
sponsored by the National Institute on Aging. So again that gives you some good
opportunity if you’re exploring our site to understand what’s there and how its
nicely organized for you. Alright, and then User Support. We have a number
of FAQs on our site. You can go out to the ICPSR YouTube channel and there are
endless numbers of videos out there. This one will be out there shortly.
They include tutorials, webinars, and presentations. You can call us or email
us. We are available roughly 8 to 5 Eastern Time. We don’t yet have the
resources for 24/7 Live Support but that’s why we have that YouTube channel
out there. And we also will refer, when we talk about Official
Representatives, refer you out to ORs if you’re local who can also, you know, if
someone needs a lot of help with SPSS we’ll send them to the OR who then can
send them to the person on campus who is known for their SPSS training. And we
also like to have a little bit of humor, in the rare event, with all that we
have, that you don’t find any results in our search, we do tend once in a while find a little gem here that this person really seemed to enjoy. She
found no results but there was a cat to assuage her frustration. Alright so data
was a big one. We’re gonna move to training and education which will cover
the Summer Program, our resources that are designed for classroom use, resources
for students and then you already heard about the webinars on our YouTube
channel. So our training, Summer Program in Quantitative Methods of
Social Research. We bring about a thousand people to the Summer Program
each year. It started today so there are, what we call, four-week sessions, those
are the long ones. You come and you take a fabulous quantitative session,
methods session, for four weeks starting June 25th and then the second [session] starts
on July 23rd. Some people come all eight weeks but there are many many three- and
five-day workshops. If you go to the Summer Program on our site, you can very quickly
find the workshops. Again they started today with Introduction to R. It’s going on in Houston, Texas. Also a Dynamical
Systems Analysis that started in Salt Lake City and then on the 21st, the
Regression Discontinuity Design starts and that’s here in Ann
Arbor. If it doesn’t have a different city on it, then we know that
it’s actually occurring here in Ann Arbor. All kinds of those and I’m going
to talk about one specifically for ORs and administrators here shortly. We also
have resources for classroom use. I’m going to click out to the Learning
Guides, where you’re at is on the ICPSR site. If you click on Teaching and
Learning, you will find Learning Guides. These are guides that are designed for
teaching faculty so that they can bring data and concepts into the classroom.
These are designed really for intro classes, they weren’t designed for upper
statistics or anything like that. They’re really designed to bring them in when
you’re discussing core concepts in sociology or political science or
demography, in those intro classes so that you can get people excited about
using data and analysis without having to teach them SPSS first, or STATA first or those pieces. It’s really all done for the faculty so that they
can use it for one session. It’s not meant to be something that you have to
use and over and over again. Also students can go out and go through
them themselves in order to get more comfortable with those types of topics
and with the introduction to analysis. Another our Help Guides. We had a
request for us to put together, How do you read a journal article? Here’s a
Guide to Interpreting SPSS Output. Giving them some instruction on
citing data, on data citations. And even introductions on Data Management
Plans, so all kinds of learning support there. We have a research paper
competition every fall. You’ll see email saying, “Send in your students’
research!”, or actually they’re due, I think, after the first of the year so
they’re due in January. We love to give out prizes for masters
and undergrad students who have written great papers. And Webinars and Videos.
Again, I can’t I can’t mention, emphasize enough, if you
go out to our YouTube channel there’s a lot of stuff out there I think we need
to prune and move over the things that are not as relevant today
or we’ve replaced them, you know, because we do things over and over again.
But there’s a lot of learning to be done out there on every topic from on data, on
analysis, on ICPSR, on depositing data, on restricted-use data. Some things that go
very very in-depth. And what I think makes us unique in our education and the
services that we provide, is that we provide both hands-on training as well
as self-help. The Summer Program- it’s condensed, it’s intense, it’s hands-on.
People come here, they get immersed in the topic, whether it’s a quantitative
method, or whether it’s data curation, or whether it’s on a specific data set, and
they go go go. It’s for early and advanced professionals, you might be just
starting out or you may be coming for an advanced method training. So that, in
itself, makes it unique. Our materials are on demand for our members.
Most of our learning materials are members only because they were developed.
We utilized the member funding to develop those materials. And then
there are continuous training opportunities via webinar and on-site.
We will be having a Data Fair. It’s fully online, it begins… it’s going to occur
during the week of October 1st and it’s four to five days, depending on how
much content we have, where we’re running webinars basically from 11:00 a.m.
Eastern time to 3:00 or 4:00 p.m. Eastern Time. The days will
have themes. So there might be one day that you are really interested in,
and you can attend them all, and then maybe a few throughout the week. But
again, a big training event for us that we push out to you all. All of
the slides and recordings are posted from this webinar and many more out,
again, on that YouTube channel. We do campus visits. If we’re out at
conferences we may contact you, if we’re travelling somewhere we may contact you
to come and do a workshop. And the workshop can be what
you need. We don’t just come with a prepackaged thing if you need more
information on how to deposit, we have slides and information
and we can go through deposit. If you want to do more searching strategies, we
can do that. We don’t have funding unfortunately just to travel out a whole
lot, but we try and do it when there’s something going on nearby you
and we can either drive or train out there if we’re at another
meeting, for example, a conference for example. And then the ICPSR Meeting
happens every other year, it’s a biennial event, and that will be October 16th
through the 18th in 2019 and we’ve already begun planning for it, at least the
logistics. So those are our training. A few more words on why we’re unique
and I’m going to go very quickly through that because we need to get to ICPSR
Data Management and Deposit options. Hopefully you read that very quickly. I
always hear it’s a good site for one-stop shopping, that they have all the
data that are needed, and that it’s ready to go, and that’s something
that I heard when I joined back here in 2004. Data Management and Deposit Options.
So we have resources, tools, and support for your data management and grant
writing as well as numerous options. The way we support individuals at your
institutions who are conducting primary research is our data can help
them determine need, it can help them in developing questionnaires. You saw
those variables out there, they can understand great ways to use them, they
do need to… some of them are copyrighted but those make clear note and
then they can get the permission to use that, if need be. We have Data
Management Plans for proposals. We will also help you develop a budget line
because that is allowable on your federal, on your
sponsored grants and foundations as well. So we can help you develop what that
budget should be for data sharing and preserving it. We have written numerous,
hundreds, of letters of support along with those grant proposals
when you have worked with data management planning with us. We also can
guide you on writing proper informed consent so that data can be shared.
We have a lot of people who, the mandate is to share the data, but they’re
using old informed consent forms that make it so it makes it difficult to
share that data and that gets them in a bit of trouble, as you can imagine.
There is this PDF Guide out there to Data Preparation and Archiving. We
suggest that you read that or you encourage people who are doing, who’ve
won grants, to read that before they actually start their research collection.
And of course, sharing data and supporting research impact with lots of
numbers. Our data sharing options, actually I should show you real quickly
where that’s located. So if you are looking for Data Management &
Curation, you will go here. If you’re looking specifically for Data Management
Plans, go into the ACCESS area and you’ll see Data Management Plans. There’s also
all kinds of resources and examples out there, so that’s where you go is Data
Management for those. Data Sharing: your deposit options. We looked at openICPSR,
that is self-deposit immediate publishing, you get your persistent ID
which is often the requirement for journal articles or for your funders.
What you deposit is what others get. We are not going to curate that, it will sit
out there and probably age to some extent. And again, if you’ve
documented it well that’s great. We feel like we can document it better by moving
it into a curated area which is our thematic archives, like that
Aging archive or the criminal justice archives. Our funders set the
acquisitions policies, so if you say, “Hey, I have this great aging data set” but
they’re not sure that… it may not meet their requirements, but we do
have a home for it. It can be part of the Members Archive. If you
want, you can pay a fee for the curation. That’s what those budget lines are and
then that data open, not only to the members it’s open
to everyone, but it’s curated and taken care of within ICPSR. We treat it like
it’s a Member deposit, for example. So we advise, we push, we nag
for data curation because we believe it does… it’s the right thing to do. It
increases discoverability, it increases research impact and that’s what
we’re here to do is to increase the research impact. And we talked about how,
actually showed you that, this was in case things weren’t working because that
never happens when you’re presenting, right? So our unique
advantages for those that are looking to deposit, whether it’s agencies or PI’s or
institutions that are looking for a repository, is that research impact. We can deliver the audience that is interested in data, we
can deliver the numbers to show how that data were utilized, as you’ve seen. We
have that wonderful bibliography where we’re constantly looking for citations
or looking for works that are based on the data that are deposited. Talked about
the audience. We offer user support. We do as much as we can on our and so that
users are not needing to contact the research team or the PI to answer
questions. We do a lot of outreach including how to use the data and
instruction. We have the workshop infrastructure. We have a lot of agencies
that come here, they deposit their data here, and then we do wonderful oftentimes
free workshops where people are paid a stipend to come so that they know more
specifically how to use data. And of course, data infinity. We plan that this
data will be here forever. So we had a “wow” moment. Someone talked about
ICPSR being a critical component of our national identity. That
was one of those made my heart skip a beat and so I put it in there. Alright.
A quick couple minutes on where the tools are. User Support- [email protected] this is the fastest way because we have
a ticketing system for you to get help. If you email me and I happen to be in
one of those fabulously wonderful entertaining three to four-hour meetings
and you’re emailing me something very specific about a data set or something,
you have waited three to four fabulous hours and I’m going to have to send it over
to help so I can get you the technical people that you really need to get. So
the fastest way is you get it in there and if it’s a question for me, as
the membership person, they’re going to send it right over to me. And then the thing
is, there’s a counter on there so it’ll nag them until I answer it and
close it down. It’ll nag them to make sure that I don’t miss it. We do lots of email
announcements, well I wouldn’t say lots. We do try to
package them in groups but they will be training information, lots of webinar
invitations, as you know. And we encourage you always always share those.
It is not meant just for you, it is meant for anyone who is interested in
that type of data or that type of analysis and et cetera.
Utilization tools- couple of things, so if you are an OR or DR, you have a credentialed account. The MyData account is how we log in and recognize you. We go by IP authentication and here I’ve logged
in. There is a place and it recognizes me as an OR for ICPSR but I also have some
staff advantages. So you’ll kind of see some things perhaps that you
won’t see on yours. Here is the area where you manage your membership. How to
select a new OR or DR if you, if someone is retiring or leaving, a
handbook on what to do, and how to do, and some FAQ’s. There are things here to
help you promote ICPSR and data sharing. They’re already printed up. You can
download them, and change them however you want, and go out and spread
information on ICPSR and again on data sharing. There’s a view utilization tools,
although I’m going to show you a different way in there. You can look,
update your IP ranges, you can see what your IP ranges are looking at right now.
It will come up just that easily as long as you’re logged into your MyData account. Because if you are the OR, you can request a
personalized webinar for your institution or your department. If you
click on MyData, and this has changed recently, you are going to see some
information on you. You’re going to see a lot of things that are ‘coming soon’, which
will be wonderful. Where you may want to go is to the Report Manager and this is
going to give you options, again there’s more stuff for me because I am a staff
member, it’s going to give you information on your usage, this defaults
to the past year. You can see this is ICPSR staff, and it tells you how many
files that you’ve downloaded, how many studies, how many SDA online datasets
that you’ve accessed, the frequently downloaded studies, and
also the Learning Guides. If you head back over to your reports again you
can go to User Statistics. And this one’s pretty cool because it shows you the
departments that are using it, and it also shows you the types of people.
We have a lot of staff here and so of course we’re
predominantly staffed but typically you’ll see a lot of undergrads, a lot of
faculty, and a lot of graduate students downloading your data. It will even give
you the individuals, if they’ve agreed to share their information with you, it will
give you their names, their departments, and their status. We know that a lot of ORs,
particularly in smaller institutions, will actually see this information and they
will go out and help people and say, “Hey can I help you with this data set?”
Because that’s the type of service that they are providing. You can
also see your Summer Program attendance. It tells you, in here if you have been
funded, or receive stipends, or scholarships, that will be there. We are
not giving ourselves scholarships so it is saying null right now. There was
some stipend some time ago but you can get a look at years past.
This one does have some stipends because they were given professional development
funds for that. So that is, again if you go to anywhere on the ICPSR page, you click on
your account, sign in and you’re going to get your dashboard, your MyData
dashboard. And you, again you have, as an Or or DR, you have that special
extra access. So those are the Official Representative tools. Anytime you see a
lock here, that means that it is credentialed access and you have to be
an OR or DR with that access to obtain that information. Flip back
here real quick and go very quickly and I apologize to the questions. We don’t
have any, I was going to say apologies to the questions but I have a “never mind” in
here. And please define the bibliography and someone wondered if they had
things organized in collections so apparently we answered those questions.
So thank you so much for your attendance. I hope that we enlightened
you about the membership benefits, the uniqueness, what is uniquely ICPSR as
I’ve been calling it. We believe we do offer a little bit
different services in many areas. You know, I think since I still have… I did
want to show you one other thing if you’re here and you are delivering data
set resources within your institution, is there is a training July 30th through
August 3rd, it’s a five-day workshop, there are still openings, I believe. And
this is for anyone who manages or supports local services for ICPSR and
other research data for quantitative analysis. So if you are, some of the
titles we’re seeing are data librarian or someone who’s supporting the types of
things that a data librarian are and you want to learn
more about it, this is a nice gentle introduction. It is delivered by these
three individuals who are, they’ve been official
representatives for ICPSR for a long time. They’re kind, they’re understanding and,
if you need it delivered gently they will do so. So I wanted to give
that information, give them a call out in terms of that piece. If you have any
questions on that course, I’d put you in touch with them but to send me an email
and I’m happy to answer those questions. And once again thank you for attending,
think spring, and have a wonderful wonderful afternoon.

Leave a Reply

Your email address will not be published. Required fields are marked *