Introduction In the previous post we installed pgAdmin and PostgreSQL, and created a database. In this post, Lesson 001 of Learning Data Science, we’ll discuss database schemas and create a new schema in my_first_database. Since data analysis and management are best learned by doing, the objective of this and future posts is to get readersContinue reading “Lesson 001: Creating a Schema”
Category Archives: data management
Creating a PostgreSQL Database
Introduction In the previous post I introduced the learning-data-science GitHub repository, defined forking a repository, and provided instructions for forking the learning-data-science repository and cloning it to your computer. In this post, we’ll install pgAdmin and PostgreSQL, and create a database. Since data analysis and management are best learned by doing, the objective of thisContinue reading “Creating a PostgreSQL Database”
Learning Data Science
Introduction In the previous post I covered branches and basic merging in Git. In this post I’ll introduce readers to a GitHub repository that we’ll be using for this and future posts to facilitate learning, and then you’ll fork the repository and clone the forked repository to your computer. Since data analysis and management areContinue reading “Learning Data Science”
Version Control: Branches & Merging
Introduction In the previous post I covered reverting and viewing differences in Git. In this post, the last in a 5 part series on version control, I’ll be instructing readers in using branches and merging in Git. Since version control and data management are best learned by doing, the objective of this and future postsContinue reading “Version Control: Branches & Merging”
Version Control: Reverting & Diffs
Introduction In the previous post I presented detailed instructions for a commonly used workflow for managing files using version control. In this post, the fourth in a 5 part series on version control, I’ll be instructing readers on reverting changes and viewing differences in Git. Since version control and data management are best learned byContinue reading “Version Control: Reverting & Diffs”
Version Control: File Management
Introduction In the previous post I presented a quick start guide for getting up and running with version control. In this post, the third in a 5 part series on version control, I’ll be instructing readers on managing files using Git and GitHub. Since version control and data management are best learned by doing, theContinue reading “Version Control: File Management”
Getting Started with Version Control
Introduction In the previous post I provided an overview of version control for managing computer files. In this post, the second in a 5 part series on version control, I’ll be instructing readers on how to get started with version control. Since version control and data management are best learned by doing, the objective ofContinue reading “Getting Started with Version Control”
Version Control: An Overview
Introduction In the last post I provided an overview of databases and data management systems. This post is the first in a 5 part series on version control. The objective of this post is to provide an overview of version control for managing code. This is in preparation for using version control in future postsContinue reading “Version Control: An Overview”
Databases and Data Management Systems
Introduction In the last post I provided an overview of data management plans: Data Management Plans. In this post I’ll be presenting on databases and data management systems (DBMS), including a discussion about what constitutes a database, the basics of relational databases, and an overview of several popular DMBS software options. What is a Database?Continue reading “Databases and Data Management Systems”
Elfin…what?
The objective of this blog is to help improve data management in the environmental sciences. To this end, I’ll present an introduction to data management and a database schema model with the intent of moving us towards a more integrated approach to data management in the environmental sciences. The materials presented will be equivalent to that of a graduate level course in data management.