Introduction In the previous post I covered branches and basic merging in Git. In this post I’ll introduce readers to a GitHub repository that we’ll be using for this and future posts to facilitate learning, and then you’ll fork the repository and clone the forked repository to your computer. Since data analysis and management areContinue reading “Learning Data Science”
Author Archives: adubak
Version Control: Branches & Merging
Introduction In the previous post I covered reverting and viewing differences in Git. In this post, the last in a 5 part series on version control, I’ll be instructing readers in using branches and merging in Git. Since version control and data management are best learned by doing, the objective of this and future postsContinue reading “Version Control: Branches & Merging”
Version Control: Reverting & Diffs
Introduction In the previous post I presented detailed instructions for a commonly used workflow for managing files using version control. In this post, the fourth in a 5 part series on version control, I’ll be instructing readers on reverting changes and viewing differences in Git. Since version control and data management are best learned byContinue reading “Version Control: Reverting & Diffs”
Version Control: File Management
Introduction In the previous post I presented a quick start guide for getting up and running with version control. In this post, the third in a 5 part series on version control, I’ll be instructing readers on managing files using Git and GitHub. Since version control and data management are best learned by doing, theContinue reading “Version Control: File Management”
Getting Started with Version Control
Introduction In the previous post I provided an overview of version control for managing computer files. In this post, the second in a 5 part series on version control, I’ll be instructing readers on how to get started with version control. Since version control and data management are best learned by doing, the objective ofContinue reading “Getting Started with Version Control”
Version Control: An Overview
Introduction In the last post I provided an overview of databases and data management systems. This post is the first in a 5 part series on version control. The objective of this post is to provide an overview of version control for managing code. This is in preparation for using version control in future postsContinue reading “Version Control: An Overview”
Databases and Data Management Systems
Introduction In the last post I provided an overview of data management plans: Data Management Plans. In this post I’ll be presenting on databases and data management systems (DBMS), including a discussion about what constitutes a database, the basics of relational databases, and an overview of several popular DMBS software options. What is a Database?Continue reading “Databases and Data Management Systems”
Data Management Plans
“By failing to prepare, you are preparing to fail.” Benjamin Franklin Introduction In the last post I provided an overview of the final stage in the data pipeline: Metadata and Archiving. The first stage in the data pipeline is planning, and I provided an overview of this stage in my third post “The Data Pipeline:Continue reading “Data Management Plans”
The Data Pipeline: Metadata & Archiving
Introduction In the previous post I provided an overview of the analysis and report stage of the data pipeline (Figure 1). In this post I’ll provide an overview of the metadata and archiving stage. At this stage in the pipeline you’re project is mostly completed, and your ready to prepare your data for long-term storage.Continue reading “The Data Pipeline: Metadata & Archiving”
The Data Pipeline: Analysis & Reporting
Introduction In the previous post I provided an overview of the Quality Assurance and Control (QAQC) stage of the data pipeline (Figure 1). In this post I’ll provide an overview of the data Analysis and Reporting stage. By this point you have developed a database system, acquired data, and performed QAQC review on the data.Continue reading “The Data Pipeline: Analysis & Reporting”