Compartmentalized | Documented | Extendible | Reproducible | Robust |
.git
folderThe .git
folder is what has the repository information and tells Git that this is a Git repository. If you want your folder to stop being a Git repository, delete this. Looking at the config
file can often help you solve problems with connecting to a remote repository.
.gitignore
fileAdd files (or folders) here so that they are ignored.
Say you made a change and you need to get rid of that. The temptation (for me) is to jump onto the Git command line and clobber my repository with reset
and revert
commands. Don’t do this. Here are some strategies that are less prone to leaving your code a mess.
No? Easy click on the file in GitHub Desktop, right click and click ‘Discard Changes…’. Note this will take things all the way back to your last commit!! If you have been making a bunch of changes without committing those, then you are out of luck.
Yes? Go to History in the GitHub Desktop window, click on the commit and click ‘Revert’. This will get rid of all the changes that went with that commit. So if you changed multiple files, all those files will be reverted. If you have pushed the changes to GitHub, then you can push the revert and it’ll show up on GitHub too.
Let’s say you made a big multi-file commit and you want to revert one file.
You can do this at the Git command line, but I find that to be a huge time suck and in my early Git days, I sometimes left my repository with a horrible problem that I could not fix and had to completely rebuild my repo. Since I don’t need to be a Git wizard, this is what I do when I want to go ‘back in time’ for a single file.
Assuming you have already pushed the changes up to GitHub
< >
to browse your repo at the state in time where your file was ok.If you have not pushed the changes up to GitHub.
Ok, here’s the Git command to get a single file back. This works whether or not you have pushed to GitHub. The problem with this and why I don’t do it is that I usually need to look at the file. So I am scrolling back through the status of my repo in the past until I find the status that I want. Then I stare a bit and think and think. Then get a coffee and think some more. Then I scroll back through the status of the repo in the past some more and THEN I do the copy and paste. It is rarely the case that I know exactly what commit that I need to get rid of—and even rarer that I want to go completely to a status in the past.
git log
to find the commit hash (the long number)git checkout 1d0f8c2eb4e66db0a7123588ae2fad26a6338303~1 -- ./R/test.R
would reset test.R to one before that commit. This part 1d0f8c2eb4e66db0a7123588ae2fad26a6338303
is the bad commit hash and this part ~1
means what the file was like 1 commit before that.If you accidentally leave off the file name and Git says you have a detached head, use git checkout master
to reattach your head.
GitHub Desktop makes resolving these pretty easy.
hello.R
and show where the conflicts are. You then edit hello.R
in RStudio to fix the conflicts.hello.R
are still there.
hello.R
and fix the conflict. Git won’t have marked it so it might be hard to find.Those using Git in RStudio Merge conflicts are a bit of a disaster in RStudio, and RStudio gives no warning before it mucks up your files. So it you are pushing/pulling from RStudio be sure to practice on some toy merge conflicts before you run into a real one.
If you use a non-GitHub interface (like RStudio, GitLab, jupyter-git, etc), you need to access GitHub using a Personal Access Token not your user password.
When you are asked for you username and password. You do not enter your password! * Go to GitHub.com * Follow instructions for generating a Personal Access Token **Make sure to click the “repo” checkbox for scope. Copy the token and use that for your password.
Time stamps is something that you probably use to see when a file was last changed. Time stamps lose meaning if you use git checkout
for branches or tags. In fact, in the Git-verse, time doesn’t exist since Git workflow is not necessarily linear. I find this very confusing so I purposely work in a linear fashion with Git.
To fix time stamps when you use git checkout
(i.e. switch branches), you can use a post-checkout
file in the .git/hooks
folder.
post-checkout.sample
. Save as post-checkout
#!/bin/sh -e
OS=${OS:-`uname`}
if [ "$OS" = 'Darwin' ]; then
get_touch_time() {
date -r ${unixtime} '+%Y%m%d%H%M.%S'
}
else
get_touch_time() {
date -d @${unixtime} '+%Y%m%d%H%M.%S'
}
fi
# all git files
git ls-tree -r --name-only HEAD > .git_ls-tree_r_name-only_HEAD
# modified git files
git diff --name-only > .git_diff_name-only
# only restore files not modified
comm -2 -3 .git_ls-tree_r_name-only_HEAD .git_diff_name-only | while read filename; do
unixtime=$(git log -1 --format="%at" -- "${filename}")
touchtime=$(get_touch_time)
# echo ${touchtime} "${filename}"
touch -t ${touchtime} "${filename}"
done
rm .git_ls-tree_r_name-only_HEAD .git_diff_name-only
Working with Git from the command line. For most of you, you have no need for this and this is just a good way to mess up your repository. The exception is a few unique cases where you do need to issue a Git command (like git push
first time you need to give RStudio your GitHub login).
How to do a pull requests. I’ll post a video. Most of you won’t need this.
Branches You won’t need branches when starting, in fact I rarely use them and I have over 100 repositories. I only use branches in a public R package that I maintain.