diff options
Diffstat (limited to 'third_party/git/Documentation/user-manual.txt')
-rw-r--r-- | third_party/git/Documentation/user-manual.txt | 4578 |
1 files changed, 0 insertions, 4578 deletions
diff --git a/third_party/git/Documentation/user-manual.txt b/third_party/git/Documentation/user-manual.txt deleted file mode 100644 index fd480b864526..000000000000 --- a/third_party/git/Documentation/user-manual.txt +++ /dev/null @@ -1,4578 +0,0 @@ -= Git User Manual - -Git is a fast distributed revision control system. - -This manual is designed to be readable by someone with basic UNIX -command-line skills, but no previous knowledge of Git. - -<<repositories-and-branches>> and <<exploring-git-history>> explain how -to fetch and study a project using git--read these chapters to learn how -to build and test a particular version of a software project, search for -regressions, and so on. - -People needing to do actual development will also want to read -<<Developing-With-git>> and <<sharing-development>>. - -Further chapters cover more specialized topics. - -Comprehensive reference documentation is available through the man -pages, or linkgit:git-help[1] command. For example, for the command -`git clone <repo>`, you can either use: - ------------------------------------------------- -$ man git-clone ------------------------------------------------- - -or: - ------------------------------------------------- -$ git help clone ------------------------------------------------- - -With the latter, you can use the manual viewer of your choice; see -linkgit:git-help[1] for more information. - -See also <<git-quick-start>> for a brief overview of Git commands, -without any explanation. - -Finally, see <<todo>> for ways that you can help make this manual more -complete. - - -[[repositories-and-branches]] -== Repositories and Branches - -[[how-to-get-a-git-repository]] -=== How to get a Git repository - -It will be useful to have a Git repository to experiment with as you -read this manual. - -The best way to get one is by using the linkgit:git-clone[1] command to -download a copy of an existing repository. If you don't already have a -project in mind, here are some interesting examples: - ------------------------------------------------- - # Git itself (approx. 40MB download): -$ git clone git://git.kernel.org/pub/scm/git/git.git - # the Linux kernel (approx. 640MB download): -$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git ------------------------------------------------- - -The initial clone may be time-consuming for a large project, but you -will only need to clone once. - -The clone command creates a new directory named after the project -(`git` or `linux` in the examples above). After you cd into this -directory, you will see that it contains a copy of the project files, -called the <<def_working_tree,working tree>>, together with a special -top-level directory named `.git`, which contains all the information -about the history of the project. - -[[how-to-check-out]] -=== How to check out a different version of a project - -Git is best thought of as a tool for storing the history of a collection -of files. It stores the history as a compressed collection of -interrelated snapshots of the project's contents. In Git each such -version is called a <<def_commit,commit>>. - -Those snapshots aren't necessarily all arranged in a single line from -oldest to newest; instead, work may simultaneously proceed along -parallel lines of development, called <<def_branch,branches>>, which may -merge and diverge. - -A single Git repository can track development on multiple branches. It -does this by keeping a list of <<def_head,heads>> which reference the -latest commit on each branch; the linkgit:git-branch[1] command shows -you the list of branch heads: - ------------------------------------------------- -$ git branch -* master ------------------------------------------------- - -A freshly cloned repository contains a single branch head, by default -named "master", with the working directory initialized to the state of -the project referred to by that branch head. - -Most projects also use <<def_tag,tags>>. Tags, like heads, are -references into the project's history, and can be listed using the -linkgit:git-tag[1] command: - ------------------------------------------------- -$ git tag -l -v2.6.11 -v2.6.11-tree -v2.6.12 -v2.6.12-rc2 -v2.6.12-rc3 -v2.6.12-rc4 -v2.6.12-rc5 -v2.6.12-rc6 -v2.6.13 -... ------------------------------------------------- - -Tags are expected to always point at the same version of a project, -while heads are expected to advance as development progresses. - -Create a new branch head pointing to one of these versions and check it -out using linkgit:git-switch[1]: - ------------------------------------------------- -$ git switch -c new v2.6.13 ------------------------------------------------- - -The working directory then reflects the contents that the project had -when it was tagged v2.6.13, and linkgit:git-branch[1] shows two -branches, with an asterisk marking the currently checked-out branch: - ------------------------------------------------- -$ git branch - master -* new ------------------------------------------------- - -If you decide that you'd rather see version 2.6.17, you can modify -the current branch to point at v2.6.17 instead, with - ------------------------------------------------- -$ git reset --hard v2.6.17 ------------------------------------------------- - -Note that if the current branch head was your only reference to a -particular point in history, then resetting that branch may leave you -with no way to find the history it used to point to; so use this command -carefully. - -[[understanding-commits]] -=== Understanding History: Commits - -Every change in the history of a project is represented by a commit. -The linkgit:git-show[1] command shows the most recent commit on the -current branch: - ------------------------------------------------- -$ git show -commit 17cf781661e6d38f737f15f53ab552f1e95960d7 -Author: Linus Torvalds <torvalds@ppc970.osdl.org.(none)> -Date: Tue Apr 19 14:11:06 2005 -0700 - - Remove duplicate getenv(DB_ENVIRONMENT) call - - Noted by Tony Luck. - -diff --git a/init-db.c b/init-db.c -index 65898fa..b002dc6 100644 ---- a/init-db.c -+++ b/init-db.c -@@ -7,7 +7,7 @@ - - int main(int argc, char **argv) - { -- char *sha1_dir = getenv(DB_ENVIRONMENT), *path; -+ char *sha1_dir, *path; - int len, i; - - if (mkdir(".git", 0755) < 0) { ------------------------------------------------- - -As you can see, a commit shows who made the latest change, what they -did, and why. - -Every commit has a 40-hexdigit id, sometimes called the "object name" or the -"SHA-1 id", shown on the first line of the `git show` output. You can usually -refer to a commit by a shorter name, such as a tag or a branch name, but this -longer name can also be useful. Most importantly, it is a globally unique -name for this commit: so if you tell somebody else the object name (for -example in email), then you are guaranteed that name will refer to the same -commit in their repository that it does in yours (assuming their repository -has that commit at all). Since the object name is computed as a hash over the -contents of the commit, you are guaranteed that the commit can never change -without its name also changing. - -In fact, in <<git-concepts>> we shall see that everything stored in Git -history, including file data and directory contents, is stored in an object -with a name that is a hash of its contents. - -[[understanding-reachability]] -==== Understanding history: commits, parents, and reachability - -Every commit (except the very first commit in a project) also has a -parent commit which shows what happened before this commit. -Following the chain of parents will eventually take you back to the -beginning of the project. - -However, the commits do not form a simple list; Git allows lines of -development to diverge and then reconverge, and the point where two -lines of development reconverge is called a "merge". The commit -representing a merge can therefore have more than one parent, with -each parent representing the most recent commit on one of the lines -of development leading to that point. - -The best way to see how this works is using the linkgit:gitk[1] -command; running gitk now on a Git repository and looking for merge -commits will help understand how Git organizes history. - -In the following, we say that commit X is "reachable" from commit Y -if commit X is an ancestor of commit Y. Equivalently, you could say -that Y is a descendant of X, or that there is a chain of parents -leading from commit Y to commit X. - -[[history-diagrams]] -==== Understanding history: History diagrams - -We will sometimes represent Git history using diagrams like the one -below. Commits are shown as "o", and the links between them with -lines drawn with - / and \. Time goes left to right: - - -................................................ - o--o--o <-- Branch A - / - o--o--o <-- master - \ - o--o--o <-- Branch B -................................................ - -If we need to talk about a particular commit, the character "o" may -be replaced with another letter or number. - -[[what-is-a-branch]] -==== Understanding history: What is a branch? - -When we need to be precise, we will use the word "branch" to mean a line -of development, and "branch head" (or just "head") to mean a reference -to the most recent commit on a branch. In the example above, the branch -head named "A" is a pointer to one particular commit, but we refer to -the line of three commits leading up to that point as all being part of -"branch A". - -However, when no confusion will result, we often just use the term -"branch" both for branches and for branch heads. - -[[manipulating-branches]] -=== Manipulating branches - -Creating, deleting, and modifying branches is quick and easy; here's -a summary of the commands: - -`git branch`:: - list all branches. -`git branch <branch>`:: - create a new branch named `<branch>`, referencing the same - point in history as the current branch. -`git branch <branch> <start-point>`:: - create a new branch named `<branch>`, referencing - `<start-point>`, which may be specified any way you like, - including using a branch name or a tag name. -`git branch -d <branch>`:: - delete the branch `<branch>`; if the branch is not fully - merged in its upstream branch or contained in the current branch, - this command will fail with a warning. -`git branch -D <branch>`:: - delete the branch `<branch>` irrespective of its merged status. -`git switch <branch>`:: - make the current branch `<branch>`, updating the working - directory to reflect the version referenced by `<branch>`. -`git switch -c <new> <start-point>`:: - create a new branch `<new>` referencing `<start-point>`, and - check it out. - -The special symbol "HEAD" can always be used to refer to the current -branch. In fact, Git uses a file named `HEAD` in the `.git` directory -to remember which branch is current: - ------------------------------------------------- -$ cat .git/HEAD -ref: refs/heads/master ------------------------------------------------- - -[[detached-head]] -=== Examining an old version without creating a new branch - -The `git switch` command normally expects a branch head, but will also -accept an arbitrary commit when invoked with --detach; for example, -you can check out the commit referenced by a tag: - ------------------------------------------------- -$ git switch --detach v2.6.17 -Note: checking out 'v2.6.17'. - -You are in 'detached HEAD' state. You can look around, make experimental -changes and commit them, and you can discard any commits you make in this -state without impacting any branches by performing another switch. - -If you want to create a new branch to retain commits you create, you may -do so (now or later) by using -c with the switch command again. Example: - - git switch -c new_branch_name - -HEAD is now at 427abfa Linux v2.6.17 ------------------------------------------------- - -The HEAD then refers to the SHA-1 of the commit instead of to a branch, -and git branch shows that you are no longer on a branch: - ------------------------------------------------- -$ cat .git/HEAD -427abfa28afedffadfca9dd8b067eb6d36bac53f -$ git branch -* (detached from v2.6.17) - master ------------------------------------------------- - -In this case we say that the HEAD is "detached". - -This is an easy way to check out a particular version without having to -make up a name for the new branch. You can still create a new branch -(or tag) for this version later if you decide to. - -[[examining-remote-branches]] -=== Examining branches from a remote repository - -The "master" branch that was created at the time you cloned is a copy -of the HEAD in the repository that you cloned from. That repository -may also have had other branches, though, and your local repository -keeps branches which track each of those remote branches, called -remote-tracking branches, which you -can view using the `-r` option to linkgit:git-branch[1]: - ------------------------------------------------- -$ git branch -r - origin/HEAD - origin/html - origin/maint - origin/man - origin/master - origin/next - origin/seen - origin/todo ------------------------------------------------- - -In this example, "origin" is called a remote repository, or "remote" -for short. The branches of this repository are called "remote -branches" from our point of view. The remote-tracking branches listed -above were created based on the remote branches at clone time and will -be updated by `git fetch` (hence `git pull`) and `git push`. See -<<Updating-a-repository-With-git-fetch>> for details. - -You might want to build on one of these remote-tracking branches -on a branch of your own, just as you would for a tag: - ------------------------------------------------- -$ git switch -c my-todo-copy origin/todo ------------------------------------------------- - -You can also check out `origin/todo` directly to examine it or -write a one-off patch. See <<detached-head,detached head>>. - -Note that the name "origin" is just the name that Git uses by default -to refer to the repository that you cloned from. - -[[how-git-stores-references]] -=== Naming branches, tags, and other references - -Branches, remote-tracking branches, and tags are all references to -commits. All references are named with a slash-separated path name -starting with `refs`; the names we've been using so far are actually -shorthand: - - - The branch `test` is short for `refs/heads/test`. - - The tag `v2.6.18` is short for `refs/tags/v2.6.18`. - - `origin/master` is short for `refs/remotes/origin/master`. - -The full name is occasionally useful if, for example, there ever -exists a tag and a branch with the same name. - -(Newly created refs are actually stored in the `.git/refs` directory, -under the path given by their name. However, for efficiency reasons -they may also be packed together in a single file; see -linkgit:git-pack-refs[1]). - -As another useful shortcut, the "HEAD" of a repository can be referred -to just using the name of that repository. So, for example, "origin" -is usually a shortcut for the HEAD branch in the repository "origin". - -For the complete list of paths which Git checks for references, and -the order it uses to decide which to choose when there are multiple -references with the same shorthand name, see the "SPECIFYING -REVISIONS" section of linkgit:gitrevisions[7]. - -[[Updating-a-repository-With-git-fetch]] -=== Updating a repository with git fetch - -After you clone a repository and commit a few changes of your own, you -may wish to check the original repository for updates. - -The `git-fetch` command, with no arguments, will update all of the -remote-tracking branches to the latest version found in the original -repository. It will not touch any of your own branches--not even the -"master" branch that was created for you on clone. - -[[fetching-branches]] -=== Fetching branches from other repositories - -You can also track branches from repositories other than the one you -cloned from, using linkgit:git-remote[1]: - -------------------------------------------------- -$ git remote add staging git://git.kernel.org/.../gregkh/staging.git -$ git fetch staging -... -From git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/staging - * [new branch] master -> staging/master - * [new branch] staging-linus -> staging/staging-linus - * [new branch] staging-next -> staging/staging-next -------------------------------------------------- - -New remote-tracking branches will be stored under the shorthand name -that you gave `git remote add`, in this case `staging`: - -------------------------------------------------- -$ git branch -r - origin/HEAD -> origin/master - origin/master - staging/master - staging/staging-linus - staging/staging-next -------------------------------------------------- - -If you run `git fetch <remote>` later, the remote-tracking branches -for the named `<remote>` will be updated. - -If you examine the file `.git/config`, you will see that Git has added -a new stanza: - -------------------------------------------------- -$ cat .git/config -... -[remote "staging"] - url = git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/staging.git - fetch = +refs/heads/*:refs/remotes/staging/* -... -------------------------------------------------- - -This is what causes Git to track the remote's branches; you may modify -or delete these configuration options by editing `.git/config` with a -text editor. (See the "CONFIGURATION FILE" section of -linkgit:git-config[1] for details.) - -[[exploring-git-history]] -== Exploring Git history - -Git is best thought of as a tool for storing the history of a -collection of files. It does this by storing compressed snapshots of -the contents of a file hierarchy, together with "commits" which show -the relationships between these snapshots. - -Git provides extremely flexible and fast tools for exploring the -history of a project. - -We start with one specialized tool that is useful for finding the -commit that introduced a bug into a project. - -[[using-bisect]] -=== How to use bisect to find a regression - -Suppose version 2.6.18 of your project worked, but the version at -"master" crashes. Sometimes the best way to find the cause of such a -regression is to perform a brute-force search through the project's -history to find the particular commit that caused the problem. The -linkgit:git-bisect[1] command can help you do this: - -------------------------------------------------- -$ git bisect start -$ git bisect good v2.6.18 -$ git bisect bad master -Bisecting: 3537 revisions left to test after this -[65934a9a028b88e83e2b0f8b36618fe503349f8e] BLOCK: Make USB storage depend on SCSI rather than selecting it [try #6] -------------------------------------------------- - -If you run `git branch` at this point, you'll see that Git has -temporarily moved you in "(no branch)". HEAD is now detached from any -branch and points directly to a commit (with commit id 65934) that -is reachable from "master" but not from v2.6.18. Compile and test it, -and see whether it crashes. Assume it does crash. Then: - -------------------------------------------------- -$ git bisect bad -Bisecting: 1769 revisions left to test after this -[7eff82c8b1511017ae605f0c99ac275a7e21b867] i2c-core: Drop useless bitmaskings -------------------------------------------------- - -checks out an older version. Continue like this, telling Git at each -stage whether the version it gives you is good or bad, and notice -that the number of revisions left to test is cut approximately in -half each time. - -After about 13 tests (in this case), it will output the commit id of -the guilty commit. You can then examine the commit with -linkgit:git-show[1], find out who wrote it, and mail them your bug -report with the commit id. Finally, run - -------------------------------------------------- -$ git bisect reset -------------------------------------------------- - -to return you to the branch you were on before. - -Note that the version which `git bisect` checks out for you at each -point is just a suggestion, and you're free to try a different -version if you think it would be a good idea. For example, -occasionally you may land on a commit that broke something unrelated; -run - -------------------------------------------------- -$ git bisect visualize -------------------------------------------------- - -which will run gitk and label the commit it chose with a marker that -says "bisect". Choose a safe-looking commit nearby, note its commit -id, and check it out with: - -------------------------------------------------- -$ git reset --hard fb47ddb2db -------------------------------------------------- - -then test, run `bisect good` or `bisect bad` as appropriate, and -continue. - -Instead of `git bisect visualize` and then `git reset --hard -fb47ddb2db`, you might just want to tell Git that you want to skip -the current commit: - -------------------------------------------------- -$ git bisect skip -------------------------------------------------- - -In this case, though, Git may not eventually be able to tell the first -bad one between some first skipped commits and a later bad commit. - -There are also ways to automate the bisecting process if you have a -test script that can tell a good from a bad commit. See -linkgit:git-bisect[1] for more information about this and other `git -bisect` features. - -[[naming-commits]] -=== Naming commits - -We have seen several ways of naming commits already: - - - 40-hexdigit object name - - branch name: refers to the commit at the head of the given - branch - - tag name: refers to the commit pointed to by the given tag - (we've seen branches and tags are special cases of - <<how-git-stores-references,references>>). - - HEAD: refers to the head of the current branch - -There are many more; see the "SPECIFYING REVISIONS" section of the -linkgit:gitrevisions[7] man page for the complete list of ways to -name revisions. Some examples: - -------------------------------------------------- -$ git show fb47ddb2 # the first few characters of the object name - # are usually enough to specify it uniquely -$ git show HEAD^ # the parent of the HEAD commit -$ git show HEAD^^ # the grandparent -$ git show HEAD~4 # the great-great-grandparent -------------------------------------------------- - -Recall that merge commits may have more than one parent; by default, -`^` and `~` follow the first parent listed in the commit, but you can -also choose: - -------------------------------------------------- -$ git show HEAD^1 # show the first parent of HEAD -$ git show HEAD^2 # show the second parent of HEAD -------------------------------------------------- - -In addition to HEAD, there are several other special names for -commits: - -Merges (to be discussed later), as well as operations such as -`git reset`, which change the currently checked-out commit, generally -set ORIG_HEAD to the value HEAD had before the current operation. - -The `git fetch` operation always stores the head of the last fetched -branch in FETCH_HEAD. For example, if you run `git fetch` without -specifying a local branch as the target of the operation - -------------------------------------------------- -$ git fetch git://example.com/proj.git theirbranch -------------------------------------------------- - -the fetched commits will still be available from FETCH_HEAD. - -When we discuss merges we'll also see the special name MERGE_HEAD, -which refers to the other branch that we're merging in to the current -branch. - -The linkgit:git-rev-parse[1] command is a low-level command that is -occasionally useful for translating some name for a commit to the object -name for that commit: - -------------------------------------------------- -$ git rev-parse origin -e05db0fd4f31dde7005f075a84f96b360d05984b -------------------------------------------------- - -[[creating-tags]] -=== Creating tags - -We can also create a tag to refer to a particular commit; after -running - -------------------------------------------------- -$ git tag stable-1 1b2e1d63ff -------------------------------------------------- - -You can use `stable-1` to refer to the commit 1b2e1d63ff. - -This creates a "lightweight" tag. If you would also like to include a -comment with the tag, and possibly sign it cryptographically, then you -should create a tag object instead; see the linkgit:git-tag[1] man page -for details. - -[[browsing-revisions]] -=== Browsing revisions - -The linkgit:git-log[1] command can show lists of commits. On its -own, it shows all commits reachable from the parent commit; but you -can also make more specific requests: - -------------------------------------------------- -$ git log v2.5.. # commits since (not reachable from) v2.5 -$ git log test..master # commits reachable from master but not test -$ git log master..test # ...reachable from test but not master -$ git log master...test # ...reachable from either test or master, - # but not both -$ git log --since="2 weeks ago" # commits from the last 2 weeks -$ git log Makefile # commits which modify Makefile -$ git log fs/ # ... which modify any file under fs/ -$ git log -S'foo()' # commits which add or remove any file data - # matching the string 'foo()' -------------------------------------------------- - -And of course you can combine all of these; the following finds -commits since v2.5 which touch the `Makefile` or any file under `fs`: - -------------------------------------------------- -$ git log v2.5.. Makefile fs/ -------------------------------------------------- - -You can also ask git log to show patches: - -------------------------------------------------- -$ git log -p -------------------------------------------------- - -See the `--pretty` option in the linkgit:git-log[1] man page for more -display options. - -Note that git log starts with the most recent commit and works -backwards through the parents; however, since Git history can contain -multiple independent lines of development, the particular order that -commits are listed in may be somewhat arbitrary. - -[[generating-diffs]] -=== Generating diffs - -You can generate diffs between any two versions using -linkgit:git-diff[1]: - -------------------------------------------------- -$ git diff master..test -------------------------------------------------- - -That will produce the diff between the tips of the two branches. If -you'd prefer to find the diff from their common ancestor to test, you -can use three dots instead of two: - -------------------------------------------------- -$ git diff master...test -------------------------------------------------- - -Sometimes what you want instead is a set of patches; for this you can -use linkgit:git-format-patch[1]: - -------------------------------------------------- -$ git format-patch master..test -------------------------------------------------- - -will generate a file with a patch for each commit reachable from test -but not from master. - -[[viewing-old-file-versions]] -=== Viewing old file versions - -You can always view an old version of a file by just checking out the -correct revision first. But sometimes it is more convenient to be -able to view an old version of a single file without checking -anything out; this command does that: - -------------------------------------------------- -$ git show v2.5:fs/locks.c -------------------------------------------------- - -Before the colon may be anything that names a commit, and after it -may be any path to a file tracked by Git. - -[[history-examples]] -=== Examples - -[[counting-commits-on-a-branch]] -==== Counting the number of commits on a branch - -Suppose you want to know how many commits you've made on `mybranch` -since it diverged from `origin`: - -------------------------------------------------- -$ git log --pretty=oneline origin..mybranch | wc -l -------------------------------------------------- - -Alternatively, you may often see this sort of thing done with the -lower-level command linkgit:git-rev-list[1], which just lists the SHA-1's -of all the given commits: - -------------------------------------------------- -$ git rev-list origin..mybranch | wc -l -------------------------------------------------- - -[[checking-for-equal-branches]] -==== Check whether two branches point at the same history - -Suppose you want to check whether two branches point at the same point -in history. - -------------------------------------------------- -$ git diff origin..master -------------------------------------------------- - -will tell you whether the contents of the project are the same at the -two branches; in theory, however, it's possible that the same project -contents could have been arrived at by two different historical -routes. You could compare the object names: - -------------------------------------------------- -$ git rev-list origin -e05db0fd4f31dde7005f075a84f96b360d05984b -$ git rev-list master -e05db0fd4f31dde7005f075a84f96b360d05984b -------------------------------------------------- - -Or you could recall that the `...` operator selects all commits -reachable from either one reference or the other but not -both; so - -------------------------------------------------- -$ git log origin...master -------------------------------------------------- - -will return no commits when the two branches are equal. - -[[finding-tagged-descendants]] -==== Find first tagged version including a given fix - -Suppose you know that the commit e05db0fd fixed a certain problem. -You'd like to find the earliest tagged release that contains that -fix. - -Of course, there may be more than one answer--if the history branched -after commit e05db0fd, then there could be multiple "earliest" tagged -releases. - -You could just visually inspect the commits since e05db0fd: - -------------------------------------------------- -$ gitk e05db0fd.. -------------------------------------------------- - -or you can use linkgit:git-name-rev[1], which will give the commit a -name based on any tag it finds pointing to one of the commit's -descendants: - -------------------------------------------------- -$ git name-rev --tags e05db0fd -e05db0fd tags/v1.5.0-rc1^0~23 -------------------------------------------------- - -The linkgit:git-describe[1] command does the opposite, naming the -revision using a tag on which the given commit is based: - -------------------------------------------------- -$ git describe e05db0fd -v1.5.0-rc0-260-ge05db0f -------------------------------------------------- - -but that may sometimes help you guess which tags might come after the -given commit. - -If you just want to verify whether a given tagged version contains a -given commit, you could use linkgit:git-merge-base[1]: - -------------------------------------------------- -$ git merge-base e05db0fd v1.5.0-rc1 -e05db0fd4f31dde7005f075a84f96b360d05984b -------------------------------------------------- - -The merge-base command finds a common ancestor of the given commits, -and always returns one or the other in the case where one is a -descendant of the other; so the above output shows that e05db0fd -actually is an ancestor of v1.5.0-rc1. - -Alternatively, note that - -------------------------------------------------- -$ git log v1.5.0-rc1..e05db0fd -------------------------------------------------- - -will produce empty output if and only if v1.5.0-rc1 includes e05db0fd, -because it outputs only commits that are not reachable from v1.5.0-rc1. - -As yet another alternative, the linkgit:git-show-branch[1] command lists -the commits reachable from its arguments with a display on the left-hand -side that indicates which arguments that commit is reachable from. -So, if you run something like - -------------------------------------------------- -$ git show-branch e05db0fd v1.5.0-rc0 v1.5.0-rc1 v1.5.0-rc2 -! [e05db0fd] Fix warnings in sha1_file.c - use C99 printf format if -available - ! [v1.5.0-rc0] GIT v1.5.0 preview - ! [v1.5.0-rc1] GIT v1.5.0-rc1 - ! [v1.5.0-rc2] GIT v1.5.0-rc2 -... -------------------------------------------------- - -then a line like - -------------------------------------------------- -+ ++ [e05db0fd] Fix warnings in sha1_file.c - use C99 printf format if -available -------------------------------------------------- - -shows that e05db0fd is reachable from itself, from v1.5.0-rc1, -and from v1.5.0-rc2, and not from v1.5.0-rc0. - -[[showing-commits-unique-to-a-branch]] -==== Showing commits unique to a given branch - -Suppose you would like to see all the commits reachable from the branch -head named `master` but not from any other head in your repository. - -We can list all the heads in this repository with -linkgit:git-show-ref[1]: - -------------------------------------------------- -$ git show-ref --heads -bf62196b5e363d73353a9dcf094c59595f3153b7 refs/heads/core-tutorial -db768d5504c1bb46f63ee9d6e1772bd047e05bf9 refs/heads/maint -a07157ac624b2524a059a3414e99f6f44bebc1e7 refs/heads/master -24dbc180ea14dc1aebe09f14c8ecf32010690627 refs/heads/tutorial-2 -1e87486ae06626c2f31eaa63d26fc0fd646c8af2 refs/heads/tutorial-fixes -------------------------------------------------- - -We can get just the branch-head names, and remove `master`, with -the help of the standard utilities cut and grep: - -------------------------------------------------- -$ git show-ref --heads | cut -d' ' -f2 | grep -v '^refs/heads/master' -refs/heads/core-tutorial -refs/heads/maint -refs/heads/tutorial-2 -refs/heads/tutorial-fixes -------------------------------------------------- - -And then we can ask to see all the commits reachable from master -but not from these other heads: - -------------------------------------------------- -$ gitk master --not $( git show-ref --heads | cut -d' ' -f2 | - grep -v '^refs/heads/master' ) -------------------------------------------------- - -Obviously, endless variations are possible; for example, to see all -commits reachable from some head but not from any tag in the repository: - -------------------------------------------------- -$ gitk $( git show-ref --heads ) --not $( git show-ref --tags ) -------------------------------------------------- - -(See linkgit:gitrevisions[7] for explanations of commit-selecting -syntax such as `--not`.) - -[[making-a-release]] -==== Creating a changelog and tarball for a software release - -The linkgit:git-archive[1] command can create a tar or zip archive from -any version of a project; for example: - -------------------------------------------------- -$ git archive -o latest.tar.gz --prefix=project/ HEAD -------------------------------------------------- - -will use HEAD to produce a gzipped tar archive in which each filename -is preceded by `project/`. The output file format is inferred from -the output file extension if possible, see linkgit:git-archive[1] for -details. - -Versions of Git older than 1.7.7 don't know about the `tar.gz` format, -you'll need to use gzip explicitly: - -------------------------------------------------- -$ git archive --format=tar --prefix=project/ HEAD | gzip >latest.tar.gz -------------------------------------------------- - -If you're releasing a new version of a software project, you may want -to simultaneously make a changelog to include in the release -announcement. - -Linus Torvalds, for example, makes new kernel releases by tagging them, -then running: - -------------------------------------------------- -$ release-script 2.6.12 2.6.13-rc6 2.6.13-rc7 -------------------------------------------------- - -where release-script is a shell script that looks like: - -------------------------------------------------- -#!/bin/sh -stable="$1" -last="$2" -new="$3" -echo "# git tag v$new" -echo "git archive --prefix=linux-$new/ v$new | gzip -9 > ../linux-$new.tar.gz" -echo "git diff v$stable v$new | gzip -9 > ../patch-$new.gz" -echo "git log --no-merges v$new ^v$last > ../ChangeLog-$new" -echo "git shortlog --no-merges v$new ^v$last > ../ShortLog" -echo "git diff --stat --summary -M v$last v$new > ../diffstat-$new" -------------------------------------------------- - -and then he just cut-and-pastes the output commands after verifying that -they look OK. - -[[Finding-commits-With-given-Content]] -==== Finding commits referencing a file with given content - -Somebody hands you a copy of a file, and asks which commits modified a -file such that it contained the given content either before or after the -commit. You can find out with this: - -------------------------------------------------- -$ git log --raw --abbrev=40 --pretty=oneline | - grep -B 1 `git hash-object filename` -------------------------------------------------- - -Figuring out why this works is left as an exercise to the (advanced) -student. The linkgit:git-log[1], linkgit:git-diff-tree[1], and -linkgit:git-hash-object[1] man pages may prove helpful. - -[[Developing-With-git]] -== Developing with Git - -[[telling-git-your-name]] -=== Telling Git your name - -Before creating any commits, you should introduce yourself to Git. -The easiest way to do so is to use linkgit:git-config[1]: - ------------------------------------------------- -$ git config --global user.name 'Your Name Comes Here' -$ git config --global user.email 'you@yourdomain.example.com' ------------------------------------------------- - -Which will add the following to a file named `.gitconfig` in your -home directory: - ------------------------------------------------- -[user] - name = Your Name Comes Here - email = you@yourdomain.example.com ------------------------------------------------- - -See the "CONFIGURATION FILE" section of linkgit:git-config[1] for -details on the configuration file. The file is plain text, so you can -also edit it with your favorite editor. - - -[[creating-a-new-repository]] -=== Creating a new repository - -Creating a new repository from scratch is very easy: - -------------------------------------------------- -$ mkdir project -$ cd project -$ git init -------------------------------------------------- - -If you have some initial content (say, a tarball): - -------------------------------------------------- -$ tar xzvf project.tar.gz -$ cd project -$ git init -$ git add . # include everything below ./ in the first commit: -$ git commit -------------------------------------------------- - -[[how-to-make-a-commit]] -=== How to make a commit - -Creating a new commit takes three steps: - - 1. Making some changes to the working directory using your - favorite editor. - 2. Telling Git about your changes. - 3. Creating the commit using the content you told Git about - in step 2. - -In practice, you can interleave and repeat steps 1 and 2 as many -times as you want: in order to keep track of what you want committed -at step 3, Git maintains a snapshot of the tree's contents in a -special staging area called "the index." - -At the beginning, the content of the index will be identical to -that of the HEAD. The command `git diff --cached`, which shows -the difference between the HEAD and the index, should therefore -produce no output at that point. - -Modifying the index is easy: - -To update the index with the contents of a new or modified file, use - -------------------------------------------------- -$ git add path/to/file -------------------------------------------------- - -To remove a file from the index and from the working tree, use - -------------------------------------------------- -$ git rm path/to/file -------------------------------------------------- - -After each step you can verify that - -------------------------------------------------- -$ git diff --cached -------------------------------------------------- - -always shows the difference between the HEAD and the index file--this -is what you'd commit if you created the commit now--and that - -------------------------------------------------- -$ git diff -------------------------------------------------- - -shows the difference between the working tree and the index file. - -Note that `git add` always adds just the current contents of a file -to the index; further changes to the same file will be ignored unless -you run `git add` on the file again. - -When you're ready, just run - -------------------------------------------------- -$ git commit -------------------------------------------------- - -and Git will prompt you for a commit message and then create the new -commit. Check to make sure it looks like what you expected with - -------------------------------------------------- -$ git show -------------------------------------------------- - -As a special shortcut, - -------------------------------------------------- -$ git commit -a -------------------------------------------------- - -will update the index with any files that you've modified or removed -and create a commit, all in one step. - -A number of commands are useful for keeping track of what you're -about to commit: - -------------------------------------------------- -$ git diff --cached # difference between HEAD and the index; what - # would be committed if you ran "commit" now. -$ git diff # difference between the index file and your - # working directory; changes that would not - # be included if you ran "commit" now. -$ git diff HEAD # difference between HEAD and working tree; what - # would be committed if you ran "commit -a" now. -$ git status # a brief per-file summary of the above. -------------------------------------------------- - -You can also use linkgit:git-gui[1] to create commits, view changes in -the index and the working tree files, and individually select diff hunks -for inclusion in the index (by right-clicking on the diff hunk and -choosing "Stage Hunk For Commit"). - -[[creating-good-commit-messages]] -=== Creating good commit messages - -Though not required, it's a good idea to begin the commit message -with a single short (less than 50 character) line summarizing the -change, followed by a blank line and then a more thorough -description. The text up to the first blank line in a commit -message is treated as the commit title, and that title is used -throughout Git. For example, linkgit:git-format-patch[1] turns a -commit into email, and it uses the title on the Subject line and the -rest of the commit in the body. - - -[[ignoring-files]] -=== Ignoring files - -A project will often generate files that you do 'not' want to track with Git. -This typically includes files generated by a build process or temporary -backup files made by your editor. Of course, 'not' tracking files with Git -is just a matter of 'not' calling `git add` on them. But it quickly becomes -annoying to have these untracked files lying around; e.g. they make -`git add .` practically useless, and they keep showing up in the output of -`git status`. - -You can tell Git to ignore certain files by creating a file called -`.gitignore` in the top level of your working directory, with contents -such as: - -------------------------------------------------- -# Lines starting with '#' are considered comments. -# Ignore any file named foo.txt. -foo.txt -# Ignore (generated) html files, -*.html -# except foo.html which is maintained by hand. -!foo.html -# Ignore objects and archives. -*.[oa] -------------------------------------------------- - -See linkgit:gitignore[5] for a detailed explanation of the syntax. You can -also place .gitignore files in other directories in your working tree, and they -will apply to those directories and their subdirectories. The `.gitignore` -files can be added to your repository like any other files (just run `git add -.gitignore` and `git commit`, as usual), which is convenient when the exclude -patterns (such as patterns matching build output files) would also make sense -for other users who clone your repository. - -If you wish the exclude patterns to affect only certain repositories -(instead of every repository for a given project), you may instead put -them in a file in your repository named `.git/info/exclude`, or in any -file specified by the `core.excludesFile` configuration variable. -Some Git commands can also take exclude patterns directly on the -command line. See linkgit:gitignore[5] for the details. - -[[how-to-merge]] -=== How to merge - -You can rejoin two diverging branches of development using -linkgit:git-merge[1]: - -------------------------------------------------- -$ git merge branchname -------------------------------------------------- - -merges the development in the branch `branchname` into the current -branch. - -A merge is made by combining the changes made in `branchname` and the -changes made up to the latest commit in your current branch since -their histories forked. The work tree is overwritten by the result of -the merge when this combining is done cleanly, or overwritten by a -half-merged results when this combining results in conflicts. -Therefore, if you have uncommitted changes touching the same files as -the ones impacted by the merge, Git will refuse to proceed. Most of -the time, you will want to commit your changes before you can merge, -and if you don't, then linkgit:git-stash[1] can take these changes -away while you're doing the merge, and reapply them afterwards. - -If the changes are independent enough, Git will automatically complete -the merge and commit the result (or reuse an existing commit in case -of <<fast-forwards,fast-forward>>, see below). On the other hand, -if there are conflicts--for example, if the same file is -modified in two different ways in the remote branch and the local -branch--then you are warned; the output may look something like this: - -------------------------------------------------- -$ git merge next - 100% (4/4) done -Auto-merged file.txt -CONFLICT (content): Merge conflict in file.txt -Automatic merge failed; fix conflicts and then commit the result. -------------------------------------------------- - -Conflict markers are left in the problematic files, and after -you resolve the conflicts manually, you can update the index -with the contents and run Git commit, as you normally would when -creating a new file. - -If you examine the resulting commit using gitk, you will see that it -has two parents, one pointing to the top of the current branch, and -one to the top of the other branch. - -[[resolving-a-merge]] -=== Resolving a merge - -When a merge isn't resolved automatically, Git leaves the index and -the working tree in a special state that gives you all the -information you need to help resolve the merge. - -Files with conflicts are marked specially in the index, so until you -resolve the problem and update the index, linkgit:git-commit[1] will -fail: - -------------------------------------------------- -$ git commit -file.txt: needs merge -------------------------------------------------- - -Also, linkgit:git-status[1] will list those files as "unmerged", and the -files with conflicts will have conflict markers added, like this: - -------------------------------------------------- -<<<<<<< HEAD:file.txt -Hello world -======= -Goodbye ->>>>>>> 77976da35a11db4580b80ae27e8d65caf5208086:file.txt -------------------------------------------------- - -All you need to do is edit the files to resolve the conflicts, and then - -------------------------------------------------- -$ git add file.txt -$ git commit -------------------------------------------------- - -Note that the commit message will already be filled in for you with -some information about the merge. Normally you can just use this -default message unchanged, but you may add additional commentary of -your own if desired. - -The above is all you need to know to resolve a simple merge. But Git -also provides more information to help resolve conflicts: - -[[conflict-resolution]] -==== Getting conflict-resolution help during a merge - -All of the changes that Git was able to merge automatically are -already added to the index file, so linkgit:git-diff[1] shows only -the conflicts. It uses an unusual syntax: - -------------------------------------------------- -$ git diff -diff --cc file.txt -index 802992c,2b60207..0000000 ---- a/file.txt -+++ b/file.txt -@@@ -1,1 -1,1 +1,5 @@@ -++<<<<<<< HEAD:file.txt - +Hello world -++======= -+ Goodbye -++>>>>>>> 77976da35a11db4580b80ae27e8d65caf5208086:file.txt -------------------------------------------------- - -Recall that the commit which will be committed after we resolve this -conflict will have two parents instead of the usual one: one parent -will be HEAD, the tip of the current branch; the other will be the -tip of the other branch, which is stored temporarily in MERGE_HEAD. - -During the merge, the index holds three versions of each file. Each of -these three "file stages" represents a different version of the file: - -------------------------------------------------- -$ git show :1:file.txt # the file in a common ancestor of both branches -$ git show :2:file.txt # the version from HEAD. -$ git show :3:file.txt # the version from MERGE_HEAD. -------------------------------------------------- - -When you ask linkgit:git-diff[1] to show the conflicts, it runs a -three-way diff between the conflicted merge results in the work tree with -stages 2 and 3 to show only hunks whose contents come from both sides, -mixed (in other words, when a hunk's merge results come only from stage 2, -that part is not conflicting and is not shown. Same for stage 3). - -The diff above shows the differences between the working-tree version of -file.txt and the stage 2 and stage 3 versions. So instead of preceding -each line by a single `+` or `-`, it now uses two columns: the first -column is used for differences between the first parent and the working -directory copy, and the second for differences between the second parent -and the working directory copy. (See the "COMBINED DIFF FORMAT" section -of linkgit:git-diff-files[1] for a details of the format.) - -After resolving the conflict in the obvious way (but before updating the -index), the diff will look like: - -------------------------------------------------- -$ git diff -diff --cc file.txt -index 802992c,2b60207..0000000 ---- a/file.txt -+++ b/file.txt -@@@ -1,1 -1,1 +1,1 @@@ -- Hello world - -Goodbye -++Goodbye world -------------------------------------------------- - -This shows that our resolved version deleted "Hello world" from the -first parent, deleted "Goodbye" from the second parent, and added -"Goodbye world", which was previously absent from both. - -Some special diff options allow diffing the working directory against -any of these stages: - -------------------------------------------------- -$ git diff -1 file.txt # diff against stage 1 -$ git diff --base file.txt # same as the above -$ git diff -2 file.txt # diff against stage 2 -$ git diff --ours file.txt # same as the above -$ git diff -3 file.txt # diff against stage 3 -$ git diff --theirs file.txt # same as the above. -------------------------------------------------- - -The linkgit:git-log[1] and linkgit:gitk[1] commands also provide special help -for merges: - -------------------------------------------------- -$ git log --merge -$ gitk --merge -------------------------------------------------- - -These will display all commits which exist only on HEAD or on -MERGE_HEAD, and which touch an unmerged file. - -You may also use linkgit:git-mergetool[1], which lets you merge the -unmerged files using external tools such as Emacs or kdiff3. - -Each time you resolve the conflicts in a file and update the index: - -------------------------------------------------- -$ git add file.txt -------------------------------------------------- - -the different stages of that file will be "collapsed", after which -`git diff` will (by default) no longer show diffs for that file. - -[[undoing-a-merge]] -=== Undoing a merge - -If you get stuck and decide to just give up and throw the whole mess -away, you can always return to the pre-merge state with - -------------------------------------------------- -$ git merge --abort -------------------------------------------------- - -Or, if you've already committed the merge that you want to throw away, - -------------------------------------------------- -$ git reset --hard ORIG_HEAD -------------------------------------------------- - -However, this last command can be dangerous in some cases--never -throw away a commit you have already committed if that commit may -itself have been merged into another branch, as doing so may confuse -further merges. - -[[fast-forwards]] -=== Fast-forward merges - -There is one special case not mentioned above, which is treated -differently. Normally, a merge results in a merge commit, with two -parents, one pointing at each of the two lines of development that -were merged. - -However, if the current branch is an ancestor of the other--so every commit -present in the current branch is already contained in the other branch--then Git -just performs a "fast-forward"; the head of the current branch is moved forward -to point at the head of the merged-in branch, without any new commits being -created. - -[[fixing-mistakes]] -=== Fixing mistakes - -If you've messed up the working tree, but haven't yet committed your -mistake, you can return the entire working tree to the last committed -state with - -------------------------------------------------- -$ git restore --staged --worktree :/ -------------------------------------------------- - -If you make a commit that you later wish you hadn't, there are two -fundamentally different ways to fix the problem: - - 1. You can create a new commit that undoes whatever was done - by the old commit. This is the correct thing if your - mistake has already been made public. - - 2. You can go back and modify the old commit. You should - never do this if you have already made the history public; - Git does not normally expect the "history" of a project to - change, and cannot correctly perform repeated merges from - a branch that has had its history changed. - -[[reverting-a-commit]] -==== Fixing a mistake with a new commit - -Creating a new commit that reverts an earlier change is very easy; -just pass the linkgit:git-revert[1] command a reference to the bad -commit; for example, to revert the most recent commit: - -------------------------------------------------- -$ git revert HEAD -------------------------------------------------- - -This will create a new commit which undoes the change in HEAD. You -will be given a chance to edit the commit message for the new commit. - -You can also revert an earlier change, for example, the next-to-last: - -------------------------------------------------- -$ git revert HEAD^ -------------------------------------------------- - -In this case Git will attempt to undo the old change while leaving -intact any changes made since then. If more recent changes overlap -with the changes to be reverted, then you will be asked to fix -conflicts manually, just as in the case of <<resolving-a-merge, -resolving a merge>>. - -[[fixing-a-mistake-by-rewriting-history]] -==== Fixing a mistake by rewriting history - -If the problematic commit is the most recent commit, and you have not -yet made that commit public, then you may just -<<undoing-a-merge,destroy it using `git reset`>>. - -Alternatively, you -can edit the working directory and update the index to fix your -mistake, just as if you were going to <<how-to-make-a-commit,create a -new commit>>, then run - -------------------------------------------------- -$ git commit --amend -------------------------------------------------- - -which will replace the old commit by a new commit incorporating your -changes, giving you a chance to edit the old commit message first. - -Again, you should never do this to a commit that may already have -been merged into another branch; use linkgit:git-revert[1] instead in -that case. - -It is also possible to replace commits further back in the history, but -this is an advanced topic to be left for -<<cleaning-up-history,another chapter>>. - -[[checkout-of-path]] -==== Checking out an old version of a file - -In the process of undoing a previous bad change, you may find it -useful to check out an older version of a particular file using -linkgit:git-restore[1]. The command - -------------------------------------------------- -$ git restore --source=HEAD^ path/to/file -------------------------------------------------- - -replaces path/to/file by the contents it had in the commit HEAD^, and -also updates the index to match. It does not change branches. - -If you just want to look at an old version of the file, without -modifying the working directory, you can do that with -linkgit:git-show[1]: - -------------------------------------------------- -$ git show HEAD^:path/to/file -------------------------------------------------- - -which will display the given version of the file. - -[[interrupted-work]] -==== Temporarily setting aside work in progress - -While you are in the middle of working on something complicated, you -find an unrelated but obvious and trivial bug. You would like to fix it -before continuing. You can use linkgit:git-stash[1] to save the current -state of your work, and after fixing the bug (or, optionally after doing -so on a different branch and then coming back), unstash the -work-in-progress changes. - ------------------------------------------------- -$ git stash push -m "work in progress for foo feature" ------------------------------------------------- - -This command will save your changes away to the `stash`, and -reset your working tree and the index to match the tip of your -current branch. Then you can make your fix as usual. - ------------------------------------------------- -... edit and test ... -$ git commit -a -m "blorpl: typofix" ------------------------------------------------- - -After that, you can go back to what you were working on with -`git stash pop`: - ------------------------------------------------- -$ git stash pop ------------------------------------------------- - - -[[ensuring-good-performance]] -=== Ensuring good performance - -On large repositories, Git depends on compression to keep the history -information from taking up too much space on disk or in memory. Some -Git commands may automatically run linkgit:git-gc[1], so you don't -have to worry about running it manually. However, compressing a large -repository may take a while, so you may want to call `gc` explicitly -to avoid automatic compression kicking in when it is not convenient. - - -[[ensuring-reliability]] -=== Ensuring reliability - -[[checking-for-corruption]] -==== Checking the repository for corruption - -The linkgit:git-fsck[1] command runs a number of self-consistency checks -on the repository, and reports on any problems. This may take some -time. - -------------------------------------------------- -$ git fsck -dangling commit 7281251ddd2a61e38657c827739c57015671a6b3 -dangling commit 2706a059f258c6b245f298dc4ff2ccd30ec21a63 -dangling commit 13472b7c4b80851a1bc551779171dcb03655e9b5 -dangling blob 218761f9d90712d37a9c5e36f406f92202db07eb -dangling commit bf093535a34a4d35731aa2bd90fe6b176302f14f -dangling commit 8e4bec7f2ddaa268bef999853c25755452100f8e -dangling tree d50bb86186bf27b681d25af89d3b5b68382e4085 -dangling tree b24c2473f1fd3d91352a624795be026d64c8841f -... -------------------------------------------------- - -You will see informational messages on dangling objects. They are objects -that still exist in the repository but are no longer referenced by any of -your branches, and can (and will) be removed after a while with `gc`. -You can run `git fsck --no-dangling` to suppress these messages, and still -view real errors. - -[[recovering-lost-changes]] -==== Recovering lost changes - -[[reflogs]] -===== Reflogs - -Say you modify a branch with <<fixing-mistakes,`git reset --hard`>>, -and then realize that the branch was the only reference you had to -that point in history. - -Fortunately, Git also keeps a log, called a "reflog", of all the -previous values of each branch. So in this case you can still find the -old history using, for example, - -------------------------------------------------- -$ git log master@{1} -------------------------------------------------- - -This lists the commits reachable from the previous version of the -`master` branch head. This syntax can be used with any Git command -that accepts a commit, not just with `git log`. Some other examples: - -------------------------------------------------- -$ git show master@{2} # See where the branch pointed 2, -$ git show master@{3} # 3, ... changes ago. -$ gitk master@{yesterday} # See where it pointed yesterday, -$ gitk master@{"1 week ago"} # ... or last week -$ git log --walk-reflogs master # show reflog entries for master -------------------------------------------------- - -A separate reflog is kept for the HEAD, so - -------------------------------------------------- -$ git show HEAD@{"1 week ago"} -------------------------------------------------- - -will show what HEAD pointed to one week ago, not what the current branch -pointed to one week ago. This allows you to see the history of what -you've checked out. - -The reflogs are kept by default for 30 days, after which they may be -pruned. See linkgit:git-reflog[1] and linkgit:git-gc[1] to learn -how to control this pruning, and see the "SPECIFYING REVISIONS" -section of linkgit:gitrevisions[7] for details. - -Note that the reflog history is very different from normal Git history. -While normal history is shared by every repository that works on the -same project, the reflog history is not shared: it tells you only about -how the branches in your local repository have changed over time. - -[[dangling-object-recovery]] -===== Examining dangling objects - -In some situations the reflog may not be able to save you. For example, -suppose you delete a branch, then realize you need the history it -contained. The reflog is also deleted; however, if you have not yet -pruned the repository, then you may still be able to find the lost -commits in the dangling objects that `git fsck` reports. See -<<dangling-objects>> for the details. - -------------------------------------------------- -$ git fsck -dangling commit 7281251ddd2a61e38657c827739c57015671a6b3 -dangling commit 2706a059f258c6b245f298dc4ff2ccd30ec21a63 -dangling commit 13472b7c4b80851a1bc551779171dcb03655e9b5 -... -------------------------------------------------- - -You can examine -one of those dangling commits with, for example, - ------------------------------------------------- -$ gitk 7281251ddd --not --all ------------------------------------------------- - -which does what it sounds like: it says that you want to see the commit -history that is described by the dangling commit(s), but not the -history that is described by all your existing branches and tags. Thus -you get exactly the history reachable from that commit that is lost. -(And notice that it might not be just one commit: we only report the -"tip of the line" as being dangling, but there might be a whole deep -and complex commit history that was dropped.) - -If you decide you want the history back, you can always create a new -reference pointing to it, for example, a new branch: - ------------------------------------------------- -$ git branch recovered-branch 7281251ddd ------------------------------------------------- - -Other types of dangling objects (blobs and trees) are also possible, and -dangling objects can arise in other situations. - - -[[sharing-development]] -== Sharing development with others - -[[getting-updates-With-git-pull]] -=== Getting updates with git pull - -After you clone a repository and commit a few changes of your own, you -may wish to check the original repository for updates and merge them -into your own work. - -We have already seen <<Updating-a-repository-With-git-fetch,how to -keep remote-tracking branches up to date>> with linkgit:git-fetch[1], -and how to merge two branches. So you can merge in changes from the -original repository's master branch with: - -------------------------------------------------- -$ git fetch -$ git merge origin/master -------------------------------------------------- - -However, the linkgit:git-pull[1] command provides a way to do this in -one step: - -------------------------------------------------- -$ git pull origin master -------------------------------------------------- - -In fact, if you have `master` checked out, then this branch has been -configured by `git clone` to get changes from the HEAD branch of the -origin repository. So often you can -accomplish the above with just a simple - -------------------------------------------------- -$ git pull -------------------------------------------------- - -This command will fetch changes from the remote branches to your -remote-tracking branches `origin/*`, and merge the default branch into -the current branch. - -More generally, a branch that is created from a remote-tracking branch -will pull -by default from that branch. See the descriptions of the -`branch.<name>.remote` and `branch.<name>.merge` options in -linkgit:git-config[1], and the discussion of the `--track` option in -linkgit:git-checkout[1], to learn how to control these defaults. - -In addition to saving you keystrokes, `git pull` also helps you by -producing a default commit message documenting the branch and -repository that you pulled from. - -(But note that no such commit will be created in the case of a -<<fast-forwards,fast-forward>>; instead, your branch will just be -updated to point to the latest commit from the upstream branch.) - -The `git pull` command can also be given `.` as the "remote" repository, -in which case it just merges in a branch from the current repository; so -the commands - -------------------------------------------------- -$ git pull . branch -$ git merge branch -------------------------------------------------- - -are roughly equivalent. - -[[submitting-patches]] -=== Submitting patches to a project - -If you just have a few changes, the simplest way to submit them may -just be to send them as patches in email: - -First, use linkgit:git-format-patch[1]; for example: - -------------------------------------------------- -$ git format-patch origin -------------------------------------------------- - -will produce a numbered series of files in the current directory, one -for each patch in the current branch but not in `origin/HEAD`. - -`git format-patch` can include an initial "cover letter". You can insert -commentary on individual patches after the three dash line which -`format-patch` places after the commit message but before the patch -itself. If you use `git notes` to track your cover letter material, -`git format-patch --notes` will include the commit's notes in a similar -manner. - -You can then import these into your mail client and send them by -hand. However, if you have a lot to send at once, you may prefer to -use the linkgit:git-send-email[1] script to automate the process. -Consult the mailing list for your project first to determine -their requirements for submitting patches. - -[[importing-patches]] -=== Importing patches to a project - -Git also provides a tool called linkgit:git-am[1] (am stands for -"apply mailbox"), for importing such an emailed series of patches. -Just save all of the patch-containing messages, in order, into a -single mailbox file, say `patches.mbox`, then run - -------------------------------------------------- -$ git am -3 patches.mbox -------------------------------------------------- - -Git will apply each patch in order; if any conflicts are found, it -will stop, and you can fix the conflicts as described in -"<<resolving-a-merge,Resolving a merge>>". (The `-3` option tells -Git to perform a merge; if you would prefer it just to abort and -leave your tree and index untouched, you may omit that option.) - -Once the index is updated with the results of the conflict -resolution, instead of creating a new commit, just run - -------------------------------------------------- -$ git am --continue -------------------------------------------------- - -and Git will create the commit for you and continue applying the -remaining patches from the mailbox. - -The final result will be a series of commits, one for each patch in -the original mailbox, with authorship and commit log message each -taken from the message containing each patch. - -[[public-repositories]] -=== Public Git repositories - -Another way to submit changes to a project is to tell the maintainer -of that project to pull the changes from your repository using -linkgit:git-pull[1]. In the section "<<getting-updates-With-git-pull, -Getting updates with `git pull`>>" we described this as a way to get -updates from the "main" repository, but it works just as well in the -other direction. - -If you and the maintainer both have accounts on the same machine, then -you can just pull changes from each other's repositories directly; -commands that accept repository URLs as arguments will also accept a -local directory name: - -------------------------------------------------- -$ git clone /path/to/repository -$ git pull /path/to/other/repository -------------------------------------------------- - -or an ssh URL: - -------------------------------------------------- -$ git clone ssh://yourhost/~you/repository -------------------------------------------------- - -For projects with few developers, or for synchronizing a few private -repositories, this may be all you need. - -However, the more common way to do this is to maintain a separate public -repository (usually on a different host) for others to pull changes -from. This is usually more convenient, and allows you to cleanly -separate private work in progress from publicly visible work. - -You will continue to do your day-to-day work in your personal -repository, but periodically "push" changes from your personal -repository into your public repository, allowing other developers to -pull from that repository. So the flow of changes, in a situation -where there is one other developer with a public repository, looks -like this: - -.... - you push -your personal repo ------------------> your public repo - ^ | - | | - | you pull | they pull - | | - | | - | they push V -their public repo <------------------- their repo -.... - -We explain how to do this in the following sections. - -[[setting-up-a-public-repository]] -==== Setting up a public repository - -Assume your personal repository is in the directory `~/proj`. We -first create a new clone of the repository and tell `git daemon` that it -is meant to be public: - -------------------------------------------------- -$ git clone --bare ~/proj proj.git -$ touch proj.git/git-daemon-export-ok -------------------------------------------------- - -The resulting directory proj.git contains a "bare" git repository--it is -just the contents of the `.git` directory, without any files checked out -around it. - -Next, copy `proj.git` to the server where you plan to host the -public repository. You can use scp, rsync, or whatever is most -convenient. - -[[exporting-via-git]] -==== Exporting a Git repository via the Git protocol - -This is the preferred method. - -If someone else administers the server, they should tell you what -directory to put the repository in, and what `git://` URL it will -appear at. You can then skip to the section -"<<pushing-changes-to-a-public-repository,Pushing changes to a public -repository>>", below. - -Otherwise, all you need to do is start linkgit:git-daemon[1]; it will -listen on port 9418. By default, it will allow access to any directory -that looks like a Git directory and contains the magic file -git-daemon-export-ok. Passing some directory paths as `git daemon` -arguments will further restrict the exports to those paths. - -You can also run `git daemon` as an inetd service; see the -linkgit:git-daemon[1] man page for details. (See especially the -examples section.) - -[[exporting-via-http]] -==== Exporting a git repository via HTTP - -The Git protocol gives better performance and reliability, but on a -host with a web server set up, HTTP exports may be simpler to set up. - -All you need to do is place the newly created bare Git repository in -a directory that is exported by the web server, and make some -adjustments to give web clients some extra information they need: - -------------------------------------------------- -$ mv proj.git /home/you/public_html/proj.git -$ cd proj.git -$ git --bare update-server-info -$ mv hooks/post-update.sample hooks/post-update -------------------------------------------------- - -(For an explanation of the last two lines, see -linkgit:git-update-server-info[1] and linkgit:githooks[5].) - -Advertise the URL of `proj.git`. Anybody else should then be able to -clone or pull from that URL, for example with a command line like: - -------------------------------------------------- -$ git clone http://yourserver.com/~you/proj.git -------------------------------------------------- - -(See also -link:howto/setup-git-server-over-http.html[setup-git-server-over-http] -for a slightly more sophisticated setup using WebDAV which also -allows pushing over HTTP.) - -[[pushing-changes-to-a-public-repository]] -==== Pushing changes to a public repository - -Note that the two techniques outlined above (exporting via -<<exporting-via-http,http>> or <<exporting-via-git,git>>) allow other -maintainers to fetch your latest changes, but they do not allow write -access, which you will need to update the public repository with the -latest changes created in your private repository. - -The simplest way to do this is using linkgit:git-push[1] and ssh; to -update the remote branch named `master` with the latest state of your -branch named `master`, run - -------------------------------------------------- -$ git push ssh://yourserver.com/~you/proj.git master:master -------------------------------------------------- - -or just - -------------------------------------------------- -$ git push ssh://yourserver.com/~you/proj.git master -------------------------------------------------- - -As with `git fetch`, `git push` will complain if this does not result in a -<<fast-forwards,fast-forward>>; see the following section for details on -handling this case. - -Note that the target of a `push` is normally a -<<def_bare_repository,bare>> repository. You can also push to a -repository that has a checked-out working tree, but a push to update the -currently checked-out branch is denied by default to prevent confusion. -See the description of the receive.denyCurrentBranch option -in linkgit:git-config[1] for details. - -As with `git fetch`, you may also set up configuration options to -save typing; so, for example: - -------------------------------------------------- -$ git remote add public-repo ssh://yourserver.com/~you/proj.git -------------------------------------------------- - -adds the following to `.git/config`: - -------------------------------------------------- -[remote "public-repo"] - url = yourserver.com:proj.git - fetch = +refs/heads/*:refs/remotes/example/* -------------------------------------------------- - -which lets you do the same push with just - -------------------------------------------------- -$ git push public-repo master -------------------------------------------------- - -See the explanations of the `remote.<name>.url`, -`branch.<name>.remote`, and `remote.<name>.push` options in -linkgit:git-config[1] for details. - -[[forcing-push]] -==== What to do when a push fails - -If a push would not result in a <<fast-forwards,fast-forward>> of the -remote branch, then it will fail with an error like: - -------------------------------------------------- - ! [rejected] master -> master (non-fast-forward) -error: failed to push some refs to '...' -hint: Updates were rejected because the tip of your current branch is behind -hint: its remote counterpart. Integrate the remote changes (e.g. -hint: 'git pull ...') before pushing again. -hint: See the 'Note about fast-forwards' in 'git push --help' for details. -------------------------------------------------- - -This can happen, for example, if you: - - - use `git reset --hard` to remove already-published commits, or - - use `git commit --amend` to replace already-published commits - (as in <<fixing-a-mistake-by-rewriting-history>>), or - - use `git rebase` to rebase any already-published commits (as - in <<using-git-rebase>>). - -You may force `git push` to perform the update anyway by preceding the -branch name with a plus sign: - -------------------------------------------------- -$ git push ssh://yourserver.com/~you/proj.git +master -------------------------------------------------- - -Note the addition of the `+` sign. Alternatively, you can use the -`-f` flag to force the remote update, as in: - -------------------------------------------------- -$ git push -f ssh://yourserver.com/~you/proj.git master -------------------------------------------------- - -Normally whenever a branch head in a public repository is modified, it -is modified to point to a descendant of the commit that it pointed to -before. By forcing a push in this situation, you break that convention. -(See <<problems-With-rewriting-history>>.) - -Nevertheless, this is a common practice for people that need a simple -way to publish a work-in-progress patch series, and it is an acceptable -compromise as long as you warn other developers that this is how you -intend to manage the branch. - -It's also possible for a push to fail in this way when other people have -the right to push to the same repository. In that case, the correct -solution is to retry the push after first updating your work: either by a -pull, or by a fetch followed by a rebase; see the -<<setting-up-a-shared-repository,next section>> and -linkgit:gitcvs-migration[7] for more. - -[[setting-up-a-shared-repository]] -==== Setting up a shared repository - -Another way to collaborate is by using a model similar to that -commonly used in CVS, where several developers with special rights -all push to and pull from a single shared repository. See -linkgit:gitcvs-migration[7] for instructions on how to -set this up. - -However, while there is nothing wrong with Git's support for shared -repositories, this mode of operation is not generally recommended, -simply because the mode of collaboration that Git supports--by -exchanging patches and pulling from public repositories--has so many -advantages over the central shared repository: - - - Git's ability to quickly import and merge patches allows a - single maintainer to process incoming changes even at very - high rates. And when that becomes too much, `git pull` provides - an easy way for that maintainer to delegate this job to other - maintainers while still allowing optional review of incoming - changes. - - Since every developer's repository has the same complete copy - of the project history, no repository is special, and it is - trivial for another developer to take over maintenance of a - project, either by mutual agreement, or because a maintainer - becomes unresponsive or difficult to work with. - - The lack of a central group of "committers" means there is - less need for formal decisions about who is "in" and who is - "out". - -[[setting-up-gitweb]] -==== Allowing web browsing of a repository - -The gitweb cgi script provides users an easy way to browse your -project's revisions, file contents and logs without having to install -Git. Features like RSS/Atom feeds and blame/annotation details may -optionally be enabled. - -The linkgit:git-instaweb[1] command provides a simple way to start -browsing the repository using gitweb. The default server when using -instaweb is lighttpd. - -See the file gitweb/INSTALL in the Git source tree and -linkgit:gitweb[1] for instructions on details setting up a permanent -installation with a CGI or Perl capable server. - -[[how-to-get-a-git-repository-with-minimal-history]] -=== How to get a Git repository with minimal history - -A <<def_shallow_clone,shallow clone>>, with its truncated -history, is useful when one is interested only in recent history -of a project and getting full history from the upstream is -expensive. - -A <<def_shallow_clone,shallow clone>> is created by specifying -the linkgit:git-clone[1] `--depth` switch. The depth can later be -changed with the linkgit:git-fetch[1] `--depth` switch, or full -history restored with `--unshallow`. - -Merging inside a <<def_shallow_clone,shallow clone>> will work as long -as a merge base is in the recent history. -Otherwise, it will be like merging unrelated histories and may -have to result in huge conflicts. This limitation may make such -a repository unsuitable to be used in merge based workflows. - -[[sharing-development-examples]] -=== Examples - -[[maintaining-topic-branches]] -==== Maintaining topic branches for a Linux subsystem maintainer - -This describes how Tony Luck uses Git in his role as maintainer of the -IA64 architecture for the Linux kernel. - -He uses two public branches: - - - A "test" tree into which patches are initially placed so that they - can get some exposure when integrated with other ongoing development. - This tree is available to Andrew for pulling into -mm whenever he - wants. - - - A "release" tree into which tested patches are moved for final sanity - checking, and as a vehicle to send them upstream to Linus (by sending - him a "please pull" request.) - -He also uses a set of temporary branches ("topic branches"), each -containing a logical grouping of patches. - -To set this up, first create your work tree by cloning Linus's public -tree: - -------------------------------------------------- -$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git work -$ cd work -------------------------------------------------- - -Linus's tree will be stored in the remote-tracking branch named origin/master, -and can be updated using linkgit:git-fetch[1]; you can track other -public trees using linkgit:git-remote[1] to set up a "remote" and -linkgit:git-fetch[1] to keep them up to date; see -<<repositories-and-branches>>. - -Now create the branches in which you are going to work; these start out -at the current tip of origin/master branch, and should be set up (using -the `--track` option to linkgit:git-branch[1]) to merge changes in from -Linus by default. - -------------------------------------------------- -$ git branch --track test origin/master -$ git branch --track release origin/master -------------------------------------------------- - -These can be easily kept up to date using linkgit:git-pull[1]. - -------------------------------------------------- -$ git switch test && git pull -$ git switch release && git pull -------------------------------------------------- - -Important note! If you have any local changes in these branches, then -this merge will create a commit object in the history (with no local -changes Git will simply do a "fast-forward" merge). Many people dislike -the "noise" that this creates in the Linux history, so you should avoid -doing this capriciously in the `release` branch, as these noisy commits -will become part of the permanent history when you ask Linus to pull -from the release branch. - -A few configuration variables (see linkgit:git-config[1]) can -make it easy to push both branches to your public tree. (See -<<setting-up-a-public-repository>>.) - -------------------------------------------------- -$ cat >> .git/config <<EOF -[remote "mytree"] - url = master.kernel.org:/pub/scm/linux/kernel/git/aegl/linux.git - push = release - push = test -EOF -------------------------------------------------- - -Then you can push both the test and release trees using -linkgit:git-push[1]: - -------------------------------------------------- -$ git push mytree -------------------------------------------------- - -or push just one of the test and release branches using: - -------------------------------------------------- -$ git push mytree test -------------------------------------------------- - -or - -------------------------------------------------- -$ git push mytree release -------------------------------------------------- - -Now to apply some patches from the community. Think of a short -snappy name for a branch to hold this patch (or related group of -patches), and create a new branch from a recent stable tag of -Linus's branch. Picking a stable base for your branch will: -1) help you: by avoiding inclusion of unrelated and perhaps lightly -tested changes -2) help future bug hunters that use `git bisect` to find problems - -------------------------------------------------- -$ git switch -c speed-up-spinlocks v2.6.35 -------------------------------------------------- - -Now you apply the patch(es), run some tests, and commit the change(s). If -the patch is a multi-part series, then you should apply each as a separate -commit to this branch. - -------------------------------------------------- -$ ... patch ... test ... commit [ ... patch ... test ... commit ]* -------------------------------------------------- - -When you are happy with the state of this change, you can merge it into the -"test" branch in preparation to make it public: - -------------------------------------------------- -$ git switch test && git merge speed-up-spinlocks -------------------------------------------------- - -It is unlikely that you would have any conflicts here ... but you might if you -spent a while on this step and had also pulled new versions from upstream. - -Sometime later when enough time has passed and testing done, you can pull the -same branch into the `release` tree ready to go upstream. This is where you -see the value of keeping each patch (or patch series) in its own branch. It -means that the patches can be moved into the `release` tree in any order. - -------------------------------------------------- -$ git switch release && git merge speed-up-spinlocks -------------------------------------------------- - -After a while, you will have a number of branches, and despite the -well chosen names you picked for each of them, you may forget what -they are for, or what status they are in. To get a reminder of what -changes are in a specific branch, use: - -------------------------------------------------- -$ git log linux..branchname | git shortlog -------------------------------------------------- - -To see whether it has already been merged into the test or release branches, -use: - -------------------------------------------------- -$ git log test..branchname -------------------------------------------------- - -or - -------------------------------------------------- -$ git log release..branchname -------------------------------------------------- - -(If this branch has not yet been merged, you will see some log entries. -If it has been merged, then there will be no output.) - -Once a patch completes the great cycle (moving from test to release, -then pulled by Linus, and finally coming back into your local -`origin/master` branch), the branch for this change is no longer needed. -You detect this when the output from: - -------------------------------------------------- -$ git log origin..branchname -------------------------------------------------- - -is empty. At this point the branch can be deleted: - -------------------------------------------------- -$ git branch -d branchname -------------------------------------------------- - -Some changes are so trivial that it is not necessary to create a separate -branch and then merge into each of the test and release branches. For -these changes, just apply directly to the `release` branch, and then -merge that into the `test` branch. - -After pushing your work to `mytree`, you can use -linkgit:git-request-pull[1] to prepare a "please pull" request message -to send to Linus: - -------------------------------------------------- -$ git push mytree -$ git request-pull origin mytree release -------------------------------------------------- - -Here are some of the scripts that simplify all this even further. - -------------------------------------------------- -==== update script ==== -# Update a branch in my Git tree. If the branch to be updated -# is origin, then pull from kernel.org. Otherwise merge -# origin/master branch into test|release branch - -case "$1" in -test|release) - git checkout $1 && git pull . origin - ;; -origin) - before=$(git rev-parse refs/remotes/origin/master) - git fetch origin - after=$(git rev-parse refs/remotes/origin/master) - if [ $before != $after ] - then - git log $before..$after | git shortlog - fi - ;; -*) - echo "usage: $0 origin|test|release" 1>&2 - exit 1 - ;; -esac -------------------------------------------------- - -------------------------------------------------- -==== merge script ==== -# Merge a branch into either the test or release branch - -pname=$0 - -usage() -{ - echo "usage: $pname branch test|release" 1>&2 - exit 1 -} - -git show-ref -q --verify -- refs/heads/"$1" || { - echo "Can't see branch <$1>" 1>&2 - usage -} - -case "$2" in -test|release) - if [ $(git log $2..$1 | wc -c) -eq 0 ] - then - echo $1 already merged into $2 1>&2 - exit 1 - fi - git checkout $2 && git pull . $1 - ;; -*) - usage - ;; -esac -------------------------------------------------- - -------------------------------------------------- -==== status script ==== -# report on status of my ia64 Git tree - -gb=$(tput setab 2) -rb=$(tput setab 1) -restore=$(tput setab 9) - -if [ `git rev-list test..release | wc -c` -gt 0 ] -then - echo $rb Warning: commits in release that are not in test $restore - git log test..release -fi - -for branch in `git show-ref --heads | sed 's|^.*/||'` -do - if [ $branch = test -o $branch = release ] - then - continue - fi - - echo -n $gb ======= $branch ====== $restore " " - status= - for ref in test release origin/master - do - if [ `git rev-list $ref..$branch | wc -c` -gt 0 ] - then - status=$status${ref:0:1} - fi - done - case $status in - trl) - echo $rb Need to pull into test $restore - ;; - rl) - echo "In test" - ;; - l) - echo "Waiting for linus" - ;; - "") - echo $rb All done $restore - ;; - *) - echo $rb "<$status>" $restore - ;; - esac - git log origin/master..$branch | git shortlog -done -------------------------------------------------- - - -[[cleaning-up-history]] -== Rewriting history and maintaining patch series - -Normally commits are only added to a project, never taken away or -replaced. Git is designed with this assumption, and violating it will -cause Git's merge machinery (for example) to do the wrong thing. - -However, there is a situation in which it can be useful to violate this -assumption. - -[[patch-series]] -=== Creating the perfect patch series - -Suppose you are a contributor to a large project, and you want to add a -complicated feature, and to present it to the other developers in a way -that makes it easy for them to read your changes, verify that they are -correct, and understand why you made each change. - -If you present all of your changes as a single patch (or commit), they -may find that it is too much to digest all at once. - -If you present them with the entire history of your work, complete with -mistakes, corrections, and dead ends, they may be overwhelmed. - -So the ideal is usually to produce a series of patches such that: - - 1. Each patch can be applied in order. - - 2. Each patch includes a single logical change, together with a - message explaining the change. - - 3. No patch introduces a regression: after applying any initial - part of the series, the resulting project still compiles and - works, and has no bugs that it didn't have before. - - 4. The complete series produces the same end result as your own - (probably much messier!) development process did. - -We will introduce some tools that can help you do this, explain how to -use them, and then explain some of the problems that can arise because -you are rewriting history. - -[[using-git-rebase]] -=== Keeping a patch series up to date using git rebase - -Suppose that you create a branch `mywork` on a remote-tracking branch -`origin`, and create some commits on top of it: - -------------------------------------------------- -$ git switch -c mywork origin -$ vi file.txt -$ git commit -$ vi otherfile.txt -$ git commit -... -------------------------------------------------- - -You have performed no merges into mywork, so it is just a simple linear -sequence of patches on top of `origin`: - -................................................ - o--o--O <-- origin - \ - a--b--c <-- mywork -................................................ - -Some more interesting work has been done in the upstream project, and -`origin` has advanced: - -................................................ - o--o--O--o--o--o <-- origin - \ - a--b--c <-- mywork -................................................ - -At this point, you could use `pull` to merge your changes back in; -the result would create a new merge commit, like this: - -................................................ - o--o--O--o--o--o <-- origin - \ \ - a--b--c--m <-- mywork -................................................ - -However, if you prefer to keep the history in mywork a simple series of -commits without any merges, you may instead choose to use -linkgit:git-rebase[1]: - -------------------------------------------------- -$ git switch mywork -$ git rebase origin -------------------------------------------------- - -This will remove each of your commits from mywork, temporarily saving -them as patches (in a directory named `.git/rebase-apply`), update mywork to -point at the latest version of origin, then apply each of the saved -patches to the new mywork. The result will look like: - - -................................................ - o--o--O--o--o--o <-- origin - \ - a'--b'--c' <-- mywork -................................................ - -In the process, it may discover conflicts. In that case it will stop -and allow you to fix the conflicts; after fixing conflicts, use `git add` -to update the index with those contents, and then, instead of -running `git commit`, just run - -------------------------------------------------- -$ git rebase --continue -------------------------------------------------- - -and Git will continue applying the rest of the patches. - -At any point you may use the `--abort` option to abort this process and -return mywork to the state it had before you started the rebase: - -------------------------------------------------- -$ git rebase --abort -------------------------------------------------- - -If you need to reorder or edit a number of commits in a branch, it may -be easier to use `git rebase -i`, which allows you to reorder and -squash commits, as well as marking them for individual editing during -the rebase. See <<interactive-rebase>> for details, and -<<reordering-patch-series>> for alternatives. - -[[rewriting-one-commit]] -=== Rewriting a single commit - -We saw in <<fixing-a-mistake-by-rewriting-history>> that you can replace the -most recent commit using - -------------------------------------------------- -$ git commit --amend -------------------------------------------------- - -which will replace the old commit by a new commit incorporating your -changes, giving you a chance to edit the old commit message first. -This is useful for fixing typos in your last commit, or for adjusting -the patch contents of a poorly staged commit. - -If you need to amend commits from deeper in your history, you can -use <<interactive-rebase,interactive rebase's `edit` instruction>>. - -[[reordering-patch-series]] -=== Reordering or selecting from a patch series - -Sometimes you want to edit a commit deeper in your history. One -approach is to use `git format-patch` to create a series of patches -and then reset the state to before the patches: - -------------------------------------------------- -$ git format-patch origin -$ git reset --hard origin -------------------------------------------------- - -Then modify, reorder, or eliminate patches as needed before applying -them again with linkgit:git-am[1]: - -------------------------------------------------- -$ git am *.patch -------------------------------------------------- - -[[interactive-rebase]] -=== Using interactive rebases - -You can also edit a patch series with an interactive rebase. This is -the same as <<reordering-patch-series,reordering a patch series using -`format-patch`>>, so use whichever interface you like best. - -Rebase your current HEAD on the last commit you want to retain as-is. -For example, if you want to reorder the last 5 commits, use: - -------------------------------------------------- -$ git rebase -i HEAD~5 -------------------------------------------------- - -This will open your editor with a list of steps to be taken to perform -your rebase. - -------------------------------------------------- -pick deadbee The oneline of this commit -pick fa1afe1 The oneline of the next commit -... - -# Rebase c0ffeee..deadbee onto c0ffeee -# -# Commands: -# p, pick = use commit -# r, reword = use commit, but edit the commit message -# e, edit = use commit, but stop for amending -# s, squash = use commit, but meld into previous commit -# f, fixup = like "squash", but discard this commit's log message -# x, exec = run command (the rest of the line) using shell -# -# These lines can be re-ordered; they are executed from top to bottom. -# -# If you remove a line here THAT COMMIT WILL BE LOST. -# -# However, if you remove everything, the rebase will be aborted. -# -# Note that empty commits are commented out -------------------------------------------------- - -As explained in the comments, you can reorder commits, squash them -together, edit commit messages, etc. by editing the list. Once you -are satisfied, save the list and close your editor, and the rebase -will begin. - -The rebase will stop where `pick` has been replaced with `edit` or -when a step in the list fails to mechanically resolve conflicts and -needs your help. When you are done editing and/or resolving conflicts -you can continue with `git rebase --continue`. If you decide that -things are getting too hairy, you can always bail out with `git rebase ---abort`. Even after the rebase is complete, you can still recover -the original branch by using the <<reflogs,reflog>>. - -For a more detailed discussion of the procedure and additional tips, -see the "INTERACTIVE MODE" section of linkgit:git-rebase[1]. - -[[patch-series-tools]] -=== Other tools - -There are numerous other tools, such as StGit, which exist for the -purpose of maintaining a patch series. These are outside of the scope of -this manual. - -[[problems-With-rewriting-history]] -=== Problems with rewriting history - -The primary problem with rewriting the history of a branch has to do -with merging. Suppose somebody fetches your branch and merges it into -their branch, with a result something like this: - -................................................ - o--o--O--o--o--o <-- origin - \ \ - t--t--t--m <-- their branch: -................................................ - -Then suppose you modify the last three commits: - -................................................ - o--o--o <-- new head of origin - / - o--o--O--o--o--o <-- old head of origin -................................................ - -If we examined all this history together in one repository, it will -look like: - -................................................ - o--o--o <-- new head of origin - / - o--o--O--o--o--o <-- old head of origin - \ \ - t--t--t--m <-- their branch: -................................................ - -Git has no way of knowing that the new head is an updated version of -the old head; it treats this situation exactly the same as it would if -two developers had independently done the work on the old and new heads -in parallel. At this point, if someone attempts to merge the new head -in to their branch, Git will attempt to merge together the two (old and -new) lines of development, instead of trying to replace the old by the -new. The results are likely to be unexpected. - -You may still choose to publish branches whose history is rewritten, -and it may be useful for others to be able to fetch those branches in -order to examine or test them, but they should not attempt to pull such -branches into their own work. - -For true distributed development that supports proper merging, -published branches should never be rewritten. - -[[bisect-merges]] -=== Why bisecting merge commits can be harder than bisecting linear history - -The linkgit:git-bisect[1] command correctly handles history that -includes merge commits. However, when the commit that it finds is a -merge commit, the user may need to work harder than usual to figure out -why that commit introduced a problem. - -Imagine this history: - -................................................ - ---Z---o---X---...---o---A---C---D - \ / - o---o---Y---...---o---B -................................................ - -Suppose that on the upper line of development, the meaning of one -of the functions that exists at Z is changed at commit X. The -commits from Z leading to A change both the function's -implementation and all calling sites that exist at Z, as well -as new calling sites they add, to be consistent. There is no -bug at A. - -Suppose that in the meantime on the lower line of development somebody -adds a new calling site for that function at commit Y. The -commits from Z leading to B all assume the old semantics of that -function and the callers and the callee are consistent with each -other. There is no bug at B, either. - -Suppose further that the two development lines merge cleanly at C, -so no conflict resolution is required. - -Nevertheless, the code at C is broken, because the callers added -on the lower line of development have not been converted to the new -semantics introduced on the upper line of development. So if all -you know is that D is bad, that Z is good, and that -linkgit:git-bisect[1] identifies C as the culprit, how will you -figure out that the problem is due to this change in semantics? - -When the result of a `git bisect` is a non-merge commit, you should -normally be able to discover the problem by examining just that commit. -Developers can make this easy by breaking their changes into small -self-contained commits. That won't help in the case above, however, -because the problem isn't obvious from examination of any single -commit; instead, a global view of the development is required. To -make matters worse, the change in semantics in the problematic -function may be just one small part of the changes in the upper -line of development. - -On the other hand, if instead of merging at C you had rebased the -history between Z to B on top of A, you would have gotten this -linear history: - -................................................................ - ---Z---o---X--...---o---A---o---o---Y*--...---o---B*--D* -................................................................ - -Bisecting between Z and D* would hit a single culprit commit Y*, -and understanding why Y* was broken would probably be easier. - -Partly for this reason, many experienced Git users, even when -working on an otherwise merge-heavy project, keep the history -linear by rebasing against the latest upstream version before -publishing. - -[[advanced-branch-management]] -== Advanced branch management - -[[fetching-individual-branches]] -=== Fetching individual branches - -Instead of using linkgit:git-remote[1], you can also choose just -to update one branch at a time, and to store it locally under an -arbitrary name: - -------------------------------------------------- -$ git fetch origin todo:my-todo-work -------------------------------------------------- - -The first argument, `origin`, just tells Git to fetch from the -repository you originally cloned from. The second argument tells Git -to fetch the branch named `todo` from the remote repository, and to -store it locally under the name `refs/heads/my-todo-work`. - -You can also fetch branches from other repositories; so - -------------------------------------------------- -$ git fetch git://example.com/proj.git master:example-master -------------------------------------------------- - -will create a new branch named `example-master` and store in it the -branch named `master` from the repository at the given URL. If you -already have a branch named example-master, it will attempt to -<<fast-forwards,fast-forward>> to the commit given by example.com's -master branch. In more detail: - -[[fetch-fast-forwards]] -=== git fetch and fast-forwards - -In the previous example, when updating an existing branch, `git fetch` -checks to make sure that the most recent commit on the remote -branch is a descendant of the most recent commit on your copy of the -branch before updating your copy of the branch to point at the new -commit. Git calls this process a <<fast-forwards,fast-forward>>. - -A fast-forward looks something like this: - -................................................ - o--o--o--o <-- old head of the branch - \ - o--o--o <-- new head of the branch -................................................ - - -In some cases it is possible that the new head will *not* actually be -a descendant of the old head. For example, the developer may have -realized she made a serious mistake, and decided to backtrack, -resulting in a situation like: - -................................................ - o--o--o--o--a--b <-- old head of the branch - \ - o--o--o <-- new head of the branch -................................................ - -In this case, `git fetch` will fail, and print out a warning. - -In that case, you can still force Git to update to the new head, as -described in the following section. However, note that in the -situation above this may mean losing the commits labeled `a` and `b`, -unless you've already created a reference of your own pointing to -them. - -[[forcing-fetch]] -=== Forcing git fetch to do non-fast-forward updates - -If git fetch fails because the new head of a branch is not a -descendant of the old head, you may force the update with: - -------------------------------------------------- -$ git fetch git://example.com/proj.git +master:refs/remotes/example/master -------------------------------------------------- - -Note the addition of the `+` sign. Alternatively, you can use the `-f` -flag to force updates of all the fetched branches, as in: - -------------------------------------------------- -$ git fetch -f origin -------------------------------------------------- - -Be aware that commits that the old version of example/master pointed at -may be lost, as we saw in the previous section. - -[[remote-branch-configuration]] -=== Configuring remote-tracking branches - -We saw above that `origin` is just a shortcut to refer to the -repository that you originally cloned from. This information is -stored in Git configuration variables, which you can see using -linkgit:git-config[1]: - -------------------------------------------------- -$ git config -l -core.repositoryformatversion=0 -core.filemode=true -core.logallrefupdates=true -remote.origin.url=git://git.kernel.org/pub/scm/git/git.git -remote.origin.fetch=+refs/heads/*:refs/remotes/origin/* -branch.master.remote=origin -branch.master.merge=refs/heads/master -------------------------------------------------- - -If there are other repositories that you also use frequently, you can -create similar configuration options to save typing; for example, - -------------------------------------------------- -$ git remote add example git://example.com/proj.git -------------------------------------------------- - -adds the following to `.git/config`: - -------------------------------------------------- -[remote "example"] - url = git://example.com/proj.git - fetch = +refs/heads/*:refs/remotes/example/* -------------------------------------------------- - -Also note that the above configuration can be performed by directly -editing the file `.git/config` instead of using linkgit:git-remote[1]. - -After configuring the remote, the following three commands will do the -same thing: - -------------------------------------------------- -$ git fetch git://example.com/proj.git +refs/heads/*:refs/remotes/example/* -$ git fetch example +refs/heads/*:refs/remotes/example/* -$ git fetch example -------------------------------------------------- - -See linkgit:git-config[1] for more details on the configuration -options mentioned above and linkgit:git-fetch[1] for more details on -the refspec syntax. - - -[[git-concepts]] -== Git concepts - -Git is built on a small number of simple but powerful ideas. While it -is possible to get things done without understanding them, you will find -Git much more intuitive if you do. - -We start with the most important, the <<def_object_database,object -database>> and the <<def_index,index>>. - -[[the-object-database]] -=== The Object Database - - -We already saw in <<understanding-commits>> that all commits are stored -under a 40-digit "object name". In fact, all the information needed to -represent the history of a project is stored in objects with such names. -In each case the name is calculated by taking the SHA-1 hash of the -contents of the object. The SHA-1 hash is a cryptographic hash function. -What that means to us is that it is impossible to find two different -objects with the same name. This has a number of advantages; among -others: - -- Git can quickly determine whether two objects are identical or not, - just by comparing names. -- Since object names are computed the same way in every repository, the - same content stored in two repositories will always be stored under - the same name. -- Git can detect errors when it reads an object, by checking that the - object's name is still the SHA-1 hash of its contents. - -(See <<object-details>> for the details of the object formatting and -SHA-1 calculation.) - -There are four different types of objects: "blob", "tree", "commit", and -"tag". - -- A <<def_blob_object,"blob" object>> is used to store file data. -- A <<def_tree_object,"tree" object>> ties one or more - "blob" objects into a directory structure. In addition, a tree object - can refer to other tree objects, thus creating a directory hierarchy. -- A <<def_commit_object,"commit" object>> ties such directory hierarchies - together into a <<def_DAG,directed acyclic graph>> of revisions--each - commit contains the object name of exactly one tree designating the - directory hierarchy at the time of the commit. In addition, a commit - refers to "parent" commit objects that describe the history of how we - arrived at that directory hierarchy. -- A <<def_tag_object,"tag" object>> symbolically identifies and can be - used to sign other objects. It contains the object name and type of - another object, a symbolic name (of course!) and, optionally, a - signature. - -The object types in some more detail: - -[[commit-object]] -==== Commit Object - -The "commit" object links a physical state of a tree with a description -of how we got there and why. Use the `--pretty=raw` option to -linkgit:git-show[1] or linkgit:git-log[1] to examine your favorite -commit: - ------------------------------------------------- -$ git show -s --pretty=raw 2be7fcb476 -commit 2be7fcb4764f2dbcee52635b91fedb1b3dcf7ab4 -tree fb3a8bdd0ceddd019615af4d57a53f43d8cee2bf -parent 257a84d9d02e90447b149af58b271c19405edb6a -author Dave Watson <dwatson@mimvista.com> 1187576872 -0400 -committer Junio C Hamano <gitster@pobox.com> 1187591163 -0700 - - Fix misspelling of 'suppress' in docs - - Signed-off-by: Junio C Hamano <gitster@pobox.com> ------------------------------------------------- - -As you can see, a commit is defined by: - -- a tree: The SHA-1 name of a tree object (as defined below), representing - the contents of a directory at a certain point in time. -- parent(s): The SHA-1 name(s) of some number of commits which represent the - immediately previous step(s) in the history of the project. The - example above has one parent; merge commits may have more than - one. A commit with no parents is called a "root" commit, and - represents the initial revision of a project. Each project must have - at least one root. A project can also have multiple roots, though - that isn't common (or necessarily a good idea). -- an author: The name of the person responsible for this change, together - with its date. -- a committer: The name of the person who actually created the commit, - with the date it was done. This may be different from the author, for - example, if the author was someone who wrote a patch and emailed it - to the person who used it to create the commit. -- a comment describing this commit. - -Note that a commit does not itself contain any information about what -actually changed; all changes are calculated by comparing the contents -of the tree referred to by this commit with the trees associated with -its parents. In particular, Git does not attempt to record file renames -explicitly, though it can identify cases where the existence of the same -file data at changing paths suggests a rename. (See, for example, the -`-M` option to linkgit:git-diff[1]). - -A commit is usually created by linkgit:git-commit[1], which creates a -commit whose parent is normally the current HEAD, and whose tree is -taken from the content currently stored in the index. - -[[tree-object]] -==== Tree Object - -The ever-versatile linkgit:git-show[1] command can also be used to -examine tree objects, but linkgit:git-ls-tree[1] will give you more -details: - ------------------------------------------------- -$ git ls-tree fb3a8bdd0ce -100644 blob 63c918c667fa005ff12ad89437f2fdc80926e21c .gitignore -100644 blob 5529b198e8d14decbe4ad99db3f7fb632de0439d .mailmap -100644 blob 6ff87c4664981e4397625791c8ea3bbb5f2279a3 COPYING -040000 tree 2fb783e477100ce076f6bf57e4a6f026013dc745 Documentation -100755 blob 3c0032cec592a765692234f1cba47dfdcc3a9200 GIT-VERSION-GEN -100644 blob 289b046a443c0647624607d471289b2c7dcd470b INSTALL -100644 blob 4eb463797adc693dc168b926b6932ff53f17d0b1 Makefile -100644 blob 548142c327a6790ff8821d67c2ee1eff7a656b52 README -... ------------------------------------------------- - -As you can see, a tree object contains a list of entries, each with a -mode, object type, SHA-1 name, and name, sorted by name. It represents -the contents of a single directory tree. - -The object type may be a blob, representing the contents of a file, or -another tree, representing the contents of a subdirectory. Since trees -and blobs, like all other objects, are named by the SHA-1 hash of their -contents, two trees have the same SHA-1 name if and only if their -contents (including, recursively, the contents of all subdirectories) -are identical. This allows Git to quickly determine the differences -between two related tree objects, since it can ignore any entries with -identical object names. - -(Note: in the presence of submodules, trees may also have commits as -entries. See <<submodules>> for documentation.) - -Note that the files all have mode 644 or 755: Git actually only pays -attention to the executable bit. - -[[blob-object]] -==== Blob Object - -You can use linkgit:git-show[1] to examine the contents of a blob; take, -for example, the blob in the entry for `COPYING` from the tree above: - ------------------------------------------------- -$ git show 6ff87c4664 - - Note that the only valid version of the GPL as far as this project - is concerned is _this_ particular version of the license (ie v2, not - v2.2 or v3.x or whatever), unless explicitly otherwise stated. -... ------------------------------------------------- - -A "blob" object is nothing but a binary blob of data. It doesn't refer -to anything else or have attributes of any kind. - -Since the blob is entirely defined by its data, if two files in a -directory tree (or in multiple different versions of the repository) -have the same contents, they will share the same blob object. The object -is totally independent of its location in the directory tree, and -renaming a file does not change the object that file is associated with. - -Note that any tree or blob object can be examined using -linkgit:git-show[1] with the <revision>:<path> syntax. This can -sometimes be useful for browsing the contents of a tree that is not -currently checked out. - -[[trust]] -==== Trust - -If you receive the SHA-1 name of a blob from one source, and its contents -from another (possibly untrusted) source, you can still trust that those -contents are correct as long as the SHA-1 name agrees. This is because -the SHA-1 is designed so that it is infeasible to find different contents -that produce the same hash. - -Similarly, you need only trust the SHA-1 name of a top-level tree object -to trust the contents of the entire directory that it refers to, and if -you receive the SHA-1 name of a commit from a trusted source, then you -can easily verify the entire history of commits reachable through -parents of that commit, and all of those contents of the trees referred -to by those commits. - -So to introduce some real trust in the system, the only thing you need -to do is to digitally sign just 'one' special note, which includes the -name of a top-level commit. Your digital signature shows others -that you trust that commit, and the immutability of the history of -commits tells others that they can trust the whole history. - -In other words, you can easily validate a whole archive by just -sending out a single email that tells the people the name (SHA-1 hash) -of the top commit, and digitally sign that email using something -like GPG/PGP. - -To assist in this, Git also provides the tag object... - -[[tag-object]] -==== Tag Object - -A tag object contains an object, object type, tag name, the name of the -person ("tagger") who created the tag, and a message, which may contain -a signature, as can be seen using linkgit:git-cat-file[1]: - ------------------------------------------------- -$ git cat-file tag v1.5.0 -object 437b1b20df4b356c9342dac8d38849f24ef44f27 -type commit -tag v1.5.0 -tagger Junio C Hamano <junkio@cox.net> 1171411200 +0000 - -GIT 1.5.0 ------BEGIN PGP SIGNATURE----- -Version: GnuPG v1.4.6 (GNU/Linux) - -iD8DBQBF0lGqwMbZpPMRm5oRAuRiAJ9ohBLd7s2kqjkKlq1qqC57SbnmzQCdG4ui -nLE/L9aUXdWeTFPron96DLA= -=2E+0 ------END PGP SIGNATURE----- ------------------------------------------------- - -See the linkgit:git-tag[1] command to learn how to create and verify tag -objects. (Note that linkgit:git-tag[1] can also be used to create -"lightweight tags", which are not tag objects at all, but just simple -references whose names begin with `refs/tags/`). - -[[pack-files]] -==== How Git stores objects efficiently: pack files - -Newly created objects are initially created in a file named after the -object's SHA-1 hash (stored in `.git/objects`). - -Unfortunately this system becomes inefficient once a project has a -lot of objects. Try this on an old project: - ------------------------------------------------- -$ git count-objects -6930 objects, 47620 kilobytes ------------------------------------------------- - -The first number is the number of objects which are kept in -individual files. The second is the amount of space taken up by -those "loose" objects. - -You can save space and make Git faster by moving these loose objects in -to a "pack file", which stores a group of objects in an efficient -compressed format; the details of how pack files are formatted can be -found in link:technical/pack-format.html[pack format]. - -To put the loose objects into a pack, just run git repack: - ------------------------------------------------- -$ git repack -Counting objects: 6020, done. -Delta compression using up to 4 threads. -Compressing objects: 100% (6020/6020), done. -Writing objects: 100% (6020/6020), done. -Total 6020 (delta 4070), reused 0 (delta 0) ------------------------------------------------- - -This creates a single "pack file" in .git/objects/pack/ -containing all currently unpacked objects. You can then run - ------------------------------------------------- -$ git prune ------------------------------------------------- - -to remove any of the "loose" objects that are now contained in the -pack. This will also remove any unreferenced objects (which may be -created when, for example, you use `git reset` to remove a commit). -You can verify that the loose objects are gone by looking at the -`.git/objects` directory or by running - ------------------------------------------------- -$ git count-objects -0 objects, 0 kilobytes ------------------------------------------------- - -Although the object files are gone, any commands that refer to those -objects will work exactly as they did before. - -The linkgit:git-gc[1] command performs packing, pruning, and more for -you, so is normally the only high-level command you need. - -[[dangling-objects]] -==== Dangling objects - -The linkgit:git-fsck[1] command will sometimes complain about dangling -objects. They are not a problem. - -The most common cause of dangling objects is that you've rebased a -branch, or you have pulled from somebody else who rebased a branch--see -<<cleaning-up-history>>. In that case, the old head of the original -branch still exists, as does everything it pointed to. The branch -pointer itself just doesn't, since you replaced it with another one. - -There are also other situations that cause dangling objects. For -example, a "dangling blob" may arise because you did a `git add` of a -file, but then, before you actually committed it and made it part of the -bigger picture, you changed something else in that file and committed -that *updated* thing--the old state that you added originally ends up -not being pointed to by any commit or tree, so it's now a dangling blob -object. - -Similarly, when the "recursive" merge strategy runs, and finds that -there are criss-cross merges and thus more than one merge base (which is -fairly unusual, but it does happen), it will generate one temporary -midway tree (or possibly even more, if you had lots of criss-crossing -merges and more than two merge bases) as a temporary internal merge -base, and again, those are real objects, but the end result will not end -up pointing to them, so they end up "dangling" in your repository. - -Generally, dangling objects aren't anything to worry about. They can -even be very useful: if you screw something up, the dangling objects can -be how you recover your old tree (say, you did a rebase, and realized -that you really didn't want to--you can look at what dangling objects -you have, and decide to reset your head to some old dangling state). - -For commits, you can just use: - ------------------------------------------------- -$ gitk <dangling-commit-sha-goes-here> --not --all ------------------------------------------------- - -This asks for all the history reachable from the given commit but not -from any branch, tag, or other reference. If you decide it's something -you want, you can always create a new reference to it, e.g., - ------------------------------------------------- -$ git branch recovered-branch <dangling-commit-sha-goes-here> ------------------------------------------------- - -For blobs and trees, you can't do the same, but you can still examine -them. You can just do - ------------------------------------------------- -$ git show <dangling-blob/tree-sha-goes-here> ------------------------------------------------- - -to show what the contents of the blob were (or, for a tree, basically -what the `ls` for that directory was), and that may give you some idea -of what the operation was that left that dangling object. - -Usually, dangling blobs and trees aren't very interesting. They're -almost always the result of either being a half-way mergebase (the blob -will often even have the conflict markers from a merge in it, if you -have had conflicting merges that you fixed up by hand), or simply -because you interrupted a `git fetch` with ^C or something like that, -leaving _some_ of the new objects in the object database, but just -dangling and useless. - -Anyway, once you are sure that you're not interested in any dangling -state, you can just prune all unreachable objects: - ------------------------------------------------- -$ git prune ------------------------------------------------- - -and they'll be gone. (You should only run `git prune` on a quiescent -repository--it's kind of like doing a filesystem fsck recovery: you -don't want to do that while the filesystem is mounted. -`git prune` is designed not to cause any harm in such cases of concurrent -accesses to a repository but you might receive confusing or scary messages.) - -[[recovering-from-repository-corruption]] -==== Recovering from repository corruption - -By design, Git treats data trusted to it with caution. However, even in -the absence of bugs in Git itself, it is still possible that hardware or -operating system errors could corrupt data. - -The first defense against such problems is backups. You can back up a -Git directory using clone, or just using cp, tar, or any other backup -mechanism. - -As a last resort, you can search for the corrupted objects and attempt -to replace them by hand. Back up your repository before attempting this -in case you corrupt things even more in the process. - -We'll assume that the problem is a single missing or corrupted blob, -which is sometimes a solvable problem. (Recovering missing trees and -especially commits is *much* harder). - -Before starting, verify that there is corruption, and figure out where -it is with linkgit:git-fsck[1]; this may be time-consuming. - -Assume the output looks like this: - ------------------------------------------------- -$ git fsck --full --no-dangling -broken link from tree 2d9263c6d23595e7cb2a21e5ebbb53655278dff8 - to blob 4b9458b3786228369c63936db65827de3cc06200 -missing blob 4b9458b3786228369c63936db65827de3cc06200 ------------------------------------------------- - -Now you know that blob 4b9458b3 is missing, and that the tree 2d9263c6 -points to it. If you could find just one copy of that missing blob -object, possibly in some other repository, you could move it into -`.git/objects/4b/9458b3...` and be done. Suppose you can't. You can -still examine the tree that pointed to it with linkgit:git-ls-tree[1], -which might output something like: - ------------------------------------------------- -$ git ls-tree 2d9263c6d23595e7cb2a21e5ebbb53655278dff8 -100644 blob 8d14531846b95bfa3564b58ccfb7913a034323b8 .gitignore -100644 blob ebf9bf84da0aab5ed944264a5db2a65fe3a3e883 .mailmap -100644 blob ca442d313d86dc67e0a2e5d584b465bd382cbf5c COPYING -... -100644 blob 4b9458b3786228369c63936db65827de3cc06200 myfile -... ------------------------------------------------- - -So now you know that the missing blob was the data for a file named -`myfile`. And chances are you can also identify the directory--let's -say it's in `somedirectory`. If you're lucky the missing copy might be -the same as the copy you have checked out in your working tree at -`somedirectory/myfile`; you can test whether that's right with -linkgit:git-hash-object[1]: - ------------------------------------------------- -$ git hash-object -w somedirectory/myfile ------------------------------------------------- - -which will create and store a blob object with the contents of -somedirectory/myfile, and output the SHA-1 of that object. if you're -extremely lucky it might be 4b9458b3786228369c63936db65827de3cc06200, in -which case you've guessed right, and the corruption is fixed! - -Otherwise, you need more information. How do you tell which version of -the file has been lost? - -The easiest way to do this is with: - ------------------------------------------------- -$ git log --raw --all --full-history -- somedirectory/myfile ------------------------------------------------- - -Because you're asking for raw output, you'll now get something like - ------------------------------------------------- -commit abc -Author: -Date: -... -:100644 100644 4b9458b newsha M somedirectory/myfile - - -commit xyz -Author: -Date: - -... -:100644 100644 oldsha 4b9458b M somedirectory/myfile ------------------------------------------------- - -This tells you that the immediately following version of the file was -"newsha", and that the immediately preceding version was "oldsha". -You also know the commit messages that went with the change from oldsha -to 4b9458b and with the change from 4b9458b to newsha. - -If you've been committing small enough changes, you may now have a good -shot at reconstructing the contents of the in-between state 4b9458b. - -If you can do that, you can now recreate the missing object with - ------------------------------------------------- -$ git hash-object -w <recreated-file> ------------------------------------------------- - -and your repository is good again! - -(Btw, you could have ignored the `fsck`, and started with doing a - ------------------------------------------------- -$ git log --raw --all ------------------------------------------------- - -and just looked for the sha of the missing object (4b9458b) in that -whole thing. It's up to you--Git does *have* a lot of information, it is -just missing one particular blob version. - -[[the-index]] -=== The index - -The index is a binary file (generally kept in `.git/index`) containing a -sorted list of path names, each with permissions and the SHA-1 of a blob -object; linkgit:git-ls-files[1] can show you the contents of the index: - -------------------------------------------------- -$ git ls-files --stage -100644 63c918c667fa005ff12ad89437f2fdc80926e21c 0 .gitignore -100644 5529b198e8d14decbe4ad99db3f7fb632de0439d 0 .mailmap -100644 6ff87c4664981e4397625791c8ea3bbb5f2279a3 0 COPYING -100644 a37b2152bd26be2c2289e1f57a292534a51a93c7 0 Documentation/.gitignore -100644 fbefe9a45b00a54b58d94d06eca48b03d40a50e0 0 Documentation/Makefile -... -100644 2511aef8d89ab52be5ec6a5e46236b4b6bcd07ea 0 xdiff/xtypes.h -100644 2ade97b2574a9f77e7ae4002a4e07a6a38e46d07 0 xdiff/xutils.c -100644 d5de8292e05e7c36c4b68857c1cf9855e3d2f70a 0 xdiff/xutils.h -------------------------------------------------- - -Note that in older documentation you may see the index called the -"current directory cache" or just the "cache". It has three important -properties: - -1. The index contains all the information necessary to generate a single -(uniquely determined) tree object. -+ -For example, running linkgit:git-commit[1] generates this tree object -from the index, stores it in the object database, and uses it as the -tree object associated with the new commit. - -2. The index enables fast comparisons between the tree object it defines -and the working tree. -+ -It does this by storing some additional data for each entry (such as -the last modified time). This data is not displayed above, and is not -stored in the created tree object, but it can be used to determine -quickly which files in the working directory differ from what was -stored in the index, and thus save Git from having to read all of the -data from such files to look for changes. - -3. It can efficiently represent information about merge conflicts -between different tree objects, allowing each pathname to be -associated with sufficient information about the trees involved that -you can create a three-way merge between them. -+ -We saw in <<conflict-resolution>> that during a merge the index can -store multiple versions of a single file (called "stages"). The third -column in the linkgit:git-ls-files[1] output above is the stage -number, and will take on values other than 0 for files with merge -conflicts. - -The index is thus a sort of temporary staging area, which is filled with -a tree which you are in the process of working on. - -If you blow the index away entirely, you generally haven't lost any -information as long as you have the name of the tree that it described. - -[[submodules]] -== Submodules - -Large projects are often composed of smaller, self-contained modules. For -example, an embedded Linux distribution's source tree would include every -piece of software in the distribution with some local modifications; a movie -player might need to build against a specific, known-working version of a -decompression library; several independent programs might all share the same -build scripts. - -With centralized revision control systems this is often accomplished by -including every module in one single repository. Developers can check out -all modules or only the modules they need to work with. They can even modify -files across several modules in a single commit while moving things around -or updating APIs and translations. - -Git does not allow partial checkouts, so duplicating this approach in Git -would force developers to keep a local copy of modules they are not -interested in touching. Commits in an enormous checkout would be slower -than you'd expect as Git would have to scan every directory for changes. -If modules have a lot of local history, clones would take forever. - -On the plus side, distributed revision control systems can much better -integrate with external sources. In a centralized model, a single arbitrary -snapshot of the external project is exported from its own revision control -and then imported into the local revision control on a vendor branch. All -the history is hidden. With distributed revision control you can clone the -entire external history and much more easily follow development and re-merge -local changes. - -Git's submodule support allows a repository to contain, as a subdirectory, a -checkout of an external project. Submodules maintain their own identity; -the submodule support just stores the submodule repository location and -commit ID, so other developers who clone the containing project -("superproject") can easily clone all the submodules at the same revision. -Partial checkouts of the superproject are possible: you can tell Git to -clone none, some or all of the submodules. - -The linkgit:git-submodule[1] command is available since Git 1.5.3. Users -with Git 1.5.2 can look up the submodule commits in the repository and -manually check them out; earlier versions won't recognize the submodules at -all. - -To see how submodule support works, create four example -repositories that can be used later as a submodule: - -------------------------------------------------- -$ mkdir ~/git -$ cd ~/git -$ for i in a b c d -do - mkdir $i - cd $i - git init - echo "module $i" > $i.txt - git add $i.txt - git commit -m "Initial commit, submodule $i" - cd .. -done -------------------------------------------------- - -Now create the superproject and add all the submodules: - -------------------------------------------------- -$ mkdir super -$ cd super -$ git init -$ for i in a b c d -do - git submodule add ~/git/$i $i -done -------------------------------------------------- - -NOTE: Do not use local URLs here if you plan to publish your superproject! - -See what files `git submodule` created: - -------------------------------------------------- -$ ls -a -. .. .git .gitmodules a b c d -------------------------------------------------- - -The `git submodule add <repo> <path>` command does a couple of things: - -- It clones the submodule from `<repo>` to the given `<path>` under the - current directory and by default checks out the master branch. -- It adds the submodule's clone path to the linkgit:gitmodules[5] file and - adds this file to the index, ready to be committed. -- It adds the submodule's current commit ID to the index, ready to be - committed. - -Commit the superproject: - -------------------------------------------------- -$ git commit -m "Add submodules a, b, c and d." -------------------------------------------------- - -Now clone the superproject: - -------------------------------------------------- -$ cd .. -$ git clone super cloned -$ cd cloned -------------------------------------------------- - -The submodule directories are there, but they're empty: - -------------------------------------------------- -$ ls -a a -. .. -$ git submodule status --d266b9873ad50488163457f025db7cdd9683d88b a --e81d457da15309b4fef4249aba9b50187999670d b --c1536a972b9affea0f16e0680ba87332dc059146 c --d96249ff5d57de5de093e6baff9e0aafa5276a74 d -------------------------------------------------- - -NOTE: The commit object names shown above would be different for you, but they -should match the HEAD commit object names of your repositories. You can check -it by running `git ls-remote ../a`. - -Pulling down the submodules is a two-step process. First run `git submodule -init` to add the submodule repository URLs to `.git/config`: - -------------------------------------------------- -$ git submodule init -------------------------------------------------- - -Now use `git submodule update` to clone the repositories and check out the -commits specified in the superproject: - -------------------------------------------------- -$ git submodule update -$ cd a -$ ls -a -. .. .git a.txt -------------------------------------------------- - -One major difference between `git submodule update` and `git submodule add` is -that `git submodule update` checks out a specific commit, rather than the tip -of a branch. It's like checking out a tag: the head is detached, so you're not -working on a branch. - -------------------------------------------------- -$ git branch -* (detached from d266b98) - master -------------------------------------------------- - -If you want to make a change within a submodule and you have a detached head, -then you should create or checkout a branch, make your changes, publish the -change within the submodule, and then update the superproject to reference the -new commit: - -------------------------------------------------- -$ git switch master -------------------------------------------------- - -or - -------------------------------------------------- -$ git switch -c fix-up -------------------------------------------------- - -then - -------------------------------------------------- -$ echo "adding a line again" >> a.txt -$ git commit -a -m "Updated the submodule from within the superproject." -$ git push -$ cd .. -$ git diff -diff --git a/a b/a -index d266b98..261dfac 160000 ---- a/a -+++ b/a -@@ -1 +1 @@ --Subproject commit d266b9873ad50488163457f025db7cdd9683d88b -+Subproject commit 261dfac35cb99d380eb966e102c1197139f7fa24 -$ git add a -$ git commit -m "Updated submodule a." -$ git push -------------------------------------------------- - -You have to run `git submodule update` after `git pull` if you want to update -submodules, too. - -[[pitfalls-with-submodules]] -=== Pitfalls with submodules - -Always publish the submodule change before publishing the change to the -superproject that references it. If you forget to publish the submodule change, -others won't be able to clone the repository: - -------------------------------------------------- -$ cd ~/git/super/a -$ echo i added another line to this file >> a.txt -$ git commit -a -m "doing it wrong this time" -$ cd .. -$ git add a -$ git commit -m "Updated submodule a again." -$ git push -$ cd ~/git/cloned -$ git pull -$ git submodule update -error: pathspec '261dfac35cb99d380eb966e102c1197139f7fa24' did not match any file(s) known to git. -Did you forget to 'git add'? -Unable to checkout '261dfac35cb99d380eb966e102c1197139f7fa24' in submodule path 'a' -------------------------------------------------- - -In older Git versions it could be easily forgotten to commit new or modified -files in a submodule, which silently leads to similar problems as not pushing -the submodule changes. Starting with Git 1.7.0 both `git status` and `git diff` -in the superproject show submodules as modified when they contain new or -modified files to protect against accidentally committing such a state. `git -diff` will also add a `-dirty` to the work tree side when generating patch -output or used with the `--submodule` option: - -------------------------------------------------- -$ git diff -diff --git a/sub b/sub ---- a/sub -+++ b/sub -@@ -1 +1 @@ --Subproject commit 3f356705649b5d566d97ff843cf193359229a453 -+Subproject commit 3f356705649b5d566d97ff843cf193359229a453-dirty -$ git diff --submodule -Submodule sub 3f35670..3f35670-dirty: -------------------------------------------------- - -You also should not rewind branches in a submodule beyond commits that were -ever recorded in any superproject. - -It's not safe to run `git submodule update` if you've made and committed -changes within a submodule without checking out a branch first. They will be -silently overwritten: - -------------------------------------------------- -$ cat a.txt -module a -$ echo line added from private2 >> a.txt -$ git commit -a -m "line added inside private2" -$ cd .. -$ git submodule update -Submodule path 'a': checked out 'd266b9873ad50488163457f025db7cdd9683d88b' -$ cd a -$ cat a.txt -module a -------------------------------------------------- - -NOTE: The changes are still visible in the submodule's reflog. - -If you have uncommitted changes in your submodule working tree, `git -submodule update` will not overwrite them. Instead, you get the usual -warning about not being able switch from a dirty branch. - -[[low-level-operations]] -== Low-level Git operations - -Many of the higher-level commands were originally implemented as shell -scripts using a smaller core of low-level Git commands. These can still -be useful when doing unusual things with Git, or just as a way to -understand its inner workings. - -[[object-manipulation]] -=== Object access and manipulation - -The linkgit:git-cat-file[1] command can show the contents of any object, -though the higher-level linkgit:git-show[1] is usually more useful. - -The linkgit:git-commit-tree[1] command allows constructing commits with -arbitrary parents and trees. - -A tree can be created with linkgit:git-write-tree[1] and its data can be -accessed by linkgit:git-ls-tree[1]. Two trees can be compared with -linkgit:git-diff-tree[1]. - -A tag is created with linkgit:git-mktag[1], and the signature can be -verified by linkgit:git-verify-tag[1], though it is normally simpler to -use linkgit:git-tag[1] for both. - -[[the-workflow]] -=== The Workflow - -High-level operations such as linkgit:git-commit[1] and -linkgit:git-restore[1] work by moving data -between the working tree, the index, and the object database. Git -provides low-level operations which perform each of these steps -individually. - -Generally, all Git operations work on the index file. Some operations -work *purely* on the index file (showing the current state of the -index), but most operations move data between the index file and either -the database or the working directory. Thus there are four main -combinations: - -[[working-directory-to-index]] -==== working directory -> index - -The linkgit:git-update-index[1] command updates the index with -information from the working directory. You generally update the -index information by just specifying the filename you want to update, -like so: - -------------------------------------------------- -$ git update-index filename -------------------------------------------------- - -but to avoid common mistakes with filename globbing etc., the command -will not normally add totally new entries or remove old entries, -i.e. it will normally just update existing cache entries. - -To tell Git that yes, you really do realize that certain files no -longer exist, or that new files should be added, you -should use the `--remove` and `--add` flags respectively. - -NOTE! A `--remove` flag does 'not' mean that subsequent filenames will -necessarily be removed: if the files still exist in your directory -structure, the index will be updated with their new status, not -removed. The only thing `--remove` means is that update-index will be -considering a removed file to be a valid thing, and if the file really -does not exist any more, it will update the index accordingly. - -As a special case, you can also do `git update-index --refresh`, which -will refresh the "stat" information of each index to match the current -stat information. It will 'not' update the object status itself, and -it will only update the fields that are used to quickly test whether -an object still matches its old backing store object. - -The previously introduced linkgit:git-add[1] is just a wrapper for -linkgit:git-update-index[1]. - -[[index-to-object-database]] -==== index -> object database - -You write your current index file to a "tree" object with the program - -------------------------------------------------- -$ git write-tree -------------------------------------------------- - -that doesn't come with any options--it will just write out the -current index into the set of tree objects that describe that state, -and it will return the name of the resulting top-level tree. You can -use that tree to re-generate the index at any time by going in the -other direction: - -[[object-database-to-index]] -==== object database -> index - -You read a "tree" file from the object database, and use that to -populate (and overwrite--don't do this if your index contains any -unsaved state that you might want to restore later!) your current -index. Normal operation is just - -------------------------------------------------- -$ git read-tree <SHA-1 of tree> -------------------------------------------------- - -and your index file will now be equivalent to the tree that you saved -earlier. However, that is only your 'index' file: your working -directory contents have not been modified. - -[[index-to-working-directory]] -==== index -> working directory - -You update your working directory from the index by "checking out" -files. This is not a very common operation, since normally you'd just -keep your files updated, and rather than write to your working -directory, you'd tell the index files about the changes in your -working directory (i.e. `git update-index`). - -However, if you decide to jump to a new version, or check out somebody -else's version, or just restore a previous tree, you'd populate your -index file with read-tree, and then you need to check out the result -with - -------------------------------------------------- -$ git checkout-index filename -------------------------------------------------- - -or, if you want to check out all of the index, use `-a`. - -NOTE! `git checkout-index` normally refuses to overwrite old files, so -if you have an old version of the tree already checked out, you will -need to use the `-f` flag ('before' the `-a` flag or the filename) to -'force' the checkout. - - -Finally, there are a few odds and ends which are not purely moving -from one representation to the other: - -[[tying-it-all-together]] -==== Tying it all together - -To commit a tree you have instantiated with `git write-tree`, you'd -create a "commit" object that refers to that tree and the history -behind it--most notably the "parent" commits that preceded it in -history. - -Normally a "commit" has one parent: the previous state of the tree -before a certain change was made. However, sometimes it can have two -or more parent commits, in which case we call it a "merge", due to the -fact that such a commit brings together ("merges") two or more -previous states represented by other commits. - -In other words, while a "tree" represents a particular directory state -of a working directory, a "commit" represents that state in time, -and explains how we got there. - -You create a commit object by giving it the tree that describes the -state at the time of the commit, and a list of parents: - -------------------------------------------------- -$ git commit-tree <tree> -p <parent> [(-p <parent2>)...] -------------------------------------------------- - -and then giving the reason for the commit on stdin (either through -redirection from a pipe or file, or by just typing it at the tty). - -`git commit-tree` will return the name of the object that represents -that commit, and you should save it away for later use. Normally, -you'd commit a new `HEAD` state, and while Git doesn't care where you -save the note about that state, in practice we tend to just write the -result to the file pointed at by `.git/HEAD`, so that we can always see -what the last committed state was. - -Here is a picture that illustrates how various pieces fit together: - ------------- - - commit-tree - commit obj - +----+ - | | - | | - V V - +-----------+ - | Object DB | - | Backing | - | Store | - +-----------+ - ^ - write-tree | | - tree obj | | - | | read-tree - | | tree obj - V - +-----------+ - | Index | - | "cache" | - +-----------+ - update-index ^ - blob obj | | - | | - checkout-index -u | | checkout-index - stat | | blob obj - V - +-----------+ - | Working | - | Directory | - +-----------+ - ------------- - - -[[examining-the-data]] -=== Examining the data - -You can examine the data represented in the object database and the -index with various helper tools. For every object, you can use -linkgit:git-cat-file[1] to examine details about the -object: - -------------------------------------------------- -$ git cat-file -t <objectname> -------------------------------------------------- - -shows the type of the object, and once you have the type (which is -usually implicit in where you find the object), you can use - -------------------------------------------------- -$ git cat-file blob|tree|commit|tag <objectname> -------------------------------------------------- - -to show its contents. NOTE! Trees have binary content, and as a result -there is a special helper for showing that content, called -`git ls-tree`, which turns the binary content into a more easily -readable form. - -It's especially instructive to look at "commit" objects, since those -tend to be small and fairly self-explanatory. In particular, if you -follow the convention of having the top commit name in `.git/HEAD`, -you can do - -------------------------------------------------- -$ git cat-file commit HEAD -------------------------------------------------- - -to see what the top commit was. - -[[merging-multiple-trees]] -=== Merging multiple trees - -Git can help you perform a three-way merge, which can in turn be -used for a many-way merge by repeating the merge procedure several -times. The usual situation is that you only do one three-way merge -(reconciling two lines of history) and commit the result, but if -you like to, you can merge several branches in one go. - -To perform a three-way merge, you start with the two commits you -want to merge, find their closest common parent (a third commit), -and compare the trees corresponding to these three commits. - -To get the "base" for the merge, look up the common parent of two -commits: - -------------------------------------------------- -$ git merge-base <commit1> <commit2> -------------------------------------------------- - -This prints the name of a commit they are both based on. You should -now look up the tree objects of those commits, which you can easily -do with - -------------------------------------------------- -$ git cat-file commit <commitname> | head -1 -------------------------------------------------- - -since the tree object information is always the first line in a commit -object. - -Once you know the three trees you are going to merge (the one "original" -tree, aka the common tree, and the two "result" trees, aka the branches -you want to merge), you do a "merge" read into the index. This will -complain if it has to throw away your old index contents, so you should -make sure that you've committed those--in fact you would normally -always do a merge against your last commit (which should thus match what -you have in your current index anyway). - -To do the merge, do - -------------------------------------------------- -$ git read-tree -m -u <origtree> <yourtree> <targettree> -------------------------------------------------- - -which will do all trivial merge operations for you directly in the -index file, and you can just write the result out with -`git write-tree`. - - -[[merging-multiple-trees-2]] -=== Merging multiple trees, continued - -Sadly, many merges aren't trivial. If there are files that have -been added, moved or removed, or if both branches have modified the -same file, you will be left with an index tree that contains "merge -entries" in it. Such an index tree can 'NOT' be written out to a tree -object, and you will have to resolve any such merge clashes using -other tools before you can write out the result. - -You can examine such index state with `git ls-files --unmerged` -command. An example: - ------------------------------------------------- -$ git read-tree -m $orig HEAD $target -$ git ls-files --unmerged -100644 263414f423d0e4d70dae8fe53fa34614ff3e2860 1 hello.c -100644 06fa6a24256dc7e560efa5687fa84b51f0263c3a 2 hello.c -100644 cc44c73eb783565da5831b4d820c962954019b69 3 hello.c ------------------------------------------------- - -Each line of the `git ls-files --unmerged` output begins with -the blob mode bits, blob SHA-1, 'stage number', and the -filename. The 'stage number' is Git's way to say which tree it -came from: stage 1 corresponds to the `$orig` tree, stage 2 to -the `HEAD` tree, and stage 3 to the `$target` tree. - -Earlier we said that trivial merges are done inside -`git read-tree -m`. For example, if the file did not change -from `$orig` to `HEAD` or `$target`, or if the file changed -from `$orig` to `HEAD` and `$orig` to `$target` the same way, -obviously the final outcome is what is in `HEAD`. What the -above example shows is that file `hello.c` was changed from -`$orig` to `HEAD` and `$orig` to `$target` in a different way. -You could resolve this by running your favorite 3-way merge -program, e.g. `diff3`, `merge`, or Git's own merge-file, on -the blob objects from these three stages yourself, like this: - ------------------------------------------------- -$ git cat-file blob 263414f >hello.c~1 -$ git cat-file blob 06fa6a2 >hello.c~2 -$ git cat-file blob cc44c73 >hello.c~3 -$ git merge-file hello.c~2 hello.c~1 hello.c~3 ------------------------------------------------- - -This would leave the merge result in `hello.c~2` file, along -with conflict markers if there are conflicts. After verifying -the merge result makes sense, you can tell Git what the final -merge result for this file is by: - -------------------------------------------------- -$ mv -f hello.c~2 hello.c -$ git update-index hello.c -------------------------------------------------- - -When a path is in the "unmerged" state, running `git update-index` for -that path tells Git to mark the path resolved. - -The above is the description of a Git merge at the lowest level, -to help you understand what conceptually happens under the hood. -In practice, nobody, not even Git itself, runs `git cat-file` three times -for this. There is a `git merge-index` program that extracts the -stages to temporary files and calls a "merge" script on it: - -------------------------------------------------- -$ git merge-index git-merge-one-file hello.c -------------------------------------------------- - -and that is what higher level `git merge -s resolve` is implemented with. - -[[hacking-git]] -== Hacking Git - -This chapter covers internal details of the Git implementation which -probably only Git developers need to understand. - -[[object-details]] -=== Object storage format - -All objects have a statically determined "type" which identifies the -format of the object (i.e. how it is used, and how it can refer to other -objects). There are currently four different object types: "blob", -"tree", "commit", and "tag". - -Regardless of object type, all objects share the following -characteristics: they are all deflated with zlib, and have a header -that not only specifies their type, but also provides size information -about the data in the object. It's worth noting that the SHA-1 hash -that is used to name the object is the hash of the original data -plus this header, so `sha1sum` 'file' does not match the object name -for 'file'. - -As a result, the general consistency of an object can always be tested -independently of the contents or the type of the object: all objects can -be validated by verifying that (a) their hashes match the content of the -file and (b) the object successfully inflates to a stream of bytes that -forms a sequence of -`<ascii type without space> + <space> + <ascii decimal size> + -<byte\0> + <binary object data>`. - -The structured objects can further have their structure and -connectivity to other objects verified. This is generally done with -the `git fsck` program, which generates a full dependency graph -of all objects, and verifies their internal consistency (in addition -to just verifying their superficial consistency through the hash). - -[[birdview-on-the-source-code]] -=== A birds-eye view of Git's source code - -It is not always easy for new developers to find their way through Git's -source code. This section gives you a little guidance to show where to -start. - -A good place to start is with the contents of the initial commit, with: - ----------------------------------------------------- -$ git switch --detach e83c5163 ----------------------------------------------------- - -The initial revision lays the foundation for almost everything Git has -today, but is small enough to read in one sitting. - -Note that terminology has changed since that revision. For example, the -README in that revision uses the word "changeset" to describe what we -now call a <<def_commit_object,commit>>. - -Also, we do not call it "cache" any more, but rather "index"; however, the -file is still called `cache.h`. Remark: Not much reason to change it now, -especially since there is no good single name for it anyway, because it is -basically _the_ header file which is included by _all_ of Git's C sources. - -If you grasp the ideas in that initial commit, you should check out a -more recent version and skim `cache.h`, `object.h` and `commit.h`. - -In the early days, Git (in the tradition of UNIX) was a bunch of programs -which were extremely simple, and which you used in scripts, piping the -output of one into another. This turned out to be good for initial -development, since it was easier to test new things. However, recently -many of these parts have become builtins, and some of the core has been -"libified", i.e. put into libgit.a for performance, portability reasons, -and to avoid code duplication. - -By now, you know what the index is (and find the corresponding data -structures in `cache.h`), and that there are just a couple of object types -(blobs, trees, commits and tags) which inherit their common structure from -`struct object`, which is their first member (and thus, you can cast e.g. -`(struct object *)commit` to achieve the _same_ as `&commit->object`, i.e. -get at the object name and flags). - -Now is a good point to take a break to let this information sink in. - -Next step: get familiar with the object naming. Read <<naming-commits>>. -There are quite a few ways to name an object (and not only revisions!). -All of these are handled in `sha1_name.c`. Just have a quick look at -the function `get_sha1()`. A lot of the special handling is done by -functions like `get_sha1_basic()` or the likes. - -This is just to get you into the groove for the most libified part of Git: -the revision walker. - -Basically, the initial version of `git log` was a shell script: - ----------------------------------------------------------------- -$ git-rev-list --pretty $(git-rev-parse --default HEAD "$@") | \ - LESS=-S ${PAGER:-less} ----------------------------------------------------------------- - -What does this mean? - -`git rev-list` is the original version of the revision walker, which -_always_ printed a list of revisions to stdout. It is still functional, -and needs to, since most new Git commands start out as scripts using -`git rev-list`. - -`git rev-parse` is not as important any more; it was only used to filter out -options that were relevant for the different plumbing commands that were -called by the script. - -Most of what `git rev-list` did is contained in `revision.c` and -`revision.h`. It wraps the options in a struct named `rev_info`, which -controls how and what revisions are walked, and more. - -The original job of `git rev-parse` is now taken by the function -`setup_revisions()`, which parses the revisions and the common command-line -options for the revision walker. This information is stored in the struct -`rev_info` for later consumption. You can do your own command-line option -parsing after calling `setup_revisions()`. After that, you have to call -`prepare_revision_walk()` for initialization, and then you can get the -commits one by one with the function `get_revision()`. - -If you are interested in more details of the revision walking process, -just have a look at the first implementation of `cmd_log()`; call -`git show v1.3.0~155^2~4` and scroll down to that function (note that you -no longer need to call `setup_pager()` directly). - -Nowadays, `git log` is a builtin, which means that it is _contained_ in the -command `git`. The source side of a builtin is - -- a function called `cmd_<bla>`, typically defined in `builtin/<bla.c>` - (note that older versions of Git used to have it in `builtin-<bla>.c` - instead), and declared in `builtin.h`. - -- an entry in the `commands[]` array in `git.c`, and - -- an entry in `BUILTIN_OBJECTS` in the `Makefile`. - -Sometimes, more than one builtin is contained in one source file. For -example, `cmd_whatchanged()` and `cmd_log()` both reside in `builtin/log.c`, -since they share quite a bit of code. In that case, the commands which are -_not_ named like the `.c` file in which they live have to be listed in -`BUILT_INS` in the `Makefile`. - -`git log` looks more complicated in C than it does in the original script, -but that allows for a much greater flexibility and performance. - -Here again it is a good point to take a pause. - -Lesson three is: study the code. Really, it is the best way to learn about -the organization of Git (after you know the basic concepts). - -So, think about something which you are interested in, say, "how can I -access a blob just knowing the object name of it?". The first step is to -find a Git command with which you can do it. In this example, it is either -`git show` or `git cat-file`. - -For the sake of clarity, let's stay with `git cat-file`, because it - -- is plumbing, and - -- was around even in the initial commit (it literally went only through - some 20 revisions as `cat-file.c`, was renamed to `builtin/cat-file.c` - when made a builtin, and then saw less than 10 versions). - -So, look into `builtin/cat-file.c`, search for `cmd_cat_file()` and look what -it does. - ------------------------------------------------------------------- - git_config(git_default_config); - if (argc != 3) - usage("git cat-file [-t|-s|-e|-p|<type>] <sha1>"); - if (get_sha1(argv[2], sha1)) - die("Not a valid object name %s", argv[2]); ------------------------------------------------------------------- - -Let's skip over the obvious details; the only really interesting part -here is the call to `get_sha1()`. It tries to interpret `argv[2]` as an -object name, and if it refers to an object which is present in the current -repository, it writes the resulting SHA-1 into the variable `sha1`. - -Two things are interesting here: - -- `get_sha1()` returns 0 on _success_. This might surprise some new - Git hackers, but there is a long tradition in UNIX to return different - negative numbers in case of different errors--and 0 on success. - -- the variable `sha1` in the function signature of `get_sha1()` is `unsigned - char *`, but is actually expected to be a pointer to `unsigned - char[20]`. This variable will contain the 160-bit SHA-1 of the given - commit. Note that whenever a SHA-1 is passed as `unsigned char *`, it - is the binary representation, as opposed to the ASCII representation in - hex characters, which is passed as `char *`. - -You will see both of these things throughout the code. - -Now, for the meat: - ------------------------------------------------------------------------------ - case 0: - buf = read_object_with_reference(sha1, argv[1], &size, NULL); ------------------------------------------------------------------------------ - -This is how you read a blob (actually, not only a blob, but any type of -object). To know how the function `read_object_with_reference()` actually -works, find the source code for it (something like `git grep -read_object_with | grep ":[a-z]"` in the Git repository), and read -the source. - -To find out how the result can be used, just read on in `cmd_cat_file()`: - ------------------------------------ - write_or_die(1, buf, size); ------------------------------------ - -Sometimes, you do not know where to look for a feature. In many such cases, -it helps to search through the output of `git log`, and then `git show` the -corresponding commit. - -Example: If you know that there was some test case for `git bundle`, but -do not remember where it was (yes, you _could_ `git grep bundle t/`, but that -does not illustrate the point!): - ------------------------- -$ git log --no-merges t/ ------------------------- - -In the pager (`less`), just search for "bundle", go a few lines back, -and see that it is in commit 18449ab0. Now just copy this object name, -and paste it into the command line - -------------------- -$ git show 18449ab0 -------------------- - -Voila. - -Another example: Find out what to do in order to make some script a -builtin: - -------------------------------------------------- -$ git log --no-merges --diff-filter=A builtin/*.c -------------------------------------------------- - -You see, Git is actually the best tool to find out about the source of Git -itself! - -[[glossary]] -== Git Glossary - -[[git-explained]] -=== Git explained - -include::glossary-content.txt[] - -[[git-quick-start]] -[appendix] -== Git Quick Reference - -This is a quick summary of the major commands; the previous chapters -explain how these work in more detail. - -[[quick-creating-a-new-repository]] -=== Creating a new repository - -From a tarball: - ------------------------------------------------ -$ tar xzf project.tar.gz -$ cd project -$ git init -Initialized empty Git repository in .git/ -$ git add . -$ git commit ------------------------------------------------ - -From a remote repository: - ------------------------------------------------ -$ git clone git://example.com/pub/project.git -$ cd project ------------------------------------------------ - -[[managing-branches]] -=== Managing branches - ------------------------------------------------ -$ git branch # list all local branches in this repo -$ git switch test # switch working directory to branch "test" -$ git branch new # create branch "new" starting at current HEAD -$ git branch -d new # delete branch "new" ------------------------------------------------ - -Instead of basing a new branch on current HEAD (the default), use: - ------------------------------------------------ -$ git branch new test # branch named "test" -$ git branch new v2.6.15 # tag named v2.6.15 -$ git branch new HEAD^ # commit before the most recent -$ git branch new HEAD^^ # commit before that -$ git branch new test~10 # ten commits before tip of branch "test" ------------------------------------------------ - -Create and switch to a new branch at the same time: - ------------------------------------------------ -$ git switch -c new v2.6.15 ------------------------------------------------ - -Update and examine branches from the repository you cloned from: - ------------------------------------------------ -$ git fetch # update -$ git branch -r # list - origin/master - origin/next - ... -$ git switch -c masterwork origin/master ------------------------------------------------ - -Fetch a branch from a different repository, and give it a new -name in your repository: - ------------------------------------------------ -$ git fetch git://example.com/project.git theirbranch:mybranch -$ git fetch git://example.com/project.git v2.6.15:mybranch ------------------------------------------------ - -Keep a list of repositories you work with regularly: - ------------------------------------------------ -$ git remote add example git://example.com/project.git -$ git remote # list remote repositories -example -origin -$ git remote show example # get details -* remote example - URL: git://example.com/project.git - Tracked remote branches - master - next - ... -$ git fetch example # update branches from example -$ git branch -r # list all remote branches ------------------------------------------------ - - -[[exploring-history]] -=== Exploring history - ------------------------------------------------ -$ gitk # visualize and browse history -$ git log # list all commits -$ git log src/ # ...modifying src/ -$ git log v2.6.15..v2.6.16 # ...in v2.6.16, not in v2.6.15 -$ git log master..test # ...in branch test, not in branch master -$ git log test..master # ...in branch master, but not in test -$ git log test...master # ...in one branch, not in both -$ git log -S'foo()' # ...where difference contain "foo()" -$ git log --since="2 weeks ago" -$ git log -p # show patches as well -$ git show # most recent commit -$ git diff v2.6.15..v2.6.16 # diff between two tagged versions -$ git diff v2.6.15..HEAD # diff with current head -$ git grep "foo()" # search working directory for "foo()" -$ git grep v2.6.15 "foo()" # search old tree for "foo()" -$ git show v2.6.15:a.txt # look at old version of a.txt ------------------------------------------------ - -Search for regressions: - ------------------------------------------------ -$ git bisect start -$ git bisect bad # current version is bad -$ git bisect good v2.6.13-rc2 # last known good revision -Bisecting: 675 revisions left to test after this - # test here, then: -$ git bisect good # if this revision is good, or -$ git bisect bad # if this revision is bad. - # repeat until done. ------------------------------------------------ - -[[making-changes]] -=== Making changes - -Make sure Git knows who to blame: - ------------------------------------------------- -$ cat >>~/.gitconfig <<\EOF -[user] - name = Your Name Comes Here - email = you@yourdomain.example.com -EOF ------------------------------------------------- - -Select file contents to include in the next commit, then make the -commit: - ------------------------------------------------ -$ git add a.txt # updated file -$ git add b.txt # new file -$ git rm c.txt # old file -$ git commit ------------------------------------------------ - -Or, prepare and create the commit in one step: - ------------------------------------------------ -$ git commit d.txt # use latest content only of d.txt -$ git commit -a # use latest content of all tracked files ------------------------------------------------ - -[[merging]] -=== Merging - ------------------------------------------------ -$ git merge test # merge branch "test" into the current branch -$ git pull git://example.com/project.git master - # fetch and merge in remote branch -$ git pull . test # equivalent to git merge test ------------------------------------------------ - -[[sharing-your-changes]] -=== Sharing your changes - -Importing or exporting patches: - ------------------------------------------------ -$ git format-patch origin..HEAD # format a patch for each commit - # in HEAD but not in origin -$ git am mbox # import patches from the mailbox "mbox" ------------------------------------------------ - -Fetch a branch in a different Git repository, then merge into the -current branch: - ------------------------------------------------ -$ git pull git://example.com/project.git theirbranch ------------------------------------------------ - -Store the fetched branch into a local branch before merging into the -current branch: - ------------------------------------------------ -$ git pull git://example.com/project.git theirbranch:mybranch ------------------------------------------------ - -After creating commits on a local branch, update the remote -branch with your commits: - ------------------------------------------------ -$ git push ssh://example.com/project.git mybranch:theirbranch ------------------------------------------------ - -When remote and local branch are both named "test": - ------------------------------------------------ -$ git push ssh://example.com/project.git test ------------------------------------------------ - -Shortcut version for a frequently used remote repository: - ------------------------------------------------ -$ git remote add example ssh://example.com/project.git -$ git push example test ------------------------------------------------ - -[[repository-maintenance]] -=== Repository maintenance - -Check for corruption: - ------------------------------------------------ -$ git fsck ------------------------------------------------ - -Recompress, remove unused cruft: - ------------------------------------------------ -$ git gc ------------------------------------------------ - - -[[todo]] -[appendix] -== Notes and todo list for this manual - -[[todo-list]] -=== Todo list - -This is a work in progress. - -The basic requirements: - -- It must be readable in order, from beginning to end, by someone - intelligent with a basic grasp of the UNIX command line, but without - any special knowledge of Git. If necessary, any other prerequisites - should be specifically mentioned as they arise. -- Whenever possible, section headings should clearly describe the task - they explain how to do, in language that requires no more knowledge - than necessary: for example, "importing patches into a project" rather - than "the `git am` command" - -Think about how to create a clear chapter dependency graph that will -allow people to get to important topics without necessarily reading -everything in between. - -Scan `Documentation/` for other stuff left out; in particular: - -- howto's -- some of `technical/`? -- hooks -- list of commands in linkgit:git[1] - -Scan email archives for other stuff left out - -Scan man pages to see if any assume more background than this manual -provides. - -Add more good examples. Entire sections of just cookbook examples -might be a good idea; maybe make an "advanced examples" section a -standard end-of-chapter section? - -Include cross-references to the glossary, where appropriate. - -Add a section on working with other version control systems, including -CVS, Subversion, and just imports of series of release tarballs. - -Write a chapter on using plumbing and writing scripts. - -Alternates, clone -reference, etc. - -More on recovery from repository corruption. See: - https://lore.kernel.org/git/Pine.LNX.4.64.0702272039540.12485@woody.linux-foundation.org/ - https://lore.kernel.org/git/Pine.LNX.4.64.0702141033400.3604@woody.linux-foundation.org/ |