5

我目前正在将一个 cvs 项目导入 git。
导入后,我想重写历史以将现有目录移动到单独的子模块中。

假设我有这样的结构:

file1
file2
file3
dir1
dir2
library

现在我想重写历史,以便目录library始终是一个 git 子模块。说,将指定的目录拆分成自己的子模块/子项目

这是我目前的代码:

文件重写子模块(称为)

cd project
git filter-branch --tree-filter $PWD/../$0-tree-filter --tag-name-filter cat -- --all

文件重写子模块树过滤器

    #!/bin/bash

    函数 gitCommit()
    {
        取消设置 GIT_DIR
        取消设置 GIT_WORK_TREE
        git添加-A
        if [ -n "$(git diff --cached --name-only)" ]
        然后
            # 要提交的东西
            git commit -F $_msg
        菲
    }

    _git_dir=$GIT_DIR
    _git_work_tree=$GIT_WORK_TREE
    取消设置 GIT_DIR
    取消设置 GIT_WORK_TREE
    _dir=$密码

    如果 [ -d "库" ]
    然后
        _msg=$(临时文件)
        git 日志 ${GIT_COMMIT}^! --format="%B" > $_msg
        git rm -r --cached 库
        光盘库
        如果 [ -d ".git" ]
        然后
            gitCommit
        别的
            混帐初始化
            gitCommit
        菲
        光盘..
        导出 GIT_DIR=$_git_dir
        导出 GIT_WORK_TREE=$_git_work_tree
        git 子模块添加 -f ./lib
    菲

    GIT_DIR=$_git_dir
    GIT_WORK_TREE=$_git_work_tree
    

此代码创建 .gitmodules 文件,但不是主存储库中的子模块提交条目(行Subproject commit <sha1-hash>,输出git diff),并且目录library中的文件仍然在主存储库中进行版本控制,而不是在子项目存储库中。

提前感谢您的任何提示

.gitmodules 看起来像这样:

    [子模块“库”]
        路径 = 库
        网址 = ./库
    

4

4 回答 4

2

我解决了我自己的问题,这是解决方案:

git-submodule-split library another_library

脚本git-submodule-split

    #!/bin/bash

    设置-eu

    如果 [ $# -eq 0 ]
    然后
        echo "用法:$0 子模块拆分"
    菲

    导出 _tmp=$(mktemp -d)
    导出 _libs="$@"
    对于我在 $_libs
    做
        mkdir -p $_tmp/$i
    完毕

    git filter-branch --commit-filter '
    函数 gitCommit()
    {
        git添加-A
        if [ -n "$(git diff --cached --name-only)" ]
        然后
            git commit -F $_msg
        菲
    } >/dev/null

    # 来自 git-filter-branch
    git checkout-index -f -u -a || die "无法签出索引"
    # $commit 删除的文件现在仍在工作树中;
    # 删除它们,否则它们将被再次添加
    git clean -d -q -f -x

    _git_dir=$GIT_DIR
    _git_work_tree=$GIT_WORK_TREE
    _git_index_file=$GIT_INDEX_FILE
    取消设置 GIT_DIR
    取消设置 GIT_WORK_TREE
    取消设置 GIT_INDEX_FILE

    _msg=$(临时文件)
    猫 /dev/stdin > $_msg
    对于我在 $_libs
    做
        如果 [ -d "$i" ]
        然后
            取消设置 GIT_DIR
            取消设置 GIT_WORK_TREE
            取消设置 GIT_INDEX_FILE
            光盘$i
            如果 [ -d ".git" ]
            然后
                gitCommit
            别的
                git init >/dev/null
                gitCommit
            菲
            光盘..
            rsync -a -rtu $i/.git/ $_tmp/$i/.git/
            导出 GIT_DIR=$_git_dir
            导出 GIT_WORK_TREE=$_git_work_tree
            导出 GIT_INDEX_FILE=$_git_index_file
            git rm -q -r --cached $i
            git 子模块添加 ./$i >/dev/null
            混帐添加 $i
        菲
    完毕
    rm $_msg
    导出 GIT_DIR=$_git_dir
    导出 GIT_WORK_TREE=$_git_work_tree
    导出 GIT_INDEX_FILE=$_git_index_file

    如果 [ -f ".gitmodules" ]
    然后
        混帐添加 .gitmodules
    菲

    _new_rev=$(git write-tree)
    转移
    git commit-tree "$_new_rev" "$@";
    ' --tag-name-filter cat -- --all

    对于我在 $_libs
    做
        如果 [ -d "$_tmp/$i/.git" ]
        然后
            rsync -a -i -rtu $_tmp/$i/.git/ $i/.git/
            光盘$i
            git reset --hard
            光盘..
        菲
    完毕
    rm -r $_tmp

    git for-each-ref refs/original --format="%(refname)" | 读我的时候;做 git update-ref -d $i; 完毕

    git reflog expire --expire=now --all
    git gc --aggressive --prune=now

    
于 2012-11-19T14:23:21.283 回答
2

我有一个项目,其中包含一个utils开始在其他项目中有用的库,并且想将其历史拆分为子模块。没想到先看 SO,所以我自己写了,它在本地构建历史,所以速度要快一点,之后如果你愿意,你可以设置辅助命令的.gitmodules文件等,并将子模块历史自己推送到任何地方你要。

剥离的命令本身在这里,文档在评论中,在后面的未剥离的命令中。subdir使用set 将其作为自己的命令运行,就像subdir=utils git split-submodule您要拆分utils目录一样。这很 hacky,因为它是一次性的,但我在 Git 历史记录中的 Documentation 子目录中对其进行了测试。

#!/bin/bash
# put this or the commented version below in e.g. ~/bin/git-split-submodule
${GIT_COMMIT-exec git filter-branch --index-filter "subdir=$subdir; ${debug+debug=$debug;} $(sed 1,/SNIP/d "$0")" "$@"}
${debug+set -x}
fam=(`git rev-list --no-walk --parents $GIT_COMMIT`)
pathcheck=(`printf "%s:$subdir\\n" ${fam[@]} \
    | git cat-file --batch-check='%(objectname)' | uniq`)
[[ $pathcheck = *:* ]] || {
    subfam=($( set -- ${fam[@]}; shift;
        for par; do tpar=`map $par`; [[ $tpar != $par ]] &&
            git rev-parse -q --verify $tpar:"$subdir"
        done
    ))
    git rm -rq --cached --ignore-unmatch  "$subdir"
    if (( ${#pathcheck[@]} == 1 && ${#fam[@]} > 1 && ${#subfam[@]} > 0)); then
        git update-index --add --cacheinfo 160000,$subfam,"$subdir"
    else
        subnew=`git cat-file -p $GIT_COMMIT | sed 1,/^$/d \
            | git commit-tree $GIT_COMMIT:"$subdir" $(
                ${subfam:+printf ' -p %s' ${subfam[@]}}) 2>&-
            ` &&
        git update-index --add --cacheinfo 160000,$subnew,"$subdir"
    fi
}
${debug+set +x}

#!/bin/bash
# Git filter-branch to split a subdirectory into a submodule history.

# In each commit, the subdirectory tree is replaced in the index with an
# appropriate submodule commit.
# * If the subdirectory tree has changed from any parent, or there are
#   no parents, a new submodule commit is made for the subdirectory (with
#   the current commit's message, which should presumably say something
#   about the change). The new submodule commit's parents are the
#   submodule commits in any rewrites of the current commit's parents.
# * Otherwise, the submodule commit is copied from a parent.

# Since the new history includes references to the new submodule
# history, the new submodule history isn't dangling, it's incorporated.
# Branches for any part of it can be made casually and pushed into any
# other repo as desired, so hooking up the `git submodule` helper
# command's conveniences is easy, e.g.
#     subdir=utils git split-submodule master
#     git branch utils $(git rev-parse master:utils)
#     git clone -sb utils . ../utilsrepo
# and you can then submodule add from there in other repos, but really,
# for small utility libraries and such, just fetching the submodule
# histories into your own repo is easiest. Setup on cloning a
# project using "incorporated" submodules like this is:
#   setup:  utils/.git
#
#   utils/.git:
#       @if _=`git rev-parse -q --verify utils`; then \
#           git config submodule.utils.active true \
#           && git config submodule.utils.url "`pwd -P`" \
#           && git clone -s . utils -nb utils \
#           && git submodule absorbgitdirs utils \
#           && git -C utils checkout $$(git rev-parse :utils); \
#       fi
# with `git config -f .gitmodules submodule.utils.path utils` and
# `git config -f .gitmodules submodule.utils.url ./`; cloners don't
# have to do anything but `make setup`, and `setup` should be a prereq
# on most things anyway.

# You can test that a commit and its rewrite put the same tree in the
# same place with this function:
# testit ()
# {
#     tree=($(git rev-parse `git rev-parse $1`: refs/original/refs/heads/$1));
#     echo $tree `test $tree != ${tree[1]} && echo ${tree[1]}`
# }
# so e.g. `testit make~95^2:t` will print the `t` tree there and if
# the `t` tree at ~95^2 from the original differs it'll print that too.

# To run it, say `subdir=path/to/it git split-submodule` with whatever
# filter-branch args you want.

# $GIT_COMMIT is set if we're already in filter-branch, if not, get there:
${GIT_COMMIT-exec git filter-branch --index-filter "subdir=$subdir; ${debug+debug=$debug;} $(sed 1,/SNIP/d "$0")" "$@"}

${debug+set -x}
fam=(`git rev-list --no-walk --parents $GIT_COMMIT`)
pathcheck=(`printf "%s:$subdir\\n" ${fam[@]} \
    | git cat-file --batch-check='%(objectname)' | uniq`)

[[ $pathcheck = *:* ]] || {
    subfam=($( set -- ${fam[@]}; shift;
        for par; do tpar=`map $par`; [[ $tpar != $par ]] &&
            git rev-parse -q --verify $tpar:"$subdir"
        done
    ))

    git rm -rq --cached --ignore-unmatch  "$subdir"
    if (( ${#pathcheck[@]} == 1 && ${#fam[@]} > 1 && ${#subfam[@]} > 0)); then
        # one id same for all entries, copy mapped mom's submod commit
        git update-index --add --cacheinfo 160000,$subfam,"$subdir"
    else
        # no mapped parents or something changed somewhere, make new
        # submod commit for current subdir content.  The new submod
        # commit has all mapped parents' submodule commits as parents:
        subnew=`git cat-file -p $GIT_COMMIT | sed 1,/^$/d \
            | git commit-tree $GIT_COMMIT:"$subdir" $(
                ${subfam:+printf ' -p %s' ${subfam[@]}}) 2>&-
            ` &&
        git update-index --add --cacheinfo 160000,$subnew,"$subdir"
    fi
}
${debug+set +x}
于 2019-02-21T00:19:03.813 回答
0

注意:子模块条目仅在您这样做时从父仓库创建

git submodule init
git submodule update

您不需要在rewrite-submodule-tree-filter脚本中使用这些命令,因为它只是关于正确设置.gitmodules文件内容。

git submodule只有当您第一次使用父仓库时,您才会执行这些“ ”命令:请参阅“使用子模块克隆项目”。

于 2012-11-17T12:17:16.233 回答
0

这是适用于 MacOSX 的更新答案。主要的变化是使用 pushd/popd 来改变目录,这样子模块就可以像 module/glop 而不仅仅是 glop。

#!/bin/bash

set -eu

if [ $# -eq 0 ]
then
    echo "Usage: $0 submodules-to-split"
fi

export _tmp=$(mktemp -d /tmp/git-submodule-split.XXXXXX)
export _libs="$@"
for i in $_libs
do
    mkdir -p $_tmp/$i
done

git filter-branch --commit-filter '
function gitCommit()
{
    git add -A
    if [ -n "$(git diff --cached --name-only)" ]
    then
        git commit -F $_msg
    fi
} >/dev/null

# from git-filter-branch
git checkout-index -f -u -a || die "Could not checkout the index"
# files that $commit removed are now still in the working tree;
# remove them, else they would be added again
git clean -d -q -f -x >&2

_git_dir=$GIT_DIR
_git_work_tree=$GIT_WORK_TREE
_git_index_file=$GIT_INDEX_FILE
unset GIT_DIR
unset GIT_WORK_TREE
unset GIT_INDEX_FILE

_msg=$(mktemp /tmp/git-submodule-split-msg.XXXXXX)
cat /dev/stdin > $_msg
for i in $_libs
do
    if [ -d "$i" ]
    then
        unset GIT_DIR
        unset GIT_WORK_TREE
        unset GIT_INDEX_FILE
        pushd $i > /dev/null
        if [ -d ".git" ]
        then
            gitCommit
        else
            git init >/dev/null
            gitCommit
        fi
        popd > /dev/null
        mkdir -p $_tmp/$i
        rsync -a -rtu $i/.git/ $_tmp/$i/.git/
        export GIT_DIR=$_git_dir
        export GIT_WORK_TREE=$_git_work_tree
        export GIT_INDEX_FILE=$_git_index_file
        git rm -q -r --cached $i >&2
        git submodule add ./$i $i >&2
        git add $i >&2
    fi
done
export GIT_DIR=$_git_dir
export GIT_WORK_TREE=$_git_work_tree
export GIT_INDEX_FILE=$_git_index_file

if [ -f ".gitmodules" ]
then
    git add .gitmodules >&2
fi

_new_rev=$(git write-tree)
shift
git commit-tree -F $_msg "$_new_rev" $@;
rm -f $_msg
' --tag-name-filter cat -- --all

for i in $_libs
do
    if [ -d "$_tmp/$i/.git" ]
    then
        rsync -a -i -rtu $_tmp/$i/.git/ $i/.git/
        pushd $i
        git reset --hard
        popd
    fi
done
rm -rf $_tmp

git for-each-ref refs/original --format="%(refname)" | while read i; do git update-ref -d $i; done

git reflog expire --expire=now --all
git gc --aggressive --prune=now
于 2014-10-22T20:03:50.870 回答