c# - 如何在不破坏单词的情况下在 c# 中破坏长字符串

Question

我想在 C# 中打破一个长字符串而不打破一个单词
示例：S AAA BBBBBBB CC DDDDDD V Breaking Character on 7 Count：

S AAA
BBBBBBB
CC 
DDDDDD 
V

我该怎么做呢？

score 4 · Accepted Answer

string inputStr = "S AAA BBBBBBB CC DDDDDD V ";
int maxWordLength = 7;
char separator = ' ';

string[] splitted = inputStr.Split(new[]{separator}, StringSplitOptions.RemoveEmptyEntries);

var joined = new Stack<string>();
joined.Push(splitted[0]);

foreach (var str in splitted.Skip(1))
{
    var strFromStack = joined.Pop();
    var joindedStr = strFromStack + separator + str;
    if(joindedStr.Length > maxWordLength)
    {
        joined.Push(strFromStack);
        joined.Push(str);
    }
    else
    {
        joined.Push(joindedStr);
    }
}   
var result = joined.Reverse().ToArray();

Console.WriteLine ("number of words: {0}", result.Length);

Console.WriteLine( string.Join(Environment.NewLine, result) );

印刷：

number of words: 5
S AAA
BBBBBBB
CC
DDDDDD
V

score 3 · Accepted Answer

这是一个利用正则表达式功能的更短的解决方案。

string input = "S AAA BBBBBBB CC DDDDDD V";

// Match up to 7 characters with optional trailing whitespace, but only on word boundaries
string pattern = @"\b.{1,7}\s*\b";

var matches = Regex.Matches(input, pattern);

foreach (var match in matches)
{
    Debug.WriteLine(match.ToString());
}

score 1 · Accepted Answer

如果我正确理解了您的问题，这可以解决问题。递归实现会更酷，但尾递归在 C# 中太糟糕了:)

也可以使用 yield 和IEnumerable<string>.

string[] splitSpecial(string words, int lenght)
{
  // The new result, will be turned into string[]
  var newSplit = new List<string>();
  // Split on normal chars, ie newline, space etc
  var splitted = words.Split();
  // Start out with null
  string word = null;

  for (int i = 0; i < splitted.Length; i++)
  {
      // If first word, add
      if (word == null)
      {
          word = splitted[i];
      }
      // If too long, add
      else if (splitted[i].Length + 1 + word.Length > lenght)
      {
          newSplit.Add(word);
          word = splitted[i];
      }
      // Else, concatenate and go again
      else
      {
          word += " " + splitted[i];
      }
  }
  // Flush what we have left, ie the last word
  newSplit.Add(word);

  // Convert into string[] (a requirement?)
  return newSplit.ToArray();
}

score 1 · Accepted Answer

为什么不尝试正则表达式？

(?:^|\s)(?:(.{1,7}|\S{7,}))(?=\s|$)

并使用所有捕获。

C#代码：

var text = "S AAA BBBBBBB CC DDDDDD V";
var matches = new Regex(@"(?:^|\s)(?:(.{1,7}|\S{7,}))(?=\s|$)").Matches(text).Cast<Match>().Select(x => x.Groups[1].Value).ToArray();
foreach (var match in matches)
{
    Console.WriteLine(match);
}

输出：

S AAA
BBBBBBB
CC
DDDDDD
V

score 0 · Accepted Answer

        string str = "S AAA BBBBBBB CC DDDDDD V";
        var words = str.Split(new char[] { ' ' }, StringSplitOptions.RemoveEmptyEntries);
        StringBuilder sb = new StringBuilder();
        List<string> result = new List<string>();
        for (int i = 0; i < words.Length; ++i)
        {
            if (sb.Length == 0)
            {
                sb.Append(words[i]);
            }
            else if (sb.Length + words[i].Length < 7)
            {
                sb.Append(' ');
                sb.Append(words[i]);
            }
            else
            {
                result.Add(sb.ToString());
                sb.Clear();
                sb.Append(words[i]);
            }
        }
        if (sb.Length > 0)
        {
            result.Add(sb.ToString());
        }

结果将包含：

S AAA
BBBBBBB
CC
DDDDDD
V

可以根据单词之间的分隔符是否应包含在 7 个字符中来调整谓词。

score 0 · Accepted Answer

这是向 HTML 文本添加换行符的方法：

SplitLongText（字符串_SourceText，int _MaxRowLength）{

        if (_SourceText.Length < _MaxRowLength)
        {
            return _SourceText;
        }
        else 
        {
            string _RowBreakText="";
            int _CurrentPlace = 0;
            while (_CurrentPlace < _SourceText.Length)
            {
                if (_SourceText.Length - _CurrentPlace < _MaxRowLength)
                {
                    _RowBreakText += _SourceText.Substring(_CurrentPlace);
                    _CurrentPlace = _SourceText.Length; 

                }
                else
                {
                    string _PartString = _SourceText.Substring(_CurrentPlace, _MaxRowLength);
                    int _LastSpace = _PartString.LastIndexOf(" ");
                    if (_LastSpace > 0)
                    {
                        _RowBreakText += _PartString.Substring(0, _LastSpace) + "<br/>" + _PartString.Substring(_LastSpace, (_PartString.Length - _LastSpace));
                    }
                    else
                    {
                        _RowBreakText += _PartString + "<br/>";
                    }
                    _CurrentPlace += _MaxRowLength;
                }

            }
            return _RowBreakText;

        }

score 0 · Accepted Answer

2021

看看这个扩展方法，它使用递归

public static string SubstringDontBreakWords(this string str, int maxLength)
{
    return str.Length <= maxLength ? str : str.Substring(0, str.LastIndexOf(" ")).Trim().SubstringDontBreakWords(maxLength);
}

像这样使用它

string text = "Hello friends";
text.SubstringDontBreakWords(10)

c# - 如何在不破坏单词的情况下在 c# 中破坏长字符串

7 回答 7

Related

Reference