如何获取存储在变量中的字符串的长度并将其分配给另一个变量?
myvar="some string" echo ${#myvar} # 11
如何为输出设置另一个变量11?
11
myvar='Généralités' chrlen=${#myvar} oLang=$LANG oLcAll=$LC_ALL LANG=C LC_ALL=C bytlen=${#myvar} LANG=$oLang LC_ALL=$oLcAll printf "%s is %d char len, but %d bytes len.\n" "${myvar}" $chrlen $bytlen
将呈现:
Généralités is 11 char len, but 14 bytes len.
您甚至可以查看存储的字符:
myvar='Généralités' chrlen=${#myvar} oLang=$LANG oLcAll=$LC_ALL LANG=C LC_ALL=C bytlen=${#myvar} printf -v myreal "%q" "$myvar" LANG=$oLang LC_ALL=$oLcAll printf "%s has %d chars, %d bytes: (%s).\n" "${myvar}" $chrlen $bytlen "$myreal"
会回答:
Généralités has 11 chars, 14 bytes: ($'G\303\251n\303\251ralit\303\251s').
注意:我已将设置添加到$LC_ALLwith $LANG。
$LC_ALL
$LANG
参数工作与常规变量相同
showStrLen() { local bytlen sreal oLang=$LANG oLcAll=$LC_ALL LANG=C LC_ALL=C bytlen=${#1} printf -v sreal %q "$1" LANG=$oLang LC_ALL=$oLcAll printf "String '%s' is %d bytes, but %d chars len: %s.\n" "$1" $bytlen ${#1} "$sreal" }
将作为
showStrLen théorème String 'théorème' is 10 bytes, but 8 chars len: $'th\303\251or\303\250me'
printf
如果你:
for string in Généralités Language Théorème Février "Left: ←" "Yin Yang ☯";do printf " - %-14s is %2d char length\n" "'$string'" ${#string} done - 'Généralités' is 11 char length - 'Language' is 8 char length - 'Théorème' is 8 char length - 'Février' is 7 char length - 'Left: ←' is 7 char length - 'Yin Yang ☯' is 10 char length
不是很漂亮的输出!
为此,这里有一个小功能:
strU8DiffLen() { local charlen=${#1} LANG=C LC_ALL=C return $(( ${#1} - charlen )) }
或写在一行中:
strU8DiffLen() { local chLen=${#1} LANG=C LC_ALL=C;return $((${#1}-chLen));}
那么现在:
for string in Généralités Language Théorème Février "Left: ←" "Yin Yang ☯";do strU8DiffLen "$string" printf " - %-$((14+$?))s is %2d chars length, but uses %2d bytes\n" \ "'$string'" ${#string} $((${#string}+$?)) done - 'Généralités' is 11 chars length, but uses 14 bytes - 'Language' is 8 chars length, but uses 8 bytes - 'Théorème' is 8 chars length, but uses 10 bytes - 'Février' is 7 chars length, but uses 8 bytes - 'Left: ←' is 7 chars length, but uses 9 bytes - 'Yin Yang ☯' is 10 chars length, but uses 12 bytes
但是留下了一些奇怪的 UTF-8 行为,比如双倍行距字符、零行距字符、反向置换和其他不可能那么简单的行为......