Android url中文编码问题

最近项目遇见一个很奇葩问题，关于URL问题，项目中加载图片，图片的URL含有中文，但是，我的手机可以加载，没问题，同事也都可以，但是测试手机却不可以，加载失败，找到问题，就是URL含有中文问题。

解决方案：

把中文字符encode即可：

方法1：

 public static String encodeUrl(String url) {

        return Uri.encode(url, "-![.:/,%?&=]");

 }

方法2：

 public static String toUtf8String(String s) {

        StringBuffer sb = new StringBuffer();

        for (int i = 0; i < s.length(); i++) {

            char c = s.charAt(i);

            if (c >= 0 && c <= 255) {

                sb.append(c);

            } else {

                byte[] b;

                try {

                    b = String.valueOf(c).getBytes("utf-8");

                } catch (Exception ex) {

                    System.out.println(ex);

                    b = new byte[0];

                }

                for (int j = 0; j < b.length; j++) {

                    int k = b[j];

                    if (k < 0)

                        k += 256;

                    sb.append("%" + Integer.toHexString(k).toUpperCase());

                }

            }

        }

        return sb.toString();

    }

或者

import java.io.CharArrayWriter;

import java.io.UnsupportedEncodingException;

import java.net.URLDecoder;

import java.nio.charset.Charset;

import java.nio.charset.IllegalCharsetNameException;

import java.nio.charset.UnsupportedCharsetException;

import java.util.BitSet;

public class URLEncoderURI {

    static BitSet dontNeedEncoding;

    static final int caseDiff = ('a' - 'A');

    static {

        /*

         * The list of characters that are not encoded has been determined as

         * follows:

         *

         * RFC 2396 states: ----- Data characters that are allowed in a URI but

         * do not have a reserved purpose are called unreserved. These include

         * upper and lower case letters, decimal digits, and a limited set of

         * punctuation marks and symbols.

         *

         * unreserved = alphanum | mark

         *

         * mark = "-" | "_" | "." | "!" | "~" | "*" | "'" | "(" | ")"

         *

         * Unreserved characters can be escaped without changing the semantics

         * of the URI, but this should not be done unless the URI is being used

         * in a context that does not allow the unescaped character to appear.

         * -----

         *

         * It appears that both Netscape and Internet Explorer escape all

         * special characters from this list with the exception of "-", "_",

         * ".", "*". While it is not clear why they are escaping the other

         * characters, perhaps it is safest to assume that there might be

         * contexts in which the others are unsafe if not escaped. Therefore, we

         * will use the same list. It is also noteworthy that this is consistent

         * with O'Reilly's "HTML: The Definitive Guide" (page 164).

         *

         * As a last note, Intenet Explorer does not encode the "@" character

         * which is clearly not unreserved according to the RFC. We are being

         * consistent with the RFC in this matter, as is Netscape.

         */

        dontNeedEncoding = new BitSet(256);

        int i;

        for (i = 'a'; i <= 'z'; i++) {

            dontNeedEncoding.set(i);

        }

        for (i = 'A'; i <= 'Z'; i++) {

            dontNeedEncoding.set(i);

        }

        for (i = '0'; i <= '9'; i++) {

            dontNeedEncoding.set(i);

        }

        dontNeedEncoding.set(' '); /*

                                     * encoding a space to a + is done in the

                                     * encode() method

                                     */

        dontNeedEncoding.set('-');

        dontNeedEncoding.set('_');

        dontNeedEncoding.set('.');

        dontNeedEncoding.set('*');

        dontNeedEncoding.set(':');

        dontNeedEncoding.set('/');

        dontNeedEncoding.set('?');

        dontNeedEncoding.set(';');

        dontNeedEncoding.set('&');

        dontNeedEncoding.set('=');

    }

    /**

     * You can't call the constructor.

     */

    private URLEncoderURI() {

    }

    /**

     * Translates a string into <code>application/x-www-form-urlencoded</code>

     * format using a specific encoding scheme. This method uses the supplied

     * encoding scheme to obtain the bytes for unsafe characters.

     * <p>

     * <em><strong>Note:</strong> The <a href=

     * "http://www.w3.org/TR/html40/appendix/notes.html#non-ascii-chars">

     * World Wide Web Consortium Recommendation</a> states that

     * UTF-8 should be used. Not doing so may introduce

     * incompatibilites.</em>

     *

     * @param s

     *            <code>String</code> to be translated.

     * @param enc

     *            The name of a supported <a

     *            href="../lang/package-summary.html#charenc">character

     *            encoding</a>.

     * @return the translated <code>String</code>.

     * @exception UnsupportedEncodingException

     *                If the named encoding is not supported

     * @see URLDecoder#decode(java.lang.String, java.lang.String)

     * @since 1.4

     */

    public static String encode(String s, String enc) throws UnsupportedEncodingException {

        boolean needToChange = false;

        StringBuffer out = new StringBuffer(s.length());

        Charset charset;

        CharArrayWriter charArrayWriter = new CharArrayWriter();

        if (enc == null)

            throw new NullPointerException("charsetName");

        try {

            charset = Charset.forName(enc);

        } catch (IllegalCharsetNameException e) {

            throw new UnsupportedEncodingException(enc);

        } catch (UnsupportedCharsetException e) {

            throw new UnsupportedEncodingException(enc);

        }

        for (int i = 0; i < s.length();) {

            int c = (int) s.charAt(i);

            // System.out.println("Examining character: " + c);

            if (dontNeedEncoding.get(c)) {

                if (c == ' ') {

                    c = '+';

                    needToChange = true;

                }

                // System.out.println("Storing: " + c);

                out.append((char) c);

                i++;

            } else {

                // convert to external encoding before hex conversion

                do {

                    charArrayWriter.write(c);

                    /*

                     * If this character represents the start of a Unicode

                     * surrogate pair, then pass in two characters. It's not

                     * clear what should be done if a bytes reserved in the

                     * surrogate pairs range occurs outside of a legal surrogate

                     * pair. For now, just treat it as if it were any other

                     * character.

                     */

                    if (c >= 0xD800 && c <= 0xDBFF) {

                        /*

                         * System.out.println(Integer.toHexString(c) +

                         * " is high surrogate");

                         */

                        if ((i + 1) < s.length()) {

                            int d = (int) s.charAt(i + 1);

                            /*

                             * System.out.println("\tExamining " +

                             * Integer.toHexString(d));

                             */

                            if (d >= 0xDC00 && d <= 0xDFFF) {

                                /*

                                 * System.out.println("\t" +

                                 * Integer.toHexString(d) +

                                 * " is low surrogate");

                                 */

                                charArrayWriter.write(d);

                                i++;

                            }

                        }

                    }

                    i++;

                } while (i < s.length() && !dontNeedEncoding.get((c = (int) s.charAt(i))));

                charArrayWriter.flush();

                String str = new String(charArrayWriter.toCharArray());

                byte[] ba = str.getBytes(charset);

                for (int j = 0; j < ba.length; j++) {

                    out.append('%');

                    char ch = Character.forDigit((ba[j] >> 4) & 0xF, 16);

                    // converting to use uppercase letter as part of

                    // the hex value if ch is a letter.

                    if (Character.isLetter(ch)) {

                        ch -= caseDiff;

                    }

                    out.append(ch);

                    ch = Character.forDigit(ba[j] & 0xF, 16);

                    if (Character.isLetter(ch)) {

                        ch -= caseDiff;

                    }

                    out.append(ch);

                }

                charArrayWriter.reset();

                needToChange = true;

            }

        }

        return (needToChange ? out.toString() : s);

    }

}

参考：

文／SIMPLE孙鹏（简书作者）
原文链接：http://www.jianshu.com/p/9be694c8fee2
著作权归作者所有，转载请联系作者获得授权，并标注“简书作者”。

Android url中文编码问题的更多相关文章

Android URL中文处理
不多说,贴上代码.大家都明确 import java.io.File; import android.net.Uri; public class Transition { /** * @param u ...
.NET C#中处理Url中文编码问题
近些日子在做一个用C#访问webservise的程序,由于需要传递中文参数去请求网站,所以碰到了中文编码问题.我们知道像百度这种搜索引擎中,当用户输入中文关键字后,它会把中文转码,以确保在Url中不会 ...
iOS url中文编码
有两种方法: 一,使用NSString的方法: NSString* string2 = [string1 stringByAddingPercentEscapesUsingEncoding:NSUTF ...
ANdroid URL
1 Android开源项目和工具分类 http://blog.csdn.net/shimiso/article/details/40889361 2 分享45个android实例源码 http://w ...
使用Curl进行抓取远程内容时url中文编码问题
PHP中对于URL进行编码,可以使用 urlencode() 或者 rawurlencode(),二者的区别是前者把空格编码为 '+',而后者把空格编码为 '%20',不过应该注意的是,在编码时应该只 ...
Apache+mod_encoding解决URL中文编码问题
我们经常在论坛上看到这样的求救贴: 为什么我看不了网站上中文文件名的文件?这时一定会有好心的大侠告诉说,到IE6的工具,Internet选项, 高级里,把"总是以UTF-8发送URL&qu ...
Android Url相关工具通用类UrlUtil
1.整体分析 1.1.源代码查看,可以直接Copy. public class UrlUtil { public static boolean isUrlPrefix(String url) { re ...
Apache2.2+mod_encoding解决URL中文编码问题
我们经常在论坛上看到这样的求救贴: 为什么我看不了网站上中文文件名的文件?这时一定会有好心的大侠告诉说,到IE6的工具,Internet选项, 高级里,把"总是以UTF-8发送URL&quo ...
URL中文编码
/// <summary> /// GB2312编码 /// </summary> /// <param name=" ...

随机推荐

CentOS安装配置ganglia
1. 下载ganglia源码包并解压 wget http://sourceforge.net/projects/ganglia/files/ganglia%20monitoring%20cor ...
十进制二进制之间的转化 PHP算法
[ 十进制转二进制 ] function test($var){ $func = function($i){ if($i < 2){ return $i; } $return['int'] = ...
python3.4+selenium爬58同城（一）
爬取http://bj.58.com/pbdn/0/pn2/中除转转.推广商品以外的产品信息,因为转转和推广的详情信息不规范,需要另外写一个方法存放,后期补上,详情页如下这周学习了爬虫,但是遇到一些 ...
python中__init__.py文件的作用
问题在执行models.py时,报ImportError:No module named transwarp.db的错误,但明明transwarp下就有db.py文件,路径也没有错误.真是想不通.后 ...
Ubuntu14.04LST安装weblogic11g
1:下载链接http://download.oracle.com/otn/nt/middleware/11g/wls/1036/wls1036_generic.jar 2:进行安装(前提已经安装好JD ...
SmartBusinessDevFramework架构设计-3：考虑开源？
掖着藏着,终归不是好的办法.说的跟花一样,究竟里子是什么东西.一个好的被子,里料是羽绒还是棉花还是丝绵还是黑心棉?有时候,真的是看过之后,才能体验其中的奥秘. 这个架构的设计初衷,总体是为了方便.ne ...
Linux下的摄影后期处理软件
由于喜欢摄影,在LInux上折腾,想找一款能代替lightroom的软件.发现darktable这款软件专业.于是就安装了. 以下是在Linux上安装darktable的instruction,需要添 ...
UESTC_秋实大哥与时空漫游 2015 UESTC Training for Graph Theory<Problem C>
C - 秋实大哥与时空漫游 Time Limit: 4500/1500MS (Java/Others) Memory Limit: 65535/65535KB (Java/Others) Su ...
LeeCode-Pow(x, n)
Implement pow(x, n). double myPow(double x, int n) { ) return 1.0; ) return 1.0/pow(x,-n); ); }
【转】android 电池（一）：锂电池基本原理篇
关键词:android 电池关机充电 androidboot.mode charger 平台信息:内核:linux2.6/linux3.0系统:android/android4.0 平台:S5PV3 ...

Android url中文编码问题

Android url中文编码问题的更多相关文章

随机推荐

热门专题