transformers error: bert-base-chinese fails to load because github.com is unreachable

https://blog.csdn.net/weixin_37935970/article/details/123238677

 

pip install transformers==3.0.2

pip install torch==1.3.1

pip install huggingface_hub

tokenizer = torch.hub.load('huggingface/pytorch-transformers', 'tokenizer', 'bert-base-chinese')

(torch1.3) root@iZ2zedmeg2gi9atq5khtlgZ:~/online_doctor/bert_server# python bert_chinese_encode.py
Downloading: "https://github.com/huggingface/pytorch-transformers/archive/main.zip" to /root/.cache/torch/hub/main.zip
Traceback (most recent call last):
File "bert_chinese_encode.py", line 5, in <module>
tokenizer = torch.hub.load('huggingface/pytorch-transformers', 'tokenizer', 'bert-base-chinese')
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 399, in load
model = _load_local(repo_or_dir, model, *args, **kwargs)
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 427, in _load_local
entry = _load_entry_from_hubconf(hub_module, model)
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 230, in _load_entry_from_hubconf
_check_dependencies(m)
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 219, in _check_dependencies
raise RuntimeError('Missing dependencies: {}'.format(', '.join(missing_deps)))
RuntimeError: Missing dependencies: huggingface_hub

pip install huggingface_hub
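The RuntimeError above is raised before anything is downloaded: torch.hub reads the `dependencies` list declared in the repo's hubconf.py and verifies that each entry is importable. The check boils down to something like this (a simplified sketch, not torch's actual code):

```python
import importlib.util

def missing_deps(deps):
    # Return the subset of deps that cannot be imported -- roughly what
    # torch.hub's dependency check does before raising RuntimeError
    return [d for d in deps if importlib.util.find_spec(d) is None]

print(missing_deps(["json", "no_such_package_xyz"]))  # ['no_such_package_xyz']
```

Installing huggingface_hub into the active environment makes the import check pass, which is why the `pip install huggingface_hub` above resolves the error.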

 

Network issue:

Tried adding entries to /etc/hosts, then deleting them again; after countless retries the download finally started. (Configuring hosts entries does not always work.)

Downloading: "https://github.com/huggingface/pytorch-transformers/archive/main.zip" to /root/.cache/torch/hub/main.zip

Alternatively, download the archive manually on Windows and upload it to the .cache directory.

An actively maintained GitHub hosts file: https://gitee.com/ineo6/hosts
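If github.com stays unreachable, the archive can be fetched on another machine and placed into torch.hub's cache by hand. A rough sketch, assuming the paths from the log above and that torch.hub names the cached checkout `huggingface_pytorch-transformers_main` (as the "Using cache found in ..." lines later in the log suggest):

```shell
# On a machine that can reach github.com:
wget https://github.com/huggingface/pytorch-transformers/archive/main.zip

# On the server (paths taken from the log above):
mkdir -p /root/.cache/torch/hub
cd /root/.cache/torch/hub
unzip main.zip                      # extracts pytorch-transformers-main/
mv pytorch-transformers-main huggingface_pytorch-transformers_main
```

On the next run, torch.hub should then report "Using cache found in /root/.cache/torch/hub/huggingface_pytorch-transformers_main" instead of trying to download.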

 

(torch1.3) root@iZ2zedmeg2gi9atq5khtlgZ:~/online_doctor/bert_server# python bert_chinese_encode.py
============ huggingface pytorch-transformers
Downloading: "https://github.com/huggingface/pytorch-transformers/archive/main.zip" to /root/.cache/torch/hub/main.zip
============ huggingface pytorch-transformers
Traceback (most recent call last):
File "bert_chinese_encode.py", line 7, in <module>
model = torch.hub.load('huggingface/pytorch-transformers', 'model', 'bert-base-chinese')
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 397, in load
repo_or_dir = _get_cache_or_reload(repo_or_dir, force_reload, verbose, skip_validation)
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 165, in _get_cache_or_reload
repo_owner, repo_name, branch = _parse_repo_info(github)
File "/root/torch1.3/lib/python3.6/site-packages/torch/hub.py", line 119, in _parse_repo_info
with urlopen(f"https://github.com/{repo_owner}/{repo_name}/tree/main/"):
File "/usr/lib/python3.6/urllib/request.py", line 223, in urlopen
return opener.open(url, data, timeout)
File "/usr/lib/python3.6/urllib/request.py", line 526, in open
response = self._open(req, data)
File "/usr/lib/python3.6/urllib/request.py", line 544, in _open
'_open', req)
File "/usr/lib/python3.6/urllib/request.py", line 504, in _call_chain
result = func(*args)
File "/usr/lib/python3.6/urllib/request.py", line 1392, in https_open
context=self._context, check_hostname=self._check_hostname)
File "/usr/lib/python3.6/urllib/request.py", line 1352, in do_open
r = h.getresponse()
File "/usr/lib/python3.6/http/client.py", line 1383, in getresponse
response.begin()
File "/usr/lib/python3.6/http/client.py", line 320, in begin
version, status, reason = self._read_status()
File "/usr/lib/python3.6/http/client.py", line 289, in _read_status
raise RemoteDisconnected("Remote end closed connection without"
http.client.RemoteDisconnected: Remote end closed connection without response
(torch1.3) root@iZ2zedmeg2gi9atq5khtlgZ:~/online_doctor/bert_server# python bert_chinese_encode.py
============ huggingface pytorch-transformers
Using cache found in /root/.cache/torch/hub/huggingface_pytorch-transformers_main
============ huggingface pytorch-transformers
Using cache found in /root/.cache/torch/hub/huggingface_pytorch-transformers_main
Downloading: 66%|████████████████████████████████████████████████████████████████████▏ | 270M/412M [00:21<00:11, 12.1MB/s


"""
pip install transformers==3.0.2

pip install torch==1.3.1

pip install huggingface_hub
"""

import torch
import torch.nn as nn

# Load the Chinese BERT tokenizer via torch.hub
tokenizer = torch.hub.load('huggingface/pytorch-transformers', 'tokenizer', 'bert-base-chinese')
# Load the Chinese BERT model via torch.hub
model = torch.hub.load('huggingface/pytorch-transformers', 'model', 'bert-base-chinese')


# Function that obtains the BERT encoding of a sentence pair
def get_bert_encode(text_1, text_2, mark=102, max_len=10):
    '''
    Encode a pair of input texts with the Chinese BERT model.
    text_1: the first sentence
    text_2: the second sentence
    mark: separator token id; 102 is the id of the tokenizer's special
          separator token, inserted between the two texts
    max_len: maximum sentence length; longer sentences are truncated,
             shorter ones are right-padded with 0
    return: the BERT encoding of the input texts
    '''
    # Step 1: map both texts to token ids with the tokenizer
    indexed_tokens = tokenizer.encode(text_1, text_2)
    # Next, pad or truncate each of the two sentences;
    # first locate the position of the separator token
    k = indexed_tokens.index(mark)

    # Step 2: handle the first sentence, indexed_tokens[:k]
    if len(indexed_tokens[:k]) >= max_len:
        # Longer than max_len: truncate
        indexed_tokens_1 = indexed_tokens[:max_len]
    else:
        # Shorter than max_len: right-pad the remainder with 0
        indexed_tokens_1 = indexed_tokens[:k] + (max_len - len(indexed_tokens[:k])) * [0]

    # Step 3: handle the second sentence, indexed_tokens[k:]
    if len(indexed_tokens[k:]) >= max_len:
        # Longer than max_len: truncate
        indexed_tokens_2 = indexed_tokens[k:k+max_len]
    else:
        # Shorter than max_len: right-pad the remainder with 0
        indexed_tokens_2 = indexed_tokens[k:] + (max_len - len(indexed_tokens[k:])) * [0]

    # Concatenate the two processed id lists
    indexed_tokens = indexed_tokens_1 + indexed_tokens_2

    # Build an extra segment-id list telling the model which part is the
    # first sentence and which is the second: 0 marks the first, 1 the second.
    # Note: both sentences have already been normalized to length max_len.
    segments_ids = [0] * max_len + [1] * max_len

    # Wrap both lists into tensors with torch.tensor
    tokens_tensor = torch.tensor([indexed_tokens])
    segments_tensor = torch.tensor([segments_ids])

    # Run the model without tracking gradients
    with torch.no_grad():
        # Pass tokens_tensor and segments_tensor to BERT to obtain encoded_layers
        encoded_layers, _ = model(tokens_tensor, token_type_ids=segments_tensor)

    return encoded_layers


text_1 = "人生該如何起頭"
text_2 = "改變要如何起手"

encoded_layers = get_bert_encode(text_1, text_2)
print(encoded_layers)
print(encoded_layers.shape)
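The pad/truncate logic in get_bert_encode can be sanity-checked without loading the model at all. In the sketch below the token ids are made up for illustration; only 102 (the separator id used as `mark` above) matters:

```python
def pad_or_truncate(tokens, max_len=10):
    # Mirrors get_bert_encode: truncate to max_len, or right-pad with 0s
    if len(tokens) >= max_len:
        return tokens[:max_len]
    return tokens + [0] * (max_len - len(tokens))

# Fake tokenizer.encode output: 101 ([CLS]), sentence-1 ids, 102 ([SEP]), sentence-2 ids, 102
indexed = [101, 782, 4495, 102, 3121, 1359, 102]
k = indexed.index(102)                 # first separator splits the two sentences
part_1 = pad_or_truncate(indexed[:k])  # sentence 1, including [CLS]
part_2 = pad_or_truncate(indexed[k:])  # sentence 2, including the leading [SEP]
print(part_1)                # [101, 782, 4495, 0, 0, 0, 0, 0, 0, 0]
print(part_2)                # [102, 3121, 1359, 102, 0, 0, 0, 0, 0, 0]
print(len(part_1 + part_2))  # 20, matching the [0]*max_len + [1]*max_len segment ids
```

With both parts normalized to max_len = 10, the model sees a sequence of 20 tokens, so with bert-base's hidden size of 768 the printed shape should come out as torch.Size([1, 20, 768]).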