国产精品电影_久久视频免费_欧美日韩国产激情_成年人视频免费在线播放_日本久久亚洲电影_久久都是精品_66av99_九色精品美女在线_蜜臀a∨国产成人精品_冲田杏梨av在线_欧美精品在线一区二区三区_麻豆mv在线看

多模態大模型Reyes增加batch推理方式,提升推理速度 原創

發布于 2025-1-16 15:35
瀏覽
0收藏

筆者在前面預訓練了一個多模態大模型Reyes,詳情見《??【多模態&LLM】Reyes:一個從0到1開始訓練的多模態大模型(技術報告)??》。本文將為Reyes增加一個batch推理方式,提高Reyes的推理速度。

Reyes-8B開源地址:

  • modelscope權重下載地址:https://modelscope.cn/models/yujunhuinlp/Reyes-8B
  • github:https://github.com/yujunhuics/Reyes

使用方式

將本倉庫中的??modeling_reyes.py???文件替換modelscrope下載的??modeling_reyes.py???運行即可。 batch推理詳細見github:??batch_inference.ipynb??.

??modeling_reyes.py??增項:

def chat_batch(
            self,
            tokenizer,
            pixel_values_list,
            questions,
            generation_config,
            histories=None,
            return_histories=False,
            num_patches_lists=None,
            IMG_START_TOKEN='<|vision_start|>',
            IMG_END_TOKEN='<|vision_end|>',
            IMG_CONTEXT_TOKEN='<|vision_pad|>',
            verbose=False,
            visual_features_list=None
    ):

        if histories isNone:
            histories = [[] for _ in questions]

        img_context_token_id = tokenizer.convert_tokens_to_ids(IMG_CONTEXT_TOKEN)
        self.img_context_token_id = img_context_token_id
        # Get eos_token_id from the template
        template = get_conv_template(self.template)
        template.system_message = self.system_message
        eos_token_id = tokenizer.convert_tokens_to_ids(template.sep)
        generation_config['eos_token_id'] = eos_token_id

        queries = []
        input_ids_list = []
        attention_mask_list = []

        for idx in range(len(questions)):
            question = questions[idx]
            history = histories[idx]
            pixel_values = pixel_values_list[idx] if pixel_values_list[idx] isnotNoneelseNone
            num_patches_list = [pixel_values.shape[0]] if pixel_values isnotNoneelse []

            ifnot history and pixel_values isnotNoneand'<image>'notin question:
                question = '<image>\n' + question

            template_i = get_conv_template(self.template)
            template_i.system_message = self.system_message
            for (old_question, old_answer) in history:
                template_i.append_message(template_i.roles[0], old_question)
                template_i.append_message(template_i.roles[1], old_answer)
            template_i.append_message(template_i.roles[0], question)
            template_i.append_message(template_i.roles[1], None)
            query = template_i.get_prompt()
            # Handle image tokens
            if pixel_values isnotNone:
                for num_patches in num_patches_list:
                    tile_pos_identifiers = [f"<tile_{i}>"for i in range(1, num_patches)] + ["<tile_global_thumbnail>"]
                    image_tokens = ''
                    for tile_pos_identifier in tile_pos_identifiers:
                        image_tokens += tile_pos_identifier + IMG_CONTEXT_TOKEN * self.num_image_token
                    image_tokens = IMG_START_TOKEN + image_tokens + IMG_END_TOKEN
                    query = query.replace('<image>', image_tokens, 1)

            model_inputs = tokenizer(
                query,
                return_tensors='pt',
                padding=True,
                truncation=True
            )
            input_ids = model_inputs['input_ids'].cuda()
            attention_mask = model_inputs['attention_mask'].cuda()
            input_ids_list.append(input_ids)
            attention_mask_list.append(attention_mask)

        # Call the generate function
        generation_output = self.generate_batch(
            pixel_values_list=pixel_values_list,
            input_ids_list=input_ids_list,
            attention_mask_list=attention_mask_list,
            **generation_config
        )
        responses = tokenizer.batch_decode(generation_output, skip_special_tokens=True)

        outputs = []
        for idx, response in enumerate(responses):
            response = response.split(template.sep)[0].strip()
            histories[idx].append((questions[idx], response))
            outputs.append(response)

        if return_histories:
            return outputs, histories
        else:
            if verbose:
                for idx, query in enumerate(queries):
                    query_to_print = query.replace(IMG_CONTEXT_TOKEN, '')
                    query_to_print = query_to_print.replace(f'{IMG_START_TOKEN}{IMG_END_TOKEN}', '<image>')
                    print(query_to_print, outputs[idx])
            return outputs

    @torch.no_grad()
    def generate_batch(
            self,
            pixel_values_list: Optional[List[torch.FloatTensor]] = None,
            input_ids_list: Optional[List[torch.FloatTensor]] = None,
            attention_mask_list: Optional[List[torch.LongTensor]] = None,
            visual_features: Optional[torch.FloatTensor] = None,
            generation_config: Optional[GenerationConfig] = None,
            output_hidden_states: Optional[bool] = None,
            return_dict: Optional[bool] = None,
            **generate_kwargs,
    ) -> torch.LongTensor:
        input_embeds_list = []
        attention_mask_padded_list = []

        max_seq_length = max(input_ids.shape[1] for input_ids in input_ids_list)

        for pixel_values, input_ids, attention_mask in zip(pixel_values_list, input_ids_list, attention_mask_list):
            if pixel_values isnotNone:
                if visual_features isnotNone:
                    vit_embeds = visual_features.cuda()
                    vit_embeds = self.mlp1(vit_embeds)
                else:
                    vit_embeds = self.extract_feature(pixel_values)

                input_embeds = self.language_model.get_input_embeddings()(input_ids)
                B, N, C = input_embeds.shape
                input_embeds = input_embeds.reshape(B * N, C)

                input_ids = input_ids.reshape(B * N)
                selected = (input_ids == self.img_context_token_id)
                assert selected.sum() != 0, "No valid image context token IDs found."
                input_embeds[selected] = vit_embeds.reshape(-1, C).to(input_embeds.device)

                input_embeds = input_embeds.reshape(B, N, C)
            else:
                input_embeds = self.language_model.get_input_embeddings()(input_ids)

            seq_length = input_embeds.shape[1]
            if seq_length < max_seq_length:
                pad_size = max_seq_length - seq_length
                input_embeds = F.pad(input_embeds, (0, 0, 0, pad_size))
                attention_mask = F.pad(attention_mask, (0, pad_size))

            input_embeds_list.append(input_embeds)
            attention_mask_padded_list.append(attention_mask)

        input_embeds = torch.cat(input_embeds_list, dim=0)
        attention_mask = torch.cat(attention_mask_padded_list, dim=0)

        outputs = self.language_model.generate(
            inputs_embeds=input_embeds,
            attention_mask=attention_mask,
            generation_config=generation_config,
            output_hidden_states=output_hidden_states,
            return_dict=return_dict,
            use_cache=True,
            **generate_kwargs,
        )

        return outputs

batch推理:

import torch
from modelscope import AutoTokenizer, AutoModel
from PIL import Image
import torchvision.transforms as T
from torchvision.transforms.functional import InterpolationMode

IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def build_transform(input_size):
    MEAN, STD = IMAGENET_MEAN, IMAGENET_STD
    transform = T.Compose([
        T.Lambda(lambda img: img.convert('RGB') if img.mode != 'RGB'else img),
        T.Resize((input_size, input_size), interpolation=InterpolationMode.BICUBIC),
        T.ToTensor(),
        T.Normalize(mean=MEAN, std=STD)
    ])
    return transform


def find_closest_aspect_ratio(aspect_ratio, target_ratios, width, height, image_size):
    best_ratio_diff = float('inf')
    best_ratio = (1, 1)
    area = width * height
    for ratio in target_ratios:
        target_aspect_ratio = ratio[0] / ratio[1]
        ratio_diff = abs(aspect_ratio - target_aspect_ratio)
        if ratio_diff < best_ratio_diff:
            best_ratio_diff = ratio_diff
            best_ratio = ratio
        elif ratio_diff == best_ratio_diff:
            if area > 0.5 * image_size * image_size * ratio[0] * ratio[1]:
                best_ratio = ratio
    return best_ratio


def dynamic_preprocess(image, min_num=1, max_num=12, image_size=448, use_thumbnail=False):
    orig_width, orig_height = image.size
    aspect_ratio = orig_width / orig_height

    # calculate the existing image aspect ratio
    target_ratios = set(
        (i, j) for n in range(min_num, max_num + 1) for i in range(1, n + 1) for j in range(1, n + 1) if
        i * j <= max_num and i * j >= min_num)
    target_ratios = sorted(target_ratios, key=lambda x: x[0] * x[1])

    # find the closest aspect ratio to the target
    target_aspect_ratio = find_closest_aspect_ratio(
        aspect_ratio, target_ratios, orig_width, orig_height, image_size)

    # calculate the target width and height
    target_width = image_size * target_aspect_ratio[0]
    target_height = image_size * target_aspect_ratio[1]
    blocks = target_aspect_ratio[0] * target_aspect_ratio[1]

    # resize the image
    resized_img = image.resize((target_width, target_height))
    processed_images = []
    for i in range(blocks):
        box = (
            (i % (target_width // image_size)) * image_size,
            (i // (target_width // image_size)) * image_size,
            ((i % (target_width // image_size)) + 1) * image_size,
            ((i // (target_width // image_size)) + 1) * image_size
        )
        # split the image
        split_img = resized_img.crop(box)
        processed_images.append(split_img)
    assert len(processed_images) == blocks
    if use_thumbnail and len(processed_images) != 1:
        thumbnail_img = image.resize((image_size, image_size))
        processed_images.append(thumbnail_img)
    return processed_images


def load_image(image_file, input_size=448, max_num=12):
    image = Image.open(image_file).convert('RGB')
    transform = build_transform(input_size=input_size)
    images = dynamic_preprocess(image, image_size=input_size, use_thumbnail=True, max_num=max_num)
    pixel_values = [transform(image) for image in images]
    pixel_values = torch.stack(pixel_values)
    return pixel_values


def preprocess_image(file_path, dynamic=True, max_num=6, image_size=448):
    try:
        if dynamic:
            return load_image(file_path, max_num=max_num).to(torch.bfloat16).cuda()
        else:
            img = Image.open(file_path).convert('RGB')
            transform = build_transform(image_size)
            pixel_values = transform(img)
            return torch.stack([pixel_values]).to(torch.bfloat16).cuda()
    except Exception as e:
        raise RuntimeError(f"Error processing image: {e}")


path = "Reyes-8B"

model = AutoModel.from_pretrained(
    path,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
).eval().cuda()

# print(model)

tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True, use_fast=False)
generation_config = dict(max_new_tokens=2048, do_sample=False)
questions = [
    "<image>\nDescribe this image.",
    "<image>\nDescribe this image.",
    "<image>\nDescribe this image.",
]

images_path = ["t6.png","t6.png","t6.png"]


def conversation(model, tokenizer, questions, images_path,generation_config,histories):
    pixel_values_list=[]

    for i in range(len(questions)):
        if images_path[i] isnotNone:
            pixel_values = preprocess_image(file_path, dynamic=True)
            pixel_values_list.append(pixel_values)


    return model.chat_batch(tokenizer, pixel_values_list, questions, generation_config, histories, return_histories=False)

responses= conversation(model, tokenizer, questions, images_path,generation_config,histories=None)
for question, response in zip(questions, responses):
    print(f"User: {question}\nAssistant: {response}\n")



本文轉載自公眾號大模型自然語言處理  作者:余俊暉

原文鏈接:??https://mp.weixin.qq.com/s/IeDUGzTOnOEONrFoLvXFcg??


?著作權歸作者所有,如需轉載,請注明出處,否則將追究法律責任
收藏
回復
舉報
回復
相關推薦
欧美日韩高清不卡| 亚洲一区二区偷拍精品| 欧美在线视频网| 国产最新在线| 久久久久久久电影| 久久久久久精| 精品精品99| 中文字幕亚洲综合久久筱田步美| 国产九九在线| 亚洲美女屁股眼交3| 日本一道在线观看| 亚洲综合另类| 91传媒在线免费观看| 91精品啪在线观看国产手机 | 久久大胆人体| 91久久久免费一区二区| 先锋影音资源999| 国产精品自拍av| 午夜精品一区二区在线观看| 欧美人成在线| 国产精品直播网红| 国产精品对白| 久久国产精品免费视频| 九色porny自拍视频在线观看| 欧美日韩亚洲不卡| 欧美亚洲日本| 日韩欧美亚洲成人| 曰韩少妇与小伙激情| 国产精品你懂的在线欣赏| 日韩欧美亚洲天堂| 懂色av一区二区夜夜嗨| 好色先生视频污| 国产一区视频导航| 免费久久久久久| 久久国产精品无码网站| 日韩精品av一区二区三区| 亚洲大胆视频| 免费国产在线精品一区二区三区| 欧美视频导航| 久久国产一区| 天堂精品中文字幕在线| 欧美日韩在线高清| 日本美女一区二区三区| 亚洲精品美女久久7777777| 人人超碰91尤物精品国产| 日韩一区二区电影在线观看| 视频精品一区二区| 综合色婷婷一区二区亚洲欧美国产| 免费精品视频最新在线| 成年人三级视频| 26uuu久久天堂性欧美| 丰满少妇在线观看| 国产精品视频观看| 亚欧黄色av| 一本到三区不卡视频| 啊v在线视频| 日韩欧美中文字幕公布| 午夜影院一区| 久久九九全国免费精品观看| 伊人精品综合| 国产在线精品播放| 国产精品亚洲综合久久| 超碰cao国产精品一区二区| 欧美老女人性生活| 欧美男男freegayvideosroom| 91大神在线播放精品| 青青草成人影院| 国产乱子伦精品| 蜜桃精品视频在线| 97xxxxx| 好看的av在线不卡观看| 精品久久久久久综合日本| 精品一区二区国语对白| 波多野结衣天堂| 色综合一区二区三区| 第一中文字幕在线| 亚洲狠狠丁香婷婷综合久久久| 免费高清在线| 欧美高清视频不卡网| 日本成人片在线| 欧美在线免费视频| 伊人天天综合| 日本精品久久久久久久久久| 国产精品国产精品国产专区不蜜| 视频一区二区三区国产| 亚洲精品久久在线| 国产有码在线| 亚洲精品99999| 欧美性生活一级片| 久久99精品久久久久久水蜜桃| 国产自产高清不卡| 天堂在线视频中文网| 欧美一级高清片在线观看| 久久久久亚洲精品中文字幕| 444亚洲人体| 成人晚上爱看视频| 青青草娱乐在线| 国产一区二区三区视频免费| 小嫩嫩12欧美| 蜜桃视频在线观看成人| 欧美国产一区二区| 美女写真理伦片在线看| 久久久久www| 在线日韩中文| mm131亚洲精品| 精品福利视频一区二区三区| 欧美日本网站| 日韩一级黄色av| 激情久久综合| 99视频在线免费| 日韩美女视频在线| 日韩精品一区二区三区免费观影| 裸体裸乳免费看| 色偷偷88欧美精品久久久| 色诱色偷偷久久综合| 国产一区自拍视频| 亚洲美女免费视频| 高清av一区| 久久伊人资源站| 亚洲午夜久久久久久久久久久| 欧美国产大片| 99九九视频| 综合中文字幕亚洲| 91国拍精品国产粉嫩亚洲一区| 国产一区二区黄色| 一区二区三区在线看| 在线观看欧美| 久久久久久久免费视频| 欧美日韩成人在线| 精品国产乱码久久久久久果冻传媒| 成人免费a级片| 日韩欧美在线影院| 欧美a级一区| 在线看你懂得| 欧美中在线观看| 久久久99免费| 91精品国产自产观看在线| 日本黄色a视频| 日韩欧美在线123| 国产亚洲综合精品| www.在线视频.com| 亚洲xxxx视频| 欧美日韩国产在线播放| 日韩精选在线| 日本美女高潮视频| 久久精品国产一区| 成人黄页在线观看| 忘忧草在线www成人影院| 色视频一区二区三区| 欧美夫妻性生活| 国产欧美一级| 一区二区三区视频在线观看视频| 国产欧美精品在线播放| 亚洲视频免费在线| y111111国产精品久久久| 色综合av综合无码综合网站| www.日韩.com| 国产欧美在线观看一区| 96sao在线精品免费视频| 91极品尤物在线播放国产| 欧美激情视频播放| 综合久久久久久| 欧美丝袜激情| 精品欧美不卡一区二区在线观看| 91久久精品美女| 91福利在线观看| 亚洲在线免费| 日本免费一区二区六区| 日韩精品一区二区三区电影| 中文视频一区视频二区视频三区| 欧美不卡视频一区| 极品少妇xxxx精品少妇偷拍| 亚洲第一av| 国产精品入口芒果| 欧美精品久久一区二区| 亚洲视频狠狠干| 爽成人777777婷婷| 丝袜美腿美女被狂躁在线观看| 欧美重口乱码一区二区| 亚洲黄色av网站| 高清shemale亚洲人妖| 少妇高潮一区二区三区99| 欧美一级片中文字幕| 奇米四色中文综合久久| 一本大道综合伊人精品热热 | 日本不卡二区高清三区| 亚洲第一色中文字幕| 国产成人精品亚洲777人妖| 六九午夜精品视频| 成人亚洲成人影院| 91免费版黄色| 亚洲国产欧美在线成人app| bt欧美亚洲午夜电影天堂| 麻豆一区二区麻豆免费观看| 在线亚洲电影| 亚洲一区精品视频| 久久久久久69| 欧美午夜精品一区二区三区| 国产在线不卡视频| 香蕉久久精品|