Copyright 2000, 2001 Free Software Foundation, Inc.

This file is part of the GNU MP Library.

The GNU MP Library is free software; you can redistribute it and/or modify
it under the terms of the GNU Lesser General Public License as published by
the Free Software Foundation; either version 3 of the License, or (at your
option) any later version.

The GNU MP Library is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY
or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU Lesser General Public
License for more details.

You should have received a copy of the GNU Lesser General Public License
along with the GNU MP Library.  If not, see http://www.gnu.org/licenses/.



                        AMD K6 MPN SUBROUTINES



This directory contains code optimized for AMD K6 CPUs, meaning K6, K6-2 and
K6-3.

The mmx subdirectory has MMX code suiting plain K6, the k62mmx subdirectory
has MMX code suiting K6-2 and K6-3.  All chips in the K6 family have MMX,
the separate directories are just so that ./configure can omit them if the
assembler doesn't support MMX.



STATUS

Times for the loops, with all code and data in L1 cache, are as follows.

                                   cycles/limb

        mpn_add_n/sub_n            3.25 normal, 2.75 in-place

        mpn_mul_1                  6.25
        mpn_add/submul_1           7.65-8.4  (varying with data values)

        mpn_mul_basecase           9.25 cycles/crossproduct (approx)
        mpn_sqr_basecase           4.7  cycles/crossproduct (approx)
                                   or 9.2 cycles/triangleproduct (approx)

        mpn_l/rshift               3.0

        mpn_divrem_1              20.0
        mpn_mod_1                 20.0
        mpn_divexact_by3          11.0

        mpn_copyi                  1.0
        mpn_copyd                  1.0


K6-2 and K6-3 have dual-issue MMX and get the following improvements.

        mpn_l/rshift               1.75


Prefetching of sources hasn't yet given any joy.  With the 3DNow "prefetch"
instruction, code seems to run slower, and with just "mov" loads it doesn't
seem faster.  Results so far are inconsistent.  The K6 does a hardware
prefetch of the second cache line in a sector, so the penalty for not
prefetching in software is reduced.



NOTES

All K6 family chips have MMX, but only K6-2 and K6-3 have 3DNow.

Plain K6 executes MMX instructions only in the X pipe, but K6-2 and K6-3 can
execute them in both X and Y (and in both together).

Branch misprediction penalty is 1 to 4 cycles (Optimization Manual
chapter 6 table 12).

Write-allocate L1 data cache means prefetching of destinations is
unnecessary.

Store queue is 7 entries of 64 bits each.

Floating point multiplications can be done in parallel with integer
multiplications, but there doesn't seem to be any way to make use of this.



OPTIMIZATIONS

Unrolled loops are used to reduce looping overhead.  The unrolling is
configurable up to 32 limbs/loop for most routines, up to 64 for some.

Sometimes computed jumps into the unrolling are used to handle sizes not a
multiple of the unrolling.  An attractive feature of this is that times
smoothly increase with operand size, but an indirect jump is about 6 cycles
and the setups about another 6, so it depends on how much the unrolled code
is faster than a simple loop as to whether a computed jump ought to be used.

Position independent code is implemented using a call to get eip for
computed jumps and a ret is always done, rather than an addl $4,%esp or a
popl, so the CPU return address branch prediction stack stays synchronised
with the actual stack in memory.  Such a call however still costs 4 to 7
cycles.

Branch prediction, in absence of any history, will guess forward jumps are
not taken and backward jumps are taken.  Where possible it's arranged that
the less likely or less important case is under a taken forward jump.



MMX

Putting emms or femms as late as possible in a routine seems to be fastest.
Perhaps an emms or femms stalls until all outstanding MMX instructions have
completed, so putting it later gives them a chance to complete on their own,
in parallel with other operations (like register popping).

The Optimization Manual chapter 5 recommends using a femms on K6-2 and K6-3
at the start of a routine, in case it's been preceded by x87 floating point
operations.  This isn't done because in gmp programs it's expected that x87
floating point won't be much used and that chances are an mpn routine won't
have been preceded by any x87 code.



CODING

Instructions in general code are shown paired if they can decode and execute
together, meaning two short decode instructions with the second not
depending on the first, only the first using the shifter, no more than one
load, and no more than one store.

K6 does some out of order execution so the pairings aren't essential, they
just show what slots might be available.  When decoding is the limiting
factor things can be scheduled that might not execute until later.



NOTES

Code alignment

- if an opcode/modrm or 0Fh/opcode/modrm crosses a cache line boundary,
  short decode is inhibited.  The cross.pl script detects this.

- loops and branch targets should be aligned to 16 bytes, or ensure at least
  2 instructions before a 32 byte boundary.  This makes use of the 16 byte
  cache in the BTB.

Addressing modes

- (%esi) degrades decoding from short to vector.  0(%esi) doesn't have this
  problem, and can be used as an equivalent, or easier is just to use a
  different register, like %ebx.

- K6 and pre-CXT core K6-2 have the following problem.  (K6-2 CXT and K6-3
  have it fixed, these being cpuid function 1 signatures 0x588 to 0x58F).

  If more than 3 bytes are needed to determine instruction length then
  decoding degrades from direct to long, or from long to vector.  This
  happens with forms like "0F opcode mod/rm" with mod/rm=00-xxx-100 since
  with mod=00 the sib determines whether there's a displacement.

  This affects all MMX and 3DNow instructions, and others with an 0F prefix,
  like movzbl.  The modes affected are anything with an index and no
  displacement, or an index but no base, and this includes (%esp) which is
  really (,%esp,1).

  The cross.pl script detects problem cases.  The workaround is to always
  use a displacement, and to do this with Zdisp if it's zero so the
  assembler doesn't discard it.

  See Optimization Manual rev D page 67 and 3DNow Porting Guide rev B pages
  13-14 and 36-37.

Calls

- indirect jumps and calls are not branch predicted, they measure about 6
  cycles.

Various

- adcl      2 cycles of decode, maybe 2 cycles executing in the X pipe
- bsf       12-27 cycles
- emms      5 cycles
- femms     3 cycles
- jecxz     2 cycles taken, 13 not taken (optimization manual says 7 not
            taken)
- divl      20 cycles back-to-back
- imull     2 decode, 3 execute
- mull      2 decode, 3 execute (optimization manual decoding sample)
- prefetch  2 cycles
- rcll/rcrl implicit by one bit: 2 cycles
            immediate or %cl count: 11 + 2 per bit for dword
                                    13 + 4 per bit for byte
- setCC     2 cycles
- xchgl     %eax,reg  1.5 cycles, back-to-back (strange)
            reg,reg   2 cycles, back-to-back



REFERENCES

"AMD-K6 Processor Code Optimization Application Note", AMD publication
number 21924, revision D amendment 0, January 2000.  This describes K6-2 and
K6-3.  Available on-line,

http://vip.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/21924.pdf

"AMD-K6 MMX Enhanced Processor x86 Code Optimization Application Note", AMD
publication number 21828, revision A amendment 0, August 1997.  This is an
older edition of the above document, describing plain K6.  Available
on-line,

http://vip.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/21828.pdf

"3DNow Technology Manual", AMD publication number 21928G/0-March 2000.
This describes the femms and prefetch instructions, but nothing else from
3DNow has been used.  Available on-line,

http://vip.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/21928.pdf

"3DNow Instruction Porting Guide", AMD publication number 22621, revision B,
August 1999.  This has some notes on general K6 optimizations as well as
3DNow.  Available on-line,

http://vip.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/22621.pdf



----------------
Local variables:
mode: text
fill-column: 76
End:
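
APPENDIX: COMPUTED JUMP BREAK-EVEN (ILLUSTRATIVE)

The trade-off described under OPTIMIZATIONS (about 6 cycles for the indirect
jump plus about 6 for the setups, set against the per-limb saving of the
unrolled code) can be sketched as a small cost model.  This is only a rough
illustration and not code from GMP: the function names are hypothetical, the
two 6 cycle constants are the approximate figures quoted above, and the
cycles/limb values passed in are whatever the caller has measured.

```c
#include <stdbool.h>
#include <stddef.h>

/* Approximate figures quoted in this README; not exact hardware constants. */
#define INDIRECT_JUMP_CYCLES 6.0
#define JUMP_SETUP_CYCLES    6.0

/* Estimated cycles for n limbs in a plain (non-unrolled) loop.
   simple_cpl is the caller's measured cycles/limb for that loop. */
double simple_loop_cycles(size_t n, double simple_cpl)
{
    return (double) n * simple_cpl;
}

/* Estimated cycles for n limbs when entering an unrolled loop through a
   computed jump: fixed jump and setup overhead plus the unrolled per-limb
   cost.  unrolled_cpl is the caller's measured cycles/limb. */
double computed_jump_cycles(size_t n, double unrolled_cpl)
{
    return INDIRECT_JUMP_CYCLES + JUMP_SETUP_CYCLES
           + (double) n * unrolled_cpl;
}

/* True when, under this simple model, the computed jump version is
   strictly cheaper than the plain loop for n limbs. */
bool computed_jump_pays_off(size_t n, double simple_cpl, double unrolled_cpl)
{
    return computed_jump_cycles(n, unrolled_cpl)
           < simple_loop_cycles(n, simple_cpl);
}
```

For instance, taking the 3.25 cycles/limb listed for mpn_add_n as the
unrolled cost and an assumed (not measured) 4.0 cycles/limb for a simple
loop, the 12 cycle overhead is only recovered once n exceeds 16 limbs, since
the saving is 0.75 cycles per limb.  The model ignores branch misprediction
and alignment effects, so real measurements may differ.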
