cpu-architecture | 易学教程

Why the RISC instruction sets usually do not contain register to register copy instruction?

阅读更多关于 Why the RISC instruction sets usually do not contain register to register copy instruction?

问题 I had this question on my exam and i am confused because as far as i know that move $t0, $a0 # COPY $A0 TO $T0 in MIPS instruction provides that and MIPS is a RISC processor. Am I missing something? 回答1: Move is a pseudoinstruction, and when assembled will really be a different instruction. For instance move $t0, $zero gets implemented as addu $t0, $zero, $zero 来源： https://stackoverflow.com/questions/53286335/why-the-risc-instruction-sets-usually-do-not-contain-register-to-register-copy-i

Are caches of different level operating in the same frequency domain?

阅读更多关于 Are caches of different level operating in the same frequency domain?

问题 Larger caches are usually with longer bitlines or wordlines and thus most likely higher access latency and cycle time. So, does L2 caches work in the same domain as L1 caches? How about L3 cache (slices) since they are now non-inclusive and shared among all the cores? And related questions are: Are all function units in a core in the same clock domain? Are the uncore part all in the same clock domain? Are cores in the multi-core system synchronous? I believe clock domain crossing would

How many instructions need to be killed on a miss-predict in a 6-stage scalar or superscalar MIPS?

阅读更多关于 How many instructions need to be killed on a miss-predict in a 6-stage scalar or superscalar MIPS?

问题 I am working on a pipeline with 6 stages: F D I X0 X1 W. I am asked how many instructions need to be killed when a branch miss-predict happens. I have come up with 4. I think this because the branch resolution happens in X1 and we will need to kill all the instructions that came after the branch. In the pipeline diagram, it looks like it would require killing 4 instructions that are in the process of flowing through the pipeline. Is that correct? I am also asked how many need to be killed if

Does MSVC 2017 support automatic CPU dispatch?

阅读更多关于 Does MSVC 2017 support automatic CPU dispatch?

问题 I read on a few sites that MSVC can actually emit say AVX instructions, when SSE2 architecture is used and detect the AVX support runtime. Is it true? I tested various loops that would definitely benefit from AVX/AVX2 support, but when run in debugger I couldn't really find any AVX instructions. When /arch:AVX is used, then it emits AVX instructions, but it of course crashes on CPUs that doesn't support it (tested), so no runtime detection either. I could use AVX intrinsics though and it

On x86-64, is the “movnti” or “movntdq” instruction atomic when system crash?

阅读更多关于 On x86-64, is the “movnti” or “movntdq” instruction atomic when system crash?

问题 When using persistent memory like Intel optane DCPMM, is it possible to see partial result after reboot if system crash(power outage) in execution of movnt instruction? For: 4 or 8 byte movnti which x86 guarantees atomic for other purposes? 16-byte SSE movntdq / movntps which aren't guaranteed atomic but which in practice probably are on CPUs supporting persistent memory. 32-byte AVX vmovntdq / vmovntps 64-byte AVX512 vmovntdq / vmovntps full-line stores bonus question: MOVDIR64B which has

On x86-64, is the “movnti” or “movntdq” instruction atomic when system crash?

阅读更多关于 On x86-64, is the “movnti” or “movntdq” instruction atomic when system crash?

Why doesn't Ice Lake have MOVDIRx like tremont? Do they already have better ones?

阅读更多关于 Why doesn't Ice Lake have MOVDIRx like tremont? Do they already have better ones?

问题 I notice that Intel Tremont has 64 bytes store instructions with MOVDIRI and MOVDIR64B. Those guarantees atomic write to memory, whereas don't guarantee the load atomicity. Moreover, the write is weakly ordered, immediately followed fencing may be needed. I find no MOVDIRx in IceLake. Why doesn't Ice Lake need such instructions like MOVDIRx ? (At the bottom of page 15) Intel® ArchitectureInstruction Set Extensions and Future FeaturesProgramming Reference https://software.intel.com/sites

What happens with nested branches and speculative execution?

阅读更多关于 What happens with nested branches and speculative execution?

问题 Alright, so I know that if a particular conditional branch has a condition that takes time to compute (memory access, for instance), the CPU assumes a condition result and speculatively executes along that path. However, what would happen if, along that path, yet another slow conditional branch pops up (assuming, of course, that the first condition hasn't been resolved yet and the CPU can't just commit the changes)? Does the CPU just speculate inside the speculation? What happens if the last

What happens with nested branches and speculative execution?

阅读更多关于 What happens with nested branches and speculative execution?

What happens with nested branches and speculative execution?

阅读更多关于 What happens with nested branches and speculative execution?