The editor shares chip design, verification, DFT, and backend full process knowledge as well as a wealth of technical documents in the knowledge community. Everyone is welcome to join the discussion and learning for mutual progress!
Tonight’s student Q&A on MIPI RTL design is as follows, time is limited, and everyone is welcome to add more:
1. In the image below, why should we replace the one in the red box above with the one below?
Answer: To perform DFT testing, DFT must ensure that the clock is controllable.
2.This cross-shot stitching phenomenon generally occurs under what conditions in the circuit? Can you provide an example?
Answer: To find b8 within a byte, but b8 may be stitched across two shots, so there needs to be 8 cases to consider. For example, the lower bit first sends 0xb8, which is binary 0001_1101, which may be split into two bytes.
3.In the code, does (div1 & ~div0) refer to the red box in the image below, position 1 or 2? (Is it position 2? Why use this logic instead of directly using the falling edge of div1 as a condition?)
Answer: The red circle is 2, which is essentially a quarter frequency, collecting 1 byte of data.
4.In the code, is it transmitted across clock thresholds? (Or, should they be considered synchronous clocks since they are multiples of each other and do not require synchronization processing?)
Answer: Good question, byte_clk is the clock derived from csi_dck. The essence is that the phases are the same, just the frequencies are different, so no synchronization processing is needed.
5.It seems that the state machine designed for the lane_aligner module does not adopt the commonly used three-stage or two-stage state machine implementation? What type does this method belong to (two-stage timing without combinational logic)? (Are there any considerations when designing with state machines?)
Answer: It is still advisable to use a three-stage state machine. In actual projects, this writing method is still quite common, saving lines of code, effectively merging timing and combinational logic. It depends on personal habits.
6.Is it necessary to pay attention to reset and clock design here? (When does the clock cycle start, where does one cycle start and where does it end? Are there any requirements when designing this?)
Answer: It depends on the data format of the sender. The sender and receiver must match. For example, in this project, the dck clock’s falling edge collects the first bit of data, and the dck’s rising edge collects the second bit of data, which must be consistent with the sender.
////////////////////////////////////////Protocol//////////////////////////////////
7.What is the HS_ZERO field? How does the first image correspond to the second image, and which part corresponds? (SOT: HS_ZERO+SYNC BYTE? (Fixed two bytes?) EOT: TRAIL? (Fixed one byte?))
Answer: Good question, HS_ZERO is 0, which needs to be continuously sent for a period of time (as stipulated by the protocol), followed by SYNC BYTE, that is, SOT: HS_ZERO+SYNC BYTE, TRAIL is sending the last 1 bit of valid data inverted for a period of time to allow DPHY to enter LP state.
8.What is the complete structure of a long frame? In the first image, how is the frame size arranged after the frame tail, and what positions do they occupy in the third image? Why are they missing? (What fields are used in the complete frame design of this project?)
Answer: The frame size introduces the length of the frame, which is determined by the two words wc, and wc stands for word count.
9.When a packet is divided into two lanes for transmission, does each channel need to have a SoT and EoT? (Are SoT and EoT each one byte?) When receiving, are SoT and EoT not received (if received, wouldn’t the two channels receive two SoTs and two EoTs)? Is it only responsible for detecting SoT and EoT? (Is the receiving part only DataN?)
Answer: SoT and EoT are not simple bytes, but can be understood as a series of sequences, for example, SoT: HS_ZERO+SYNC BYTE, HS_ZERO needs to be sent for a period of time to allow DPHY to enter HS state. TRAIL is sending the last 1 bit of valid data inverted for a period of time to allow DPHY to enter LP state.
10.Is in0_00 the start SoT flag of the frame? Is in0_b8 the frame header flag? (What is the entire data protocol? Does it start receiving bytes when it detects the frame header b8? Is there anything to detect before the frame header? What about after?)
Answer: HS_ZERO needs to be sent for a period of time to allow DPHY to enter HS state, which means it needs to receive 00 for a short period of time, and then when b8 is detected, it indicates that the frame header has been detected.
11.What are the requirements for the arrangement of the data inside? (Why not take the last shot of data to start (in0_dly3)?)
Answer: Here is the ECC checksum data, which needs to consider the situation where two lanes are misaligned by one shot and complete the stitching.
12.Why is only one shot needed? (What is deskew? Only after detecting the synchronization header b8 does it “start” merging, and the previous bytes are discarded?)
Answer: The PHY protocol strictly limits the time of skew, and the two lanes may have different times. The time detected for b8 may also be different, requiring deskew to complete data stitching.
Chip Full Process Training
Chip Trainingis about to increase in price, please contact the editor via WeChat: 13541390811
-
Jingxin SoC front, middle, and backend low-power full-process practical training(Prices are less than 1/6 of the industry standard)
-
Jingxin SoC front-end video + documentation + practical + one-on-one lifetime guidance practical training
-
Jingxin SoC mid-end video + documentation + practical + one-on-one lifetime guidancepractical training
-
Jingxin SoC back video + documentation + practical + one-on-one lifetime guidancepractical training
-
12nm 2.5GHz A72 low-power DVFS practical training(Prices are less than 1/3 of the industry standard)
-
12nm 2.5GHz A72 DFT practical training(Prices are less than 1/3 of the industry standard)
-
RISC-V MCU 40nm full-chip PR practical training
-
DDR4/3 project practical training
-
MIPI CSI/DSI + ISP image processing + H.265 + USB + MAC practical training
-
Advanced Codec H.265/H.264 practical training
Introduction to chip Design Service design services:
-
Provides design, verification, DFT design services for SoC, MCU, ISP, CIS, etc.
-
Provides IP design for DDR/PCIE/MIPI/CAN/USB/ETH/QSPI/UART/I2C, etc.
-
Provides backend design for 7nm, 12nm, 28nm, 40nm, 55nm, 65nm, 90nm, etc.
-
Provides customized chip design services and training for universities and enterprises
Additionally, the editor shares chip design, verification, DFT, and backend full process knowledge as well as a wealth of technical documents in the knowledge community. Everyone is welcome to join the discussion and learning for mutual progress!
【Let every student be able to design a SoC/MCU chip】
【The only one on the internet】Jingxin SoC is a low-power ISP image processing SoC used for【chip full process design training】, using a low-power RISC-V processor, built-in ITCM SRAM, DTCM SRAM, integrated with IPs including MIPI, ISP, USB, QSPI, UART, I2C, GPIO, Ethernet MAC controller, and designed using SMIC40 process flow.
-
High-speed interface Verilog design implementation -
From image algorithm to RTL design implementation -
MIPI, ISP Verilog implementation and simulation -
Lint, CDC checks and UVM verification -
Post-simulation
Just the front-end course content is equivalent to 5-6 courses from other training institutions.
-
DFT design (chip-level) -
Synthesis logic synthesis (chip-level) -
Low-power UPF design, CLP technology -
Formal verification and other technologies
(3) In the SoC back-end course, you will learnlow-power design
-
Layout and routing (low-power FF flow)
-
StarRC/QRC -
STA/Tempus -
Power analysis -
DRC/LVS design
Just the back-end course content is equivalent to 3-4 courses from other training institutions.
Join the knowledge community and become part of the chip design knowledge repository!
The Knowledge Community of the SoC Training Camp
Today, we’ll introduce here, and the SoC courses provide servers for everyone to practice! Take you through the full process from algorithms, front-end, DFT to back-end SoC project design.
Please contact the account owner to sign up! Contact WeChat: 135-4139-0811
The data path of the Jingxin SoC training camp for image processing:
The CRG design of the Jingxin SoC:
One-click completion of full process for C code compilation, simulation, synthesis, DFT insertion, formal verification, layout and routing, parasitic parameter extraction, STA analysis, DRC/LVS, post-simulation, formal verification, power analysis, etc. The upgraded chip design project V2.0 flow is as follows:
SoC One-click Execution Flow
MIPI DPHY+CSI2 Decoding
Classic design in digital circuits: Implementation of multiple communication data Lane Merging design
Classic design in digital circuits: Implementation of multiple communication data Lane Distribution
UPF Low Power Design
Full chip UPF low power design (including DFT design)
The Jingxin SoC training project, before low power design, the power consumption was 27.9mW.
After low power design, the power consumption is 0.285mW, a reduction of 98.9%!
Check the voltage drop:
Low power check:
Chip layout design V1.0
Chip layout design V2.0
The DRC/LVS for low power design is highly valuable and challenging in the chip top layer! Unique experience sharing in the industry.
ISP Image Processing
-
dpc – Bad pixel correction
-
blc – Black level correction
-
bnr – Bayer noise reduction
-
dgain – Digital gain
-
-
demosaic – Demosaicing
-
wb – White balance gain
-
ccm – Color correction matrix
-
csc – Color space conversion (RGB2YUV conversion formula based on integer optimization)
-
gamma – Gamma correction (brightness based on table lookup gamma correction)
-
ee – Edge enhancement
-
stat_ae – Automatic exposure statistics
-
stat_awb – Automatic white balance statistics
CNN Image Recognition
Support handwritten digit AI recognition:
Simulation results: Simulated recognition of the digits 7, 2, 1, 0, 4, 1, 4, 9
The Jingxin Micro SoC Team’s Business
The Jingxin Micro SoC team’s business introduction:
-
Provides the most comprehensive practical chip project training on the internet
-
Provides chip design services for SoC, MCU, ISP, CIS, etc.
-
Provides chip design, verification, DFT design services
-
Provides backend design services, processes include 7nm, 12nm, 28nm, 40nm, 55nm, 65nm, 90nm, 110nm, 180nm, and provides tape-out channels
-
Provides customized chip design training and employment recommendations and guidance for universities and enterprises
Rich tutorial documents
In the 7-day sprint PR training camp, a student asked how to add PAD to IO? Please think about how to achieve the best between Jingxin SoC’s IO and PAD?
Join the knowledge community and become part of the chip design knowledge repository!
The Knowledge Community of the SoC Training Camp
SoC training camp VIP students asked why low-power cells were not inserted into the netlist?
Although asking questions is encouraged, when we encounter problems, we should first think proactively and solve problems actively. If we really can’t solve it, we can seek help. This way, we grow faster.
First, the editor received this question and opened the log. Checking the log is a virtue in IC design! I found that after the EDA tool ingested the UPF file, it reported the following warning, using many ff libraries.
The above error indicates that the UPF specifies a voltage of 0.99V, while the CELL is at 1.20V. The voltage mismatch leads to the logic synthesis resulting in the netlist not inserting MV CELL. Note that during synthesis, we used the ss library, so how could it be the ff library? Opening the constraint script revealed the following bug:
So, change the constraint to replace ff with ss. Run it again, and the result will come out:
In the low-power design section, I will leave a small bug inside the design, which is very simple. I will not disclose the answer. Everyone must think more and actively run this, so that you can learn deeply.
Training students please note that the Jingxin SoC uses the always-on power domain voltage domain’s pwrdown_mux signal as the switch control signal for the power switch cell. However, there is a small issue in the design that is not really a bug, but for the UPF low-power flow, it is undoubtedly a bug!
Training students please log into the server to track the pwrdown signal, paying attention to its load situation. The RTL code is as follows:
Combining the PR output netlist, we perform a CLP low-power check. The script can be found on the server. It can be seen that the CLP reports an error saying that the switch control signal of the power switch cannot be found.
One cannot help but say how important CLP checks are for low power and how important they are for chip design! The value of Jingxin SoC lies in connecting all these small knowledge points of chip design full process together! What are you waiting for? Hurry up and sign up!
In the mid-process of chip design, Jingxin SoC will insert isolation and other cells into the UPF constraints, but cannot insert power switch cells, therefore, the control signal pwrdown_mux for the power switch cell will be optimized away (removed) without load. Therefore, it is necessary to set the above MUX device as dont_touch or make pwrdown_mux a module port and prohibit auto_ungroup (and set no_boundary_optimization), so that this signal can be retained for the backend to implement power switch control. Students please complete the code modification based on the full set flow environment to complete the following tasks:
-
Lint check, front-end simulation,
-
Complete mid-end and back-end flow,
-
Complete CLP check, complete post-simulation
In the 7-day sprint PR training camp, a student asked why the same floorplan had some students running quickly while others encountered a lot of DRC issues (the EDA tool kept iterating), causing the tool to never finish running. What exactly is the problem?
First, the editor found that the student defined the stripe TM2 as horizontal, while those familiar with the Jingxin process know that TM2’s preference direction is VERTICAL.
Checking the Jingxin lef library file can also confirm:
What impact does using the wrong direction have? Everyone should practice on Jingxin SoC’s backend flow and experience it firsthand.
In the 7-day sprint PR training camp, a student asked why PR took a full day and night, 24 hours to complete routing and still had many DRC errors? The editor has tried to minimize the design scale to speed up PR design, and in fact, routing can be completed in 2 hours. Why is it so slow? The reason is the routing of low-power units. Specific reasons and solutions are welcome to join the Jingxin training camp for discussion.
The errors are mainly concentrated in M4, please think about how to solve them.
In the 7-day sprint PR training camp, a student asked why the second PG pin (VDDG) of the power switch cell was connected from M1 instead of M2. Please think about what the problem is? How to solve it?
In the 7-day sprint PR training camp, a student asked how to handle the LVS of corner pads that Jingxin SoC training camp students encountered?
After completing the front-end design simulation and DFT of Jingxin SoC training, we arrive at the backend flow. This tutorial teaches you how to complete the digital backend flow in one click.
The generated script command is as follows:
tclsh ./SCRIPTS/gen_flow.tcl -m flat all
Before generating the flow script, it is necessary to configure parameters such as setup.tcl. For more details, please refer to the unique full-stack chip engineer training provided by Jingxin SoC for front-end engineering, DFT engineering, and backend engineering, guiding you through the full process from algorithms, front-end, DFT to backend SoC project design.
The students in the Jingxin SoC training camp asked why innovus reported an error when reading the prepared floorplan def file? First, look at the log:
Reading floorplan file – ./data_in/DIGITAL_TOP.def (mem = 1595.0M).
#% Begin Load floorplan data … (date=10/23 22:38:01, mem=1579.3M)
**ERROR: (IMPFP-710): File version unknown is too old.
In the past during the EDI period, we could load the floorplan by defining fp_file:
set vars(fp_file) “./data_in/DIGITAL_TOP.def”
However, now innovus has upgraded and abandoned the loading method of fp_file. Of course, we can also use the old version of EDI9.1 and earlier to add fp_file and then save it as a new version, but this method is obviously unnecessary. As the log suggests, checking the log is a great engineering habit.
Input floorplan file is too old and is not supported in EDI 10.1 and newer.
You can use EDI 9.1 and before to read it in, then save again to create new version.
The editor’s intuition tells me to first check which def version the student saved?
The student’s saving method is as follows:
So how to solve it? Everyone please join the Jingxin training camp for practice.
Jingxin SoC uses many asynchronous FIFOs. Students interested in asynchronous RTL implementation can extract the asynchronous FIFO to see the layout connections:
Check the area of all asynchronous FIFO cells;
dbget [dbget top.insts.pstatus unplaced -p].area
Check the names of all asynchronous FIFO cells:
dbget [dbget top.insts.pstatus unplaced -p].name
Then how to extract the asynchronous paths to observe the layout routing? How to report timing? For more content, please refer to the knowledge community and SoC training camp.