Editing Syscon Error Codes


The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 228: Line 228:
*Components Involved:
*Components Involved:
**[[Power Supply]]
**[[Power Supply]]
When a console loses AC power, it may generate error A0801004. During the shutdown sequence, voltage regulators are turned off sequentially, in the reverse order of the Power On Sequence (POS). This gives components enough time to enter a power-off state and helps prevent data corruption and voltage spikes or discharges that can damage sensitive components. 1004 errors commonly occur in machines that can power on but have graphical artifacts or no video, also known as GLOD. In such cases, the user has no choice but to power off improperly. Forcing a shutdown using the power rocker at the back of the console (for Phat models), or by pulling the power cord (for slim and super slim models), will cause an unexpected loss of AC power and prevent the SYSCON from completing the shutdown sequence.
When a console loses AC power, it may generate error A0801004. The 1004 error commonly occurs in machines that can power on but have graphical artifacts or no video, also known as GLOD. In such cases, the system should be powered off by using the power rocker at the back of the console (for Phat models), or by pulling the power cord (for slim and super slim models). This will cause a loss of AC power, which can lead to the error.




This error may be disregarded if it occurred due to abnormal circumstances, such as a power outage or accidental unplugging. Since it did not result from a hardware malfunction, it is not a significant concern. If a console displays artifacts or the GLOD, the main issue should be addressed first, typically involving a GPU problem requiring a reball or replacement. Afterward, if 1004 errors reoccur, the AC/DC line, PSU, and its connection to the DC-DC converters should be diagnosed.
This error may be disregarded if it occurred due to abnormal circumstances, such as a power outage or accidental unplugging. Since it did not result from a hardware malfunction, it is not a significant concern. If a console displays artifacts or the GLOD, the main issue should be addressed first, typically involving a GPU problem requiring a reball or replacement. Afterward, if 1004 errors reoccur, the AC/DC line, PSU, and its connection to the DC-DC converters should be diagnosed.
==== 1005 ====
On an NPX-001 Super Slim motherboard I was experiencing an A0091005 error. I started looking at the CELL power rail and replacing components; after replacing all 5 aluminum polymer capacitors on the CELL side, the error was gone. This seems to be a Super Slim only error, but more testing is needed to see if it occurs on other motherboards. Another user reported fixing the same error code by replacing the same capacitors. Sometimes this error is paired with a 3003 error. I also had a PQX-001 board with A0801005 followed by A0043005 and found a short on the RSX, on what I believe to be FBVDDQ for the 28nm. I was unable to fix this issue, as the line was still shorted after removing every component.


==== 1103 (Thermal Alert SYSTEM) ====
==== 1103 (Thermal Alert SYSTEM) ====
Line 268: Line 264:
**[[Thermal#Temperature_Monitors|Temperature Monitors]]
**[[Thermal#Temperature_Monitors|Temperature Monitors]]


Some PS3 motherboards ([[TMU-520]], [[COK-001]], [[COK-002]]) have a temperature monitor located somewhere in the CELL power block. The other retail PS3 motherboard models don't measure the temperature of the CELL VR.
Some PS3 motherboards ([[TMU-520]], [[COK-001]], [[COK-002]]), have a temperature monitor located somewhere in the CELL power block. The other retail PS3 motherboard models doesnt meassures the temperature of the CELL VR


All the PS3 temperature monitor chips have an integrated internal thermal sensor plus 2 pins for an optional external sensor. The temperature monitors for CELL and RSX are configured to use the external sensor, but the one for CELL VR probably uses the internal one.
All the PS3 temperature monitor chips have an integrated internal thermal sensor plus 2 pins for an optional external sensor. The temperature monitors for CELL and RSX are configured to use the external sensor, but the one for CELL VR probably uses the internal one.
Line 314: Line 310:


As the console cools the microscopic gaps in the solder can be physically reconnected by thermal warping. Warping is due to differences in the Coefficients of Thermal Expansion (CTE) between materials in the motherboard and processor. This expansion and contraction can reconnect the solder joints just enough to allow the console to boot. Or it may disconnect them.  
As the console cools the microscopic gaps in the solder can be physically reconnected by thermal warping. Warping is due to differences in the Coefficients of Thermal Expansion (CTE) between materials in the motherboard and processor. This expansion and contraction can reconnect the solder joints just enough to allow the console to boot. Or it may disconnect them.  
*If they reconnected, the console will boot until it experiences another 1601/1701 event.
*If they reconnected, the console will boot until it experiances another 1601/1701 event.
*If they do not reconnect, the console cannot complete BitTraining and will fail in POST with error A0403034, often with an associated Data error such as A0404401 (if the broken solder joint affected a Data line on one of the SPI lines). If there is no Data error, the broken joint only affected the voltage for the SPI line: either RSX_VDDR or YC_RC_VDDIO.
*If they do not reconnect, the console cannot complete BitTraining and will fail in POST with error A0403034, often with an associated Data error such as A0404401 (if the broken solder joint affected a Data line on one of the SPI lines). If there is no Data error, the broken joint only affected the voltage for the SPI line: either RSX_VDDR or YC_RC_VDDIO.


Line 330: Line 326:
*To calibrate the FlexIO interface (BitTraining).
*To calibrate the FlexIO interface (BitTraining).


If Attention occurs during the Power ON State (Step# 80) it indicates an error condition. Basically, something is flagged by the Processor as abnormal, and it is forced to attempt to resolve the problem before it can continue with whatever it was trying to do. If the error condition cannot be resolved, the CPU sends the ATTENTION signal to the SYSCON. The SYSCON immediately shuts off the console, then reads the SPI Status Register to determine the cause. Then it records A0801701 in its errorlog along with the specific cause (if it determined one). Errors that can cause the Attention include:
If Attention occurs during the Power ON State (Step# 80) it indicates an error condition. Basically, something is flagged by the Processor as abnormal. It's forced to attempt to resolve the problem before it can continue with whatever it was trying to do. If the error condition cannot be resolved, the CPU sends the ATTENTION signal to the SYSCON. The SYSCON immediately shuts off the console, then reads the SPI Status Register to determin the cause. Then it records the A0801701 in it's errorlog along with the specific cause (if it determined one). Errors that can cause the Attention include:
*Unresolved Checkstop errors (14FF)
*Unresolved Checkstop errors (14FF)
*Livelock Detection (1601)
*Livelock Detection (1601)
Line 336: Line 332:
*BGA/Bump Defect that occurs while the Console was On (Step# 80). Subsequent attempts to power on the console would result in 3034/4xxx errors.
*BGA/Bump Defect that occurs while the Console was On (Step# 80). Subsequent attempts to power on the console would result in 3034/4xxx errors.
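The error strings on this page appear to follow a fixed layout: an "A0" prefix, two characters for the step number (for example 80 for the Power ON State), and four characters for the error code. Below is a minimal Python sketch for splitting an entry into those parts. It assumes the layout holds for every entry; this is inferred from the examples on this page, not from SYSCON documentation. The names <code>SysconError</code> and <code>parse_syscon_error</code> are hypothetical.

```python
from typing import NamedTuple


class SysconError(NamedTuple):
    step: str  # two characters after the "A0" prefix, e.g. "80"
    code: str  # last four characters, e.g. "1701"


def parse_syscon_error(raw: str) -> SysconError:
    """Split an errorlog entry such as 'A0801701' into step and code.

    Assumption: every entry is 8 characters long and starts with 'A0'.
    """
    text = raw.strip().upper()
    if len(text) != 8 or not text.startswith("A0"):
        raise ValueError(f"unexpected error format: {raw!r}")
    return SysconError(step=text[2:4], code=text[4:])


entry = parse_syscon_error("A0801701")
print(entry.step, entry.code)
```

Reading the step and the code separately makes it easier to compare entries, since the same code can be logged at different steps.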


A user got this error code with a damaged hard drive. He was transferring some games via FTP when his console turned off with a YLOD. When he tried to turn it on again, he got a GLOD. The problem was fixed just by changing the HDD.
A user get this error code with a damaged hard drive. He was transfering some games via FTP, and his console turned off with Ylod. When he tried to turn on again, he get a Glod. Problem was fixed just by changing the HDD.


1701 has been reported from using homebrew apps that caused a software conflict. Uninstalling the software can resolve the issue. If that's not possible because the system is locked up, it may be necessary to restore the operating system (OS).
1701 has been reported from using homebrew apps that caused a software conflict. Uninstalling the software can resolve the issue. If that's not possible because the system is locked up, it may be necessary to restore the operating system (OS).
Line 359: Line 355:
=== Fatal Errors ===
=== Fatal Errors ===
----
----
*These fatal error codes seem to be repeated up to 3 times for 3 special cases. For example, errors '''20'''03, '''21'''03, and '''22'''03 are related to the southbridge; the only thing that changes in the error code is the second digit (located immediately after the category 2). If at some point we find what that second digit means, we can join the wiki page sections together (with titles: "2001 & 2101", "2002 & 2102", "2003 & 2103", etc...)<br>
*This fatal error codes seems to be repeated up to 3 times for 3 special cases, as example, errors '''20'''03, '''21'''03, and '''22'''03 are related with southbridge, the only thing that changes in the error code is the second digit (located inmediatly after the category 2). If at some point we find what means that second digit we can join the wiki page sections together (with titles: "2001 & 2101", "2002 & 2102", "2003 & 2103", etc...)<br>
In other words, there are 3 groups: '''20xx''' (composed of 13 errors), '''21xx''' (composed of 13 errors), and '''22xx''' (composed of 1 error). See {{Talk}}
In other words, there are 3 groups: '''20xx''' (composed of 13 errors), '''21xx''' (composed of 13 errors), and '''22xx''' (composed of 1 error). See {{Talk}}
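The grouping idea can be tried out on an errorlog before the meaning of the second digit is known. This is a rough Python sketch, assuming codes are handled as four-character strings and that sharing the last two digits is what relates them. The function name <code>group_by_suffix</code> is made up for illustration.

```python
from collections import defaultdict


def group_by_suffix(codes):
    """Group four-digit codes that share their last two digits.

    Assumption: '2003', '2103' and '2203' belong together because only
    the second digit differs, as described for the fatal errors.
    """
    groups = defaultdict(list)
    for code in codes:
        groups[code[-2:]].append(code)
    return {suffix: sorted(members) for suffix, members in groups.items()}


print(group_by_suffix(["2003", "2103", "2203", "2001", "2101"]))
```

Each key in the result corresponds to one candidate merged section title, such as "2003 & 2103".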


Line 418: Line 414:
**[[South Bridge]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC3101 on [[COK-001]])
**[[South Bridge]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC3101 on [[COK-001]])


==== 2040 ====
==== 2040 (Super Slim CELL BE) ====  
During sabotage testing on a KTE-001 board, I found that removing F6300 caused an A0012040 error. This fuse appears to be on the 12v line.


==== 2044 (Super Slim short circuit - BT/Wi-Fi and 5Volt) ====  
==== 2044 (Super Slim short circuit - BT/Wi-Fi and 5Volt) ====  
Line 428: Line 423:
==== 2102 (RSX) ====
==== 2102 (RSX) ====
[[RSX]] (IC2001)
[[RSX]] (IC2001)
I had detected a short in the CELL. After removing one of the NEC/TOKINs, the error changed.
I had detected a short in the CELL, after removing one of the NEC/TOKINS, the error had changed.


I applied slight pressure to the CELL, using some thermal pads to create the pressure (5 small 1 mm pads and two larger 2 mm pads, the same size and depth as the ones used on the southbridge chip). The console now boots and runs without any issues.
I applied slight pressure to the CELL, using some thermal pads to create the pressure (5 small 1 mm pads and two larger 2 mm pads, the same size and depth as the ones used on the southbridge chip). The console now boots and runs without any issues.
Line 461: Line 456:
SW_1_B enables control Pin 5 on IC6013, which generates +2.5V_LREG_XCG_500_MEM. If that fails it generates A0092113.
SW_1_B enables control Pin 5 on IC6013, which generates +2.5V_LREG_XCG_500_MEM. If that fails it generates A0092113.


'''2114 (Unknown)'''​
==== 2120 (HDMI) ====
HDMI Error (IC2502)


Fails Generate A0092114 and A0092014​
A0202120/A0213013 error combinations are common. They appear to be related to VDDIO. IC6301 is involved in the formation of +1.7V_MISC, which among other things provides input power to the DC-DC converters that output +1.2V_YC_RC_VDDIO, +1.5V_YC_RC_VDDA, +1.2V_SB_VDDC and +1.2V_SB_VDDR. Lack of voltage to these DC/DC converters downstream of IC6301 suggests F6302 has blown. A number of people have fixed these 2120/3013 errors by finding shorts at or near C6320 and replacing Fuse F6302. But there are many other SMDs nearby that might cause these fuses to blow, so you will need to track down the source of the short and fix it, or the fuse will just blow again.


==== 2120 (HDMI I/O Error) ====
A bad thermistor (TH2501) has been reported to cause A0002120.
NOTE: Context matters with this error code! The step number and the number of codes per YLOD differ depending on the cause. Careful observation allows you to diagnose the most likely cause.


2120 means an issue has occurred with the high-speed data bus connection between the DVE <--> RSX <--> HDMI transmitter. This I/O error DOES NOT mean the HDMI encoder (IC2502) is bad. It is context based: associative, not diagnostic by itself. You must infer the diagnosis by using other, more diagnostic codes and observing console behavior to identify the cause.
A0802120 and A0902120 errors may be related to the actual HDMI transmitter (IC2502). Or they can be caused by BGA/Bump defects affecting VDDIO, on the RSX or CELL. BGA defects on the RSX VDDIO pads have been confirmed with a pressure test to have caused 2120 errors.


Count the number of 2120s your SYSCON records per YLOD event; look at the timestamp. 10x A0202120 + A0213013 error combinations appear to be related to VDDIO, the reference voltage powering the I/O bus. IC6301 is involved in the formation of +1.7V_MISC, which among other things provides input power to the DC-DC converters that output +1.2V_YC_RC_VDDIO, +1.5V_YC_RC_VDDA, +1.2V_SB_VDDC and +1.2V_SB_VDDR. Lack of voltage to these DC/DC converters downstream of IC6301 suggests F6302 has blown. A number of people have fixed these 2120/3013 error combos by finding shorts at or near C6320 and replacing Fuse F6302. But there are many other SMDs nearby that might cause these fuses to blow, so you will need to track down the source of the short and fix it, or the fuse will just blow again.
==== 2122 (DVE) ====
 
A bad thermistor (TH2501) has been reported to cause A0002120. It provides over current protection for the HDMI transmitter and output device in case there's a 5v short. This might happen if pins 17 (GND) and 18 (+5v) are damaged on your HDMI port or cable. Or if C2558 or C2570 short. See the service manual circuit diagrams as there are other SMDs that could malfunction and cause this error.
 
A0802120 and A0902120 errors can be caused by BGA or Bump defects that affect I/O, either the RSX or CELL. BGA defects on RSX VDDIO pads have been confirmed with a pressure test to have caused 2120 errors, but usually only one of them occurs per YLOD event. For example, one YLOD event may generate A0403034, A0404412 and an A0902120 error. This would indicate a bad GPU, not a bad HDMI transmitter. And since it occurred during the shutdown state (step number 90) this excludes issues that would have generated an error earlier in POST, like a fuse or a short in the Voltage regulation module (VRM).
 
The HDMI transmitter (IC2502) can also cause A0802120 and A0902120 errors, either through the IC itself or any of the SMDs between it and the RSX. You can tell a genuine HDMI transmitter issue apart because there are multiple A0802120 errors occurring during the bootloader, after the console has completed the power on self test (POST). The step number 80 (power on state) excludes fuse and VRM issues. You will usually see a varying number of 2120s, like 4 or 6 of them. This is different from the 10x 2120/3013 error combo or the 3034/4xxx/2120 combo described earlier.
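The context rules for 2120 can be written down as a small checklist. The Python sketch below is only an illustration of the patterns described in this section, not a diagnostic tool. It assumes the errors logged for one YLOD event are available as a list of 8-character strings. The function name <code>describe_2120_event</code> is hypothetical, and the thresholds (10 or more A0202120, more than one A0802120) are approximations of the wording above.

```python
from collections import Counter


def describe_2120_event(errors):
    """Suggest the most likely 2120 context for one YLOD event.

    errors: list of strings such as 'A0202120' logged for that event.
    The rules mirror the combinations described in the text; they are
    hints for further testing, not a definitive diagnosis.
    """
    counts = Counter(e.strip().upper() for e in errors)
    codes = {entry[4:] for entry in counts}  # four-digit code of each entry

    # 10x A0202120 together with A0213013: VDDIO supply problem
    if counts["A0202120"] >= 10 and counts["A0213013"] >= 1:
        return "VDDIO supply suspected (check F6302 and shorts near C6320)"
    # 3034 plus a 4xxx data error plus A0902120: BGA/bump defect
    if ("3034" in codes and counts["A0902120"] >= 1
            and any(code.startswith("4") for code in codes)):
        return "RSX/CELL BGA or bump defect suspected"
    # Several A0802120 at step 80: HDMI transmitter side
    if counts["A0802120"] > 1:
        return "HDMI transmitter (IC2502) or nearby SMDs suspected"
    # A0002120 has been reported with a bad thermistor
    if counts["A0002120"] >= 1:
        return "thermistor TH2501 reported as a cause"
    return "no documented pattern matched"


print(describe_2120_event(["A0403034", "A0404412", "A0902120"]))
```

The order of the checks matters: the more specific combinations are tested first so that a lone 2120 entry does not mask them.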
 
====2122 (DVE)====
DVE Error (IC2406, CXM4024R MultiAV controller for analog out)
DVE Error (IC2406, CXM4024R MultiAV controller for analog out)


====2124 (AV) ====
==== 2124 (AV) ====
This error tends to cause a delayed Yellow Light Of Death (10s - 1min). Sometimes described as a Green Light Of Death (GLOD) or Red Light Of Death (RLOD).  
This error tends to cause a delayed Yellow Light Of Death (10s - 1min). Sometimes described as a Green Light Of Death (GLOD) or Red Light Of Death (RLOD).  


2124 and 2024 errors have been fixed by replacing both the AV and HDMI encoders. One user reported 2024/2124 errors resolved by replacing the HDMI encoder. Another removed the HDMI encoder and tested the console without it. That console primarily filled the errorlog with 2124 errors, but a few 2024's as well. So it is unclear if 2124 is specific to the HDMI Encoder or AV Encoder. It seems it could be either.
2124 and 2024 errors have been fixed by replacing both the AV and HDMI encoders. One user reported 2024/2124 errors resolved by replacing the HDMI encoder. Another removed the HDMI encoder and tested the console without it. That console primarily filled the errorlog with 2124 errors, but a few 2024's as well. So it is unclear if 2124 is specific to the HDMI Encoder or AV Encoder. It seems it could be either.


====2130 (Thermal Sensor, CELL)====
==== 2130 (Thermal Sensor, CELL) ====
*Components Involved:
*Components Involved:
**[[CELL BE|CELL]]
**[[CELL BE|CELL]]
** [[CELL BE|CELL]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC1101 on [[COK-001]])
**[[CELL BE|CELL]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC1101 on [[COK-001]])


====2131 (Thermal Sensor, RSX)====
==== 2131 (Thermal Sensor, RSX) ====
*Components Involved:
*Components Involved:
** [[RSX]]
**[[RSX]]
**[[RSX]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC2101 on [[COK-001]])
**[[RSX]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC2101 on [[COK-001]])


====2133 (Thermal Sensor, South Bridge)====
==== 2133 (Thermal Sensor, South Bridge) ====
*Components Involved:
*Components Involved:
** [[South Bridge]]
**[[South Bridge]]
**[[South Bridge]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC3101 on [[COK-001]])
**[[South Bridge]] [[Thermal#Temperature_Monitors|Temperature Monitor]] (IC3101 on [[COK-001]])


==== 2203 ([[South Bridge]])====
==== 2203 ([[South Bridge]]) ====


From sabotage tests it was found that disabling +2.5V_SB_PLL_VDDC
From sabotage tests it was found that disabling +2.5V_SB_PLL_VDDC
produced four A0802203 errors. Also, disabling +1.2V_SB_VDDR produced A0302203 & A0403034.
produced four A0802203 errors. Also, disabling +1.2V_SB_VDDR produced A0302203 & A0403034.


====2310====
==== 2310 ====
----
----


===Fatal Boot Errors===
=== Fatal Boot Errors ===
----
----


====3000====
==== 3000 ====
Power Failure
Power Failure


====3001====
==== 3001 ====
12v Power Failure
12v Power Failure


Usually this is caused by a bad Power Supply Unit (PSU).
Usually this is caused by a bad Power Supply Unit (PSU).


Alternatively, a failure on the 12v_main line can cause it. Check fuses, capacitors, resistors, and ICs on the 12v line. Measure resistance of the large 2 prong 12v connector on the motherboard. It should read in the Kilo ohms range if there is sufficient separation. Otherwise you may have a short somewhere on the line.
Alternatively, a failure on the 12v_main line can cause it. Check fuses, capacitors, resistors, and IC's on the 12v line. Measure resistance of the large 2 prong 12v connector on the motherboard. It should read in the Kilo ohms range if there is sufficient separation. Otherwise you may have a short somewhere on the line.


====3002====
==== 3002 ====
Power Failure
Power Failure


====3003 ([[CELL BE|CELL]] Core Power Failure)====
==== 3003 ([[CELL BE|CELL]] Core Power Failure) ====


This error will occur in the case of a PWR failure on the main core voltage of the CPU (VDDC). For example, if the filtering capacitors (NEC/TOKINs) are severely damaged. There are other SMDs in that filter, so it could be related to them as well.
This error will occur in the case of a PWR failure on the main core voltage of the CPU (VDDC). For example, if the filtering capacitors (NEC/TOKIN's) are severely damaged. There are other SMD's in that filter, so it could be related to them as well.


A shorted Blu-Ray drive can cause this error as well. Be sure that your drive is working properly before doing anything to your console.
A shorted Blu-Ray drive can cause this error as well. Be sure that your drive is working properly before doing anything to your console.


====3004 ([[RSX]] Core Power Failure)====
==== 3004 ([[RSX]] Core Power Failure) ====


This error will occur in the case of a PWR failure on the main core voltage of the GPU (VDDC). For example, if the filtering capacitors (NEC/TOKINs) are severely damaged. There are other SMDs in that filter, so it could be related to them as well.
This error will occur in the case of a PWR failure on the main core voltage of the GPU (VDDC). For example, if the filtering capacitors (NEC/TOKIN's) are severely damaged. There are other SMD's in that filter, so it could be related to them as well.


====3005====
==== 3010 ====
 
I had A0043005 on a PQX-001 and found that the RSX was shorted out, which was causing the error. I was unable to fix it.
 
This error will occur if fuse F7601 has blown on a PQX-001.
 
====3010====
[[CELL BE|CELL]] Error
[[CELL BE|CELL]] Error


Observations:  
Observations:  
A user triggered this error by injecting 3.3V into PWRGD (power good) of IC6103 (NCP5318 CPU Buck Controller). It generated error 20 1001 and 20 3010.
A user triggered this error by injecting 3.3V into PWRGD (power good) of IC6103 (NCP5318 CPU Buck Controller). It generated error 20 1001 and 20 3010.
Another user (Razmann4k) got this error on their CECHL04 by attempting the eraser mod on an already delidded Cell and noticed a crack running down the middle of the Cell die. It caused 20 3010. A 20 3010 error was also observed on a CELL that was physically damaged during a delidding attempt by the console owner.
Another user (Razmann4k) got this error on their CECHL04 by attempting the eraser mod on an already delidded Cell and noticed a crack running down the middle of the Cell die. It caused 20 3010.




This problem may be related to the PLL signal generator circuit, open resistors, the crystal oscillator, or even the integrated circuit itself (CDC735/CDC736/4227ANLG).
This problem may be related to the PLL signal generator circuit, open resistors, the crystal oscillator, or even the integrated circuit itself (CDC735/CDC736/4227ANLG).


====3011====
==== 3011 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3012 ====
==== 3012 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3013====
==== 3013 ====
BE_SPI DI/DO ERROR
BE_SPI DI/DO ERROR


Line 570: Line 551:
Through sabotage testing it was found that disabling +1.2V_YC_RC_VDDIO caused A0213013.
Through sabotage testing it was found that disabling +1.2V_YC_RC_VDDIO caused A0213013.


Also through sabotage testing, it was found that when L6305 is removed it cuts off +1.8V_RSX_FBVDDQ (VRAM voltage). It caused a 10x A0202120 & 1x A0213013 error combo.
Also through sabotage testing, it was found that when L6305 is removed it cut's off +1.8V_RSX_FBVDDQ (VRAM voltage). It caused a 10x A0202120 & 1x A0213013 error combo.


====3020====
==== 3020 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3030====
==== 3030 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3031====
==== 3031 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3032====
==== 3032 ====
[[CELL BE|CELL]] Error
[[CELL BE|CELL]] Error


It was discovered through sabotage testing that disabling +1.5V_YC_RC_VDDA caused error A0313032
It was discovered throgh sabotage testing that disabling +1.5V_YC_RC_VDDA caused error A0313032


====3033====
==== 3033 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====3034====
==== 3034 ====
[[CELL BE|CELL]] / [[RSX]] Communication Error
[[CELL BE|CELL]] / [[RSX]] Communication Error


This error occurs when Bit Training fails. Bit Training, also known as bit calibration, is a critical process during the power-on-reset (POR) sequence of the CELL BE processor. It fine-tunes the behavior of individual bits within the 8-bit-wide Rambus channels. This adjustment accounts for variations in circuitry, wiring, and loading delays. Bit training plays a pivotal role in optimizing signal quality by calibrating the signal driver current and driver impedance, and by ensuring that the timing of each of the eight data bits aligns with clock edges. This effectively centers the data "eye," allowing for more accurate and reliable data transmission.
This is the most common error seen in early Phat model PS3s with the hottest 90nm [[RSX]] and [[CELL BE|CELL]] processors. It is the hallmark of a BGA defect (such as a cracked solder ball). It is by no means limited to the early models, however. These errors have been seen in every model of PS3 with varying frequency. The most reliable consoles appear to be those with a CPU/GPU of smaller manufacturing process, such as the Super Slim (SS) models (42xx and later) which have a 45nm [[CELL BE|CELL]] and 28nm [[RSX]]. The least reliable are the PS2 Backwards Compatible A-E Models, which have 90nm [[RSX]]/[[CELL BE|CELL]].


This is the most common error seen in early Phat model PS3s with the 90nm [[RSX]]. It is the hallmark of solder fatigue (such as a cracked solder ball or bump defect) which affects the Flex IO interface that allows the CPU, GPU, and SB to communicate. It is by no means limited to the early models, however. These errors have been seen in every model of PS3 with varying frequency. It is most common in the earliest models, likely due to a manufacturing defect in the 90nm RSX material set: namely, a CTE mismatch between underfill and bump material that leads to premature solder fatigue and GPU failure. Dubbed "BumpGate," this is a well known failure mode among GPUs manufactured from 2005-2008. Although it has not been proven unequivocally that the 90nm RSX is affected by BumpGate, members of the community have shown the 90nm RSX has an increased failure rate, a similar material set, and exhibits similar symptoms to known BumpGate-affected chipsets, such as black screens (GLOD) and graphical artifacts like lines, double images, color splotches and pixelation.
The root cause is mechanical fatigue due to thermal cycling. The materials used to construct the motherboard and processors have different properties. For example, the coefficient of thermal expansion of the FR4 fiberglass used in the motherboard and processor substrate is different from that of the copper BGA pads, which is different from that of the lead-free solder used to join them. This means they will expand and contract at different rates as the chip heats up and cools down, which applies shearing force to the BGA. Over many thermal cycles this deforms the solder balls and causes a defect (such as a solder crack, a torn trace, or a ball pulling away from its pad).


While BumpGate is a plausible explanation, it's not the only one. The materials used to construct the motherboard and processors have different coefficients of thermal expansion (CTE). This means they will expand and contract at different rates as the chip heats up and cools down, which applies force to solder connections. Over many thermal cycles this deforms the solder and causes a defect. That may affect the Bumps, which attach the silicon die to the interposer (sometimes referred to as substrate), or the Ball-Grid Array (BGA), which connects the interposer to the Motherboard.
3034 is triggered when the voltage or data lines connecting the CPU/GPU are broken. There is often a data error (4XXX) that also appears, but not always. The most common cause is a BGA defect on the [[RSX]], which usually requires a reball/reflow to repair. Something about the [[RSX]] construction or workload causes it to fail more frequently, but the CPU can fail too. However, it's not always a BGA defect. The bumps on either chip can fail, Flex IO traces (the data lines that connect the CPU/GPU) can be broken/scratched, or accumulated damage from wear and tear (electromigration) can also cause this error. The true percentage of consoles with BGA defects that can be fixed with a reball/reflow is unknown. However, there is evidence to suggest that the underfill used to reinforce the CPU/GPU die and [[RSX]] RAM bumps was not as effective when the PS3 was manufactured. This could explain many of the consoles whose reball fails prematurely afterwards.


3034 is triggered when Bit calibration, also known as BitTraining, cannot complete correctly. So it is not limited to a singular cause. BGA defects from thermal cycling, drop damage, pulling force from separating the heat sink from the processors while disassembling, or delidding can occur. The bumps on CPU, GPU, or SB can fail, Flex IO traces that connect them can be broken/scratched, or accumulated damage from wear and tear (electromigration) can also cause BitTraining to fail. Anything that can disrupt the impedance of the FlexIO can cause BitTraining to fail. A skilled technician will need to use deductive reasoning to diagnose the cause and choose the appropriate repair.
If a reflow/reball of both the CPU/GPU fails, then the chip is beyond repair and needs to be replaced. The [[RSX]] can be replaced with the same model without modification. It can be replaced with a different model using a modchip (or new syscon modification) that injects the correct [[RSX]] ID during boot. This has been nicknamed a "Frankenstein Mod." Since they are married to each other, the CPU can only be replaced if the chipset (NAND/NOR and SYSCON chips) is replaced as well. Since the CPU can't be replaced as easily, a dead CPU is usually considered unrepairable.


A qualitative test known as a "pressure test" may be used to help make a diagnosis. Applying slight pressure to the processor, within reason (not your body weight or clamping force, which could cause a BGA defect), flexes the motherboard beneath the BGA and may temporarily reconnect a solder ball with its pad, like holding 2 wires together. This can cause flickering on screen, a console to power on when it couldn't before, etc. If the console or error responds differently when pressure is applied, this may be taken as evidence of a BGA defect. It is not definitive, but tips the odds in favor of that diagnosis. A reball in that case may be successful. However, if the console does not respond to pressure, the BGA is not likely to be the cause and another explanation, such as the bumps, is more likely. It should be noted that bumps can be affected by force as well, but because the underfill supports them, it generally requires more force to reconnect them using this method. This is what the "Bolt mod," commonly performed on the XBOX 360, did. That much force permanently deforms the motherboard and causes irreparable damage. DO NOT DO THIS! But it illustrates the point. You don't need much force to see if the BGA is affected, and if it responds to light pressure, it's unlikely to be the bumps. Therefore, taken together with other clues, it can be helpful to a skilled technician gathering evidence for a diagnosis.
==== 3035 ====
 
In consoles with a 90nm [[RSX]] (CECH-Axx/Bxx/Cxx/Exx/Gxx/Hxx, and M03 models) the most likely cause of a 3034 is the GPU itself. It can be replaced with another 90nm RSX without modification. However, it can also be replaced with a more reliable 65nm or 40nm model, using a process nicknamed a "Frankenstein Mod." SONY service technicians performed this modification in some officially refurbished consoles. The PS3 community has developed a method as well. Since there is a question about the 90nm RSX's reliability, and both a reball and a Frankenstein mod require the 90nm to be desoldered, it is advisable to replace the 90nm GPU with a more reliable model instead of risking another 90nm GPU. Rework is hard on the motherboard and surrounding components, so choosing a repair with the fewest uncertainties is wise.
 
In models without the 90nm RSX, 3034 is still possible, but far less likely to be caused by the GPU. CPU BGA defects are common in consoles that have been dropped or delidded, or that have trace damage in the area around the processors, so troubleshooting is necessary to make a diagnosis.
 
====3035 ====
[[CELL BE|CELL]] and [[RSX]]
[[CELL BE|CELL]] and [[RSX]]


====3036====
==== 3036 ====
[[CELL BE|CELL]] and [[RSX]]
[[CELL BE|CELL]] and [[RSX]]


====3037====
==== 3037 ====
[[CELL BE|CELL]] and [[RSX]]
[[CELL BE|CELL]] and [[RSX]]


====3038====
==== 3038 ====
[[CELL BE|CELL]] and [[RSX]]
[[CELL BE|CELL]] and [[RSX]]


====3039====
==== 3039 ====
[[CELL BE|CELL]] and [[RSX]]
[[CELL BE|CELL]] and [[RSX]]


==== 3040====
==== 3040 ====
Flash
Flash


A0603040 is known to be caused by not soldering the flash (NAND/NOR) back on properly. It happens when the flash is not powered. Step #60 is when the StarShip 2 flash controller and NAND/NOR are initialized, kicking off the firmware sequence that loads the Operating System. Check their voltages and be sure the FW is not corrupt. If you have a backup, you could try replacing the Flash to see if a module failed.
A0603040 is known to be caused by not soldering the flash (NAND/NOR) back on properly. It happens when the flash is not powered. Step #60 is when the StarShip 2 flash controller and NAND/NOR are initialized, kicking off the firmware sequence that loads the Operating System. Check their voltages and be sure the FW is not corrupt. If you have a backup, you could try replacing the Flash to see if a module failed.


====3041====
==== 3041 ====
Flash (eMMC)
Flash (eMMC)


===Data Errors===
=== Data Errors ===
----
----
*These error codes seem to be repeated up to 5 times for 5 special cases. For example, errors 4'''0'''01, 4'''1'''01, 4'''2'''01, 4'''3'''01, 4'''4'''01 are related to CELL; the only thing that changes in the error code is the second digit (located immediately after the category). If at some point we find out what that digit means, we can join the wiki page sections together (with titles: "4001, 4101, 4201, 4301, 4401", etc...)
*These error codes seem to be repeated up to 5 times for 5 special cases. For example, errors 4'''0'''01, 4'''1'''01, 4'''2'''01, 4'''3'''01, 4'''4'''01 are related to CELL; the only thing that changes in the error code is the second digit (located immediately after the category). If at some point we find out what that digit means, we can join the wiki page sections together (with titles: "4001, 4101, 4201, 4301, 4401", etc...)
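
The grouping described in the note above can be sketched in a few lines of Python. This is only an illustration, not a SYSCON tool: the function names <code>split_code</code> and <code>group_by_pattern</code> and the label "variant" for the second digit are assumptions made for the example, since the meaning of that digit is still unknown.

```python
from collections import defaultdict


def split_code(code):
    """Split a 4-character error code into (category, variant, suffix).

    'variant' is a hypothetical name for the second digit, whose
    meaning has not been identified yet.
    """
    if len(code) != 4:
        raise ValueError(f"expected a 4-character code, got {code!r}")
    return code[0], code[1], code[2:]


def group_by_pattern(codes):
    """Group codes that differ only in their second digit."""
    groups = defaultdict(list)
    for code in codes:
        category, _variant, suffix = split_code(code)
        # Replace the second digit with 'x' to build the shared pattern.
        groups[f"{category}x{suffix}"].append(code)
    return dict(groups)


if __name__ == "__main__":
    sample = ["4001", "4101", "4201", "4301", "4401", "4002", "4102"]
    for pattern, members in group_by_pattern(sample).items():
        print(pattern, ", ".join(members))
```

Grouping by the pattern with the second digit masked gives the joined section titles suggested above, for example "4001, 4101, 4201, 4301, 4401" under the pattern <code>4x01</code>.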


====4001====
==== 4001 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4002====
==== 4002 ====
[[RSX]]
[[RSX]]


====4003====
==== 4003 ====
Southbridge
Southbridge


====4011====
==== 4011 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4101====
==== 4101 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4102====
==== 4102 ====
[[RSX]]
[[RSX]]


====4103====
==== 4103 ====
Southbridge
Southbridge


====4111====
==== 4111 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4201====
==== 4201 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4202====
==== 4202 ====
[[RSX]]
[[RSX]]


====4203====
==== 4203 ====
Southbridge
Southbridge


====4211====
==== 4211 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4212====
==== 4212 ====
[[RSX]]
[[RSX]]


====4221====
==== 4221 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4222====
==== 4222 ====
[[RSX]]
[[RSX]]


====4231====
==== 4231 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4261====
==== 4261 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4301====
==== 4301 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4302====
==== 4302 ====
[[RSX]]
[[RSX]]


====4303====
==== 4303 ====
Southbridge
Southbridge


====4311====
==== 4311 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4312====
==== 4312 ====
[[RSX]]
[[RSX]]


====4321====
==== 4321 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4322====
==== 4322 ====
[[RSX]]
[[RSX]]


====4332====
==== 4332 ====
[[RSX]]
[[RSX]]


====4341====
==== 4341 ====
[[CELL BE|CELL]]
[[CELL BE|CELL]]


====4401====
==== 4401 ====
[[CELL BE|CELL]] or [[RSX]]
 
====4402====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4403====
==== 4402 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4411====
==== 4403 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4412====
==== 4411 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4421====
==== 4412 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4422====
==== 4421 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4432====
==== 4422 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====4441====
==== 4432 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


====5FFF====
==== 4441 ====
[[CELL BE|CELL]] or [[RSX]]
[[CELL BE|CELL]] or [[RSX]]


{{Hardware Modification}}<noinclude>
{{Hardware Modification}}<noinclude>[[Category:Main]]</noinclude>
[[Category:Main]]
</noinclude>