Amino acid dipeptide frequency for Aspergillus pseudonomiae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
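The counting described above can be sketched in Python as follows. This is a minimal illustration and not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping adjacent pairs, that any non-standard letter is mapped to X (Xaa), and that "expected" means the uniform value of 2.5‰. The names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # one-letter codes of the 20 standard amino acids
UNIFORM_EXPECTED = 1000.0 / 400     # 2.5 per mille, assuming all 400 dipeptides are equally likely


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTED):
    """Label a frequency as over- or underrepresented relative to `expected`."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy_proteins = ["MALLSA", "MKLLA"]
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real data, `classify(8.973)` for LeuLeu gives "over" and `classify(0.289)` for CysCys gives "under", which matches the direction in which these two values deviate from 2.5‰.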

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
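As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical, and the uniform baseline is an assumption taken from the description above.

```python
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_over_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_EXPECTED_PERMILLE


leu_leu = 8.973  # LeuLeu value from the table, in per mille
print(permille_to_fraction(leu_leu))
print(permille_to_percent(leu_leu))
print(fold_over_uniform(leu_leu))
```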

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, mean ± error), grouped by first residue. Second residues follow the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.128 ± 0.054
AlaCys: 1.159 ± 0.013
AlaAsp: 4.034 ± 0.028
AlaGlu: 4.777 ± 0.035
AlaPhe: 3.209 ± 0.022
AlaGly: 5.585 ± 0.04
AlaHis: 1.78 ± 0.019
AlaIle: 4.403 ± 0.032
AlaLys: 3.675 ± 0.03
AlaLeu: 7.804 ± 0.042
AlaMet: 1.968 ± 0.018
AlaAsn: 2.845 ± 0.021
AlaPro: 4.349 ± 0.033
AlaGln: 3.257 ± 0.027
AlaArg: 4.675 ± 0.028
AlaSer: 6.895 ± 0.038
AlaThr: 5.068 ± 0.029
AlaVal: 5.404 ± 0.03
AlaTrp: 1.179 ± 0.015
AlaTyr: 2.282 ± 0.023
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.031 ± 0.014
CysCys: 0.289 ± 0.007
CysAsp: 0.744 ± 0.01
CysGlu: 0.651 ± 0.011
CysPhe: 0.598 ± 0.009
CysGly: 0.984 ± 0.014
CysHis: 0.388 ± 0.008
CysIle: 0.815 ± 0.013
CysLys: 0.518 ± 0.009
CysLeu: 1.473 ± 0.018
CysMet: 0.314 ± 0.007
CysAsn: 0.484 ± 0.009
CysPro: 0.722 ± 0.015
CysGln: 0.519 ± 0.009
CysArg: 0.855 ± 0.013
CysSer: 1.065 ± 0.014
CysThr: 0.777 ± 0.012
CysVal: 0.884 ± 0.014
CysTrp: 0.225 ± 0.005
CysTyr: 0.41 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.388 ± 0.03
AspCys: 0.671 ± 0.011
AspAsp: 3.723 ± 0.033
AspGlu: 4.061 ± 0.032
AspPhe: 2.146 ± 0.019
AspGly: 3.888 ± 0.026
AspHis: 1.291 ± 0.014
AspIle: 3.218 ± 0.023
AspLys: 2.235 ± 0.019
AspLeu: 5.13 ± 0.029
AspMet: 1.26 ± 0.016
AspAsn: 1.908 ± 0.018
AspPro: 3.333 ± 0.025
AspGln: 1.947 ± 0.018
AspArg: 3.069 ± 0.025
AspSer: 4.105 ± 0.031
AspThr: 3.044 ± 0.019
AspVal: 3.635 ± 0.026
AspTrp: 0.901 ± 0.014
AspTyr: 1.679 ± 0.017
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.007 ± 0.035
GluCys: 0.694 ± 0.012
GluAsp: 3.857 ± 0.029
GluGlu: 5.042 ± 0.045
GluPhe: 1.964 ± 0.018
GluGly: 3.607 ± 0.023
GluHis: 1.407 ± 0.014
GluIle: 3.083 ± 0.025
GluLys: 3.507 ± 0.025
GluLeu: 5.154 ± 0.032
GluMet: 1.404 ± 0.015
GluAsn: 2.327 ± 0.02
GluPro: 2.72 ± 0.029
GluGln: 2.488 ± 0.022
GluArg: 3.804 ± 0.032
GluSer: 4.304 ± 0.03
GluThr: 3.477 ± 0.025
GluVal: 3.492 ± 0.024
GluTrp: 0.891 ± 0.013
GluTyr: 1.761 ± 0.017
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.05 ± 0.025
PheCys: 0.629 ± 0.01
PheAsp: 2.273 ± 0.022
PheGlu: 2.082 ± 0.018
PhePhe: 1.767 ± 0.021
PheGly: 2.86 ± 0.029
PheHis: 1.016 ± 0.013
PheIle: 1.98 ± 0.021
PheLys: 1.445 ± 0.015
PheLeu: 3.837 ± 0.033
PheMet: 0.824 ± 0.012
PheAsn: 1.473 ± 0.017
PhePro: 2.078 ± 0.019
PheGln: 1.478 ± 0.016
PheArg: 2.051 ± 0.019
PheSer: 3.101 ± 0.021
PheThr: 2.253 ± 0.022
PheVal: 2.472 ± 0.021
PheTrp: 0.697 ± 0.013
PheTyr: 1.235 ± 0.015
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.125 ± 0.038
GlyCys: 0.986 ± 0.014
GlyAsp: 3.542 ± 0.026
GlyGlu: 3.504 ± 0.025
GlyPhe: 2.898 ± 0.025
GlyGly: 5.266 ± 0.047
GlyHis: 1.691 ± 0.02
GlyIle: 3.742 ± 0.024
GlyLys: 3.255 ± 0.023
GlyLeu: 6.346 ± 0.038
GlyMet: 1.55 ± 0.017
GlyAsn: 2.517 ± 0.022
GlyPro: 3.312 ± 0.027
GlyGln: 2.597 ± 0.022
GlyArg: 3.953 ± 0.029
GlySer: 5.64 ± 0.037
GlyThr: 3.987 ± 0.028
GlyVal: 4.506 ± 0.03
GlyTrp: 1.218 ± 0.017
GlyTyr: 2.254 ± 0.023
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.836 ± 0.018
HisCys: 0.393 ± 0.008
HisAsp: 1.352 ± 0.013
HisGlu: 1.335 ± 0.015
HisPhe: 0.962 ± 0.014
HisGly: 1.776 ± 0.016
HisHis: 0.861 ± 0.014
HisIle: 1.351 ± 0.016
HisLys: 0.882 ± 0.012
HisLeu: 2.353 ± 0.02
HisMet: 0.522 ± 0.009
HisAsn: 0.881 ± 0.012
HisPro: 1.711 ± 0.017
HisGln: 0.98 ± 0.012
HisArg: 1.617 ± 0.016
HisSer: 1.942 ± 0.018
HisThr: 1.34 ± 0.014
HisVal: 1.468 ± 0.017
HisTrp: 0.383 ± 0.009
HisTyr: 0.75 ± 0.011
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.317 ± 0.03
IleCys: 0.862 ± 0.013
IleAsp: 2.887 ± 0.021
IleGlu: 2.845 ± 0.027
IlePhe: 2.163 ± 0.024
IleGly: 3.407 ± 0.032
IleHis: 1.319 ± 0.015
IleIle: 2.704 ± 0.023
IleLys: 2.053 ± 0.019
IleLeu: 4.979 ± 0.031
IleMet: 1.073 ± 0.013
IleAsn: 1.855 ± 0.016
IlePro: 3.276 ± 0.026
IleGln: 2.062 ± 0.02
IleArg: 2.875 ± 0.024
IleSer: 4.102 ± 0.025
IleThr: 2.921 ± 0.025
IleVal: 3.391 ± 0.024
IleTrp: 0.781 ± 0.013
IleTyr: 1.588 ± 0.018
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.928 ± 0.027
LysCys: 0.555 ± 0.011
LysAsp: 2.678 ± 0.021
LysGlu: 3.206 ± 0.031
LysPhe: 1.389 ± 0.017
LysGly: 2.868 ± 0.024
LysHis: 1.083 ± 0.015
LysIle: 2.163 ± 0.021
LysLys: 2.842 ± 0.033
LysLeu: 3.925 ± 0.029
LysMet: 0.909 ± 0.011
LysAsn: 1.658 ± 0.017
LysPro: 2.54 ± 0.024
LysGln: 1.799 ± 0.017
LysArg: 3.163 ± 0.025
LysSer: 3.282 ± 0.029
LysThr: 2.6 ± 0.022
LysVal: 2.747 ± 0.022
LysTrp: 0.659 ± 0.01
LysTyr: 1.395 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.828 ± 0.043
LeuCys: 1.37 ± 0.014
LeuAsp: 5.225 ± 0.032
LeuGlu: 5.529 ± 0.035
LeuPhe: 3.631 ± 0.03
LeuGly: 6.191 ± 0.037
LeuHis: 2.401 ± 0.02
LeuIle: 4.241 ± 0.03
LeuLys: 4.054 ± 0.03
LeuLeu: 8.973 ± 0.059
LeuMet: 1.87 ± 0.019
LeuAsn: 3.301 ± 0.024
LeuPro: 5.594 ± 0.034
LeuGln: 4.002 ± 0.028
LeuArg: 5.876 ± 0.037
LeuSer: 7.798 ± 0.038
LeuThr: 4.982 ± 0.031
LeuVal: 5.75 ± 0.035
LeuTrp: 1.287 ± 0.016
LeuTyr: 2.648 ± 0.026
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.165 ± 0.018
MetCys: 0.275 ± 0.006
MetAsp: 1.234 ± 0.013
MetGlu: 1.304 ± 0.014
MetPhe: 0.758 ± 0.012
MetGly: 1.484 ± 0.017
MetHis: 0.492 ± 0.009
MetIle: 1.071 ± 0.012
MetLys: 0.984 ± 0.012
MetLeu: 1.919 ± 0.02
MetMet: 0.567 ± 0.009
MetAsn: 0.819 ± 0.01
MetPro: 1.188 ± 0.015
MetGln: 0.871 ± 0.012
MetArg: 1.259 ± 0.014
MetSer: 1.858 ± 0.018
MetThr: 1.312 ± 0.015
MetVal: 1.414 ± 0.015
MetTrp: 0.272 ± 0.006
MetTyr: 0.566 ± 0.01
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.081 ± 0.024
AsnCys: 0.501 ± 0.008
AsnAsp: 1.945 ± 0.019
AsnGlu: 2.023 ± 0.019
AsnPhe: 1.36 ± 0.017
AsnGly: 2.887 ± 0.023
AsnHis: 0.883 ± 0.01
AsnIle: 2.155 ± 0.02
AsnLys: 1.48 ± 0.017
AsnLeu: 3.291 ± 0.026
AsnMet: 0.866 ± 0.013
AsnAsn: 1.452 ± 0.018
AsnPro: 2.481 ± 0.023
AsnGln: 1.337 ± 0.015
AsnArg: 1.992 ± 0.018
AsnSer: 2.71 ± 0.024
AsnThr: 2.235 ± 0.019
AsnVal: 2.385 ± 0.02
AsnTrp: 0.593 ± 0.01
AsnTyr: 1.119 ± 0.016
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.708 ± 0.035
ProCys: 0.618 ± 0.011
ProAsp: 3.182 ± 0.023
ProGlu: 3.859 ± 0.031
ProPhe: 2.17 ± 0.02
ProGly: 3.879 ± 0.031
ProHis: 1.325 ± 0.015
ProIle: 2.585 ± 0.022
ProLys: 2.49 ± 0.024
ProLeu: 4.806 ± 0.029
ProMet: 1.072 ± 0.014
ProAsn: 2.171 ± 0.02
ProPro: 4.47 ± 0.058
ProGln: 2.39 ± 0.027
ProArg: 3.338 ± 0.031
ProSer: 5.901 ± 0.046
ProThr: 3.872 ± 0.03
ProVal: 3.642 ± 0.025
ProTrp: 0.82 ± 0.012
ProTyr: 1.579 ± 0.016
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.306 ± 0.026
GlnCys: 0.509 ± 0.01
GlnAsp: 2.073 ± 0.019
GlnGlu: 2.437 ± 0.022
GlnPhe: 1.343 ± 0.018
GlnGly: 2.517 ± 0.023
GlnHis: 1.055 ± 0.014
GlnIle: 1.96 ± 0.019
GlnLys: 1.961 ± 0.02
GlnLeu: 3.606 ± 0.026
GlnMet: 0.895 ± 0.011
GlnAsn: 1.591 ± 0.015
GlnPro: 2.471 ± 0.028
GlnGln: 2.204 ± 0.038
GlnArg: 2.626 ± 0.024
GlnSer: 3.225 ± 0.028
GlnThr: 2.356 ± 0.019
GlnVal: 2.244 ± 0.02
GlnTrp: 0.621 ± 0.011
GlnTyr: 1.236 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.484 ± 0.027
ArgCys: 0.799 ± 0.012
ArgAsp: 3.27 ± 0.027
ArgGlu: 3.702 ± 0.029
ArgPhe: 2.252 ± 0.02
ArgGly: 3.582 ± 0.027
ArgHis: 1.601 ± 0.017
ArgIle: 2.956 ± 0.023
ArgLys: 3.265 ± 0.028
ArgLeu: 5.696 ± 0.033
ArgMet: 1.31 ± 0.015
ArgAsn: 2.264 ± 0.02
ArgPro: 3.362 ± 0.029
ArgGln: 2.61 ± 0.022
ArgArg: 4.898 ± 0.041
ArgSer: 4.751 ± 0.037
ArgThr: 3.211 ± 0.022
ArgVal: 3.491 ± 0.023
ArgTrp: 1.005 ± 0.013
ArgTyr: 1.799 ± 0.018
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.404 ± 0.036
SerCys: 1.014 ± 0.016
SerAsp: 4.271 ± 0.031
SerGlu: 4.186 ± 0.027
SerPhe: 3.22 ± 0.025
SerGly: 5.525 ± 0.037
SerHis: 2.051 ± 0.021
SerIle: 4.203 ± 0.034
SerLys: 3.574 ± 0.03
SerLeu: 7.622 ± 0.042
SerMet: 1.737 ± 0.016
SerAsn: 3.0 ± 0.026
SerPro: 5.239 ± 0.042
SerGln: 3.347 ± 0.027
SerArg: 4.957 ± 0.033
SerSer: 8.472 ± 0.067
SerThr: 5.494 ± 0.043
SerVal: 4.767 ± 0.027
SerTrp: 1.216 ± 0.015
SerTyr: 2.205 ± 0.021
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.091 ± 0.029
ThrCys: 0.813 ± 0.013
ThrAsp: 2.935 ± 0.023
ThrGlu: 3.201 ± 0.023
ThrPhe: 2.298 ± 0.019
ThrGly: 4.256 ± 0.032
ThrHis: 1.329 ± 0.015
ThrIle: 3.174 ± 0.024
ThrLys: 2.455 ± 0.022
ThrLeu: 5.378 ± 0.032
ThrMet: 1.244 ± 0.013
ThrAsn: 2.077 ± 0.02
ThrPro: 4.12 ± 0.032
ThrGln: 2.072 ± 0.021
ThrArg: 3.06 ± 0.022
ThrSer: 5.181 ± 0.037
ThrThr: 4.066 ± 0.034
ThrVal: 3.97 ± 0.03
ThrTrp: 0.907 ± 0.012
ThrTyr: 1.716 ± 0.018
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.199 ± 0.032
ValCys: 0.92 ± 0.011
ValAsp: 3.781 ± 0.026
ValGlu: 3.762 ± 0.032
ValPhe: 2.594 ± 0.027
ValGly: 4.127 ± 0.032
ValHis: 1.51 ± 0.015
ValIle: 3.2 ± 0.025
ValLys: 2.781 ± 0.023
ValLeu: 5.868 ± 0.032
ValMet: 1.364 ± 0.015
ValAsn: 2.308 ± 0.017
ValPro: 3.651 ± 0.024
ValGln: 2.485 ± 0.021
ValArg: 3.476 ± 0.024
ValSer: 4.903 ± 0.03
ValThr: 3.623 ± 0.024
ValVal: 4.37 ± 0.033
ValTrp: 0.912 ± 0.013
ValTyr: 1.901 ± 0.019
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.17 ± 0.017
TrpCys: 0.221 ± 0.006
TrpAsp: 0.93 ± 0.013
TrpGlu: 0.883 ± 0.013
TrpPhe: 0.586 ± 0.011
TrpGly: 0.983 ± 0.012
TrpHis: 0.387 ± 0.008
TrpIle: 0.834 ± 0.013
TrpLys: 0.842 ± 0.012
TrpLeu: 1.471 ± 0.017
TrpMet: 0.392 ± 0.009
TrpAsn: 0.667 ± 0.01
TrpPro: 0.635 ± 0.011
TrpGln: 0.587 ± 0.011
TrpArg: 0.996 ± 0.013
TrpSer: 1.093 ± 0.015
TrpThr: 0.961 ± 0.014
TrpVal: 0.945 ± 0.012
TrpTrp: 0.302 ± 0.007
TrpTyr: 0.473 ± 0.009
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.228 ± 0.019
TyrCys: 0.465 ± 0.009
TyrAsp: 1.694 ± 0.019
TyrGlu: 1.604 ± 0.017
TyrPhe: 1.291 ± 0.018
TyrGly: 2.216 ± 0.025
TyrHis: 0.823 ± 0.012
TyrIle: 1.625 ± 0.017
TyrLys: 1.093 ± 0.013
TyrLeu: 2.923 ± 0.026
TyrMet: 0.67 ± 0.011
TyrAsn: 1.18 ± 0.014
TyrPro: 1.624 ± 0.017
TyrGln: 1.203 ± 0.015
TyrArg: 1.768 ± 0.016
TyrSer: 2.178 ± 0.023
TyrThr: 1.763 ± 0.019
TyrVal: 1.758 ± 0.016
TyrTrp: 0.489 ± 0.008
TyrTyr: 1.026 ± 0.014
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13384 proteins (6142267 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
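A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the procedure used by Proteome-pI: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_counts(seq, pair):
    """Return (occurrences of `pair`, number of overlapping dipeptides) in one protein."""
    n_pairs = max(len(seq) - 1, 0)
    hits = sum(1 for i in range(n_pairs) if seq[i:i + 2] == pair)
    return hits, n_pairs


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Estimate the per mille frequency of `pair` and its bootstrap error.

    Proteins (not individual residues) are resampled with replacement.
    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq, pair) for seq in proteins]
    replicates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        replicates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(replicates), statistics.stdev(replicates)


# Toy input, not real proteome data.
toy_proteins = ["MALLSA", "MKLLA", "MSSSLL", "MAAAK"]
mean, err = bootstrap_permille(toy_proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than single dipeptides keeps the dependence between neighbouring positions within a protein intact, which is presumably why the error is reported "at the protein level".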

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski