Amino acid dipeptide frequency for Dictyobacter aurantiacus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
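As an illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping residue pairs in a few toy sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for this example, not the pipeline used to build the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any non-standard or unknown residue is treated as X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # A protein of length n contributes n - 1 overlapping dipeptides.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome.
toy_sequences = ["MALLA", "MAL"]
freqs = dipeptide_per_mille(toy_sequences)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Pairs are ordered, so `AL` and `LA` are counted separately, which is why the table has 20 × 20 rather than 210 entries.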

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
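The expected value can be checked with a few lines of Python. The sketch below derives the uniform expectation from the 400 standard dipeptides and compares it with a handful of values copied from the table. The `classify` helper and its simple greater-than/less-than rule are assumptions for illustration; the exact threshold used for the colouring is not stated here.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed values (per mille) copied from the table.
observed = {"AlaAla": 9.415, "LeuLeu": 13.041, "CysCys": 0.148, "TrpTrp": 0.284}

for pair, value in observed.items():
    ratio = value / expected_per_mille
    print(f"{pair}: {value} per mille, {ratio:.2f}x expected, {classify(value)}")
```

With 441 possibilities (including Xaa) the uniform expectation would be 1000 / 441 per mille instead, which is slightly lower.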

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
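A short sketch of the unit conversion, using the AlaAla value from the table and the totals reported on this page (7362 proteins, 2368895 amino acids). The count estimate assumes that each protein of length n contributes n - 1 dipeptides and that frequencies are normalized over all dipeptides; the names used are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille / 1000.0


ala_ala_per_mille = 9.415  # AlaAla entry from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)

# Totals reported on this page.
n_proteins = 7362
n_amino_acids = 2368895

# Assumption: a protein of length n yields n - 1 overlapping dipeptides.
n_dipeptides = n_amino_acids - n_proteins

# Rough number of AlaAla occurrences implied by the table value.
estimated_ala_ala_count = ala_ala_fraction * n_dipeptides

print(ala_ala_fraction)
print(round(estimated_ala_ala_count))
```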

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.415 ± 0.075
AlaCys: 1.026 ± 0.023
AlaAsp: 4.296 ± 0.049
AlaGlu: 5.084 ± 0.055
AlaPhe: 3.464 ± 0.04
AlaGly: 6.671 ± 0.063
AlaHis: 2.448 ± 0.034
AlaIle: 5.78 ± 0.052
AlaLys: 2.262 ± 0.034
AlaLeu: 11.343 ± 0.094
AlaMet: 2.303 ± 0.032
AlaAsn: 2.92 ± 0.043
AlaPro: 4.215 ± 0.049
AlaGln: 4.695 ± 0.048
AlaArg: 6.684 ± 0.071
AlaSer: 6.026 ± 0.055
AlaThr: 5.539 ± 0.057
AlaVal: 5.977 ± 0.057
AlaTrp: 1.363 ± 0.025
AlaTyr: 2.882 ± 0.034
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.857 ± 0.02
CysCys: 0.148 ± 0.009
CysAsp: 0.484 ± 0.015
CysGlu: 0.5 ± 0.016
CysPhe: 0.378 ± 0.013
CysGly: 0.919 ± 0.021
CysHis: 0.269 ± 0.013
CysIle: 0.58 ± 0.018
CysLys: 0.24 ± 0.011
CysLeu: 1.009 ± 0.022
CysMet: 0.212 ± 0.009
CysAsn: 0.304 ± 0.012
CysPro: 0.532 ± 0.018
CysGln: 0.457 ± 0.014
CysArg: 0.521 ± 0.016
CysSer: 0.633 ± 0.018
CysThr: 0.539 ± 0.016
CysVal: 0.654 ± 0.018
CysTrp: 0.187 ± 0.009
CysTyr: 0.358 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.598 ± 0.046
AspCys: 0.406 ± 0.015
AspAsp: 2.382 ± 0.039
AspGlu: 3.435 ± 0.041
AspPhe: 1.859 ± 0.029
AspGly: 3.688 ± 0.054
AspHis: 1.201 ± 0.023
AspIle: 3.091 ± 0.039
AspLys: 1.442 ± 0.025
AspLeu: 5.184 ± 0.055
AspMet: 1.176 ± 0.021
AspAsn: 1.589 ± 0.025
AspPro: 3.036 ± 0.038
AspGln: 2.285 ± 0.033
AspArg: 2.711 ± 0.037
AspSer: 2.304 ± 0.03
AspThr: 2.749 ± 0.033
AspVal: 3.443 ± 0.042
AspTrp: 0.851 ± 0.019
AspTyr: 1.61 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.728 ± 0.063
GluCys: 0.431 ± 0.014
GluAsp: 2.703 ± 0.041
GluGlu: 4.321 ± 0.061
GluPhe: 1.462 ± 0.025
GluGly: 3.399 ± 0.042
GluHis: 1.879 ± 0.031
GluIle: 3.274 ± 0.04
GluLys: 2.418 ± 0.036
GluLeu: 5.79 ± 0.061
GluMet: 1.554 ± 0.028
GluAsn: 1.728 ± 0.027
GluPro: 2.404 ± 0.038
GluGln: 4.322 ± 0.051
GluArg: 4.622 ± 0.057
GluSer: 2.564 ± 0.031
GluThr: 3.005 ± 0.037
GluVal: 3.833 ± 0.05
GluTrp: 0.694 ± 0.019
GluTyr: 1.572 ± 0.026
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.313 ± 0.044
PheCys: 0.458 ± 0.015
PheAsp: 1.937 ± 0.029
PheGlu: 1.898 ± 0.029
PhePhe: 1.706 ± 0.034
PheGly: 2.801 ± 0.04
PheHis: 0.908 ± 0.018
PheIle: 2.08 ± 0.033
PheLys: 1.066 ± 0.022
PheLeu: 3.787 ± 0.053
PheMet: 0.828 ± 0.022
PheAsn: 1.251 ± 0.019
PhePro: 1.67 ± 0.03
PheGln: 1.51 ± 0.023
PheArg: 1.703 ± 0.027
PheSer: 2.573 ± 0.037
PheThr: 2.219 ± 0.032
PheVal: 2.553 ± 0.035
PheTrp: 0.612 ± 0.017
PheTyr: 1.247 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.991 ± 0.061
GlyCys: 0.771 ± 0.02
GlyAsp: 3.171 ± 0.04
GlyGlu: 3.797 ± 0.046
GlyPhe: 2.654 ± 0.039
GlyGly: 5.476 ± 0.061
GlyHis: 1.814 ± 0.03
GlyIle: 4.595 ± 0.046
GlyLys: 3.074 ± 0.039
GlyLeu: 7.213 ± 0.068
GlyMet: 1.897 ± 0.028
GlyAsn: 2.538 ± 0.042
GlyPro: 2.822 ± 0.037
GlyGln: 3.437 ± 0.041
GlyArg: 4.238 ± 0.048
GlySer: 4.734 ± 0.049
GlyThr: 4.362 ± 0.056
GlyVal: 4.945 ± 0.055
GlyTrp: 1.305 ± 0.025
GlyTyr: 2.655 ± 0.038
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.232 ± 0.033
HisCys: 0.283 ± 0.011
HisAsp: 1.248 ± 0.025
HisGlu: 1.422 ± 0.023
HisPhe: 1.039 ± 0.022
HisGly: 1.896 ± 0.031
HisHis: 0.794 ± 0.022
HisIle: 1.697 ± 0.029
HisLys: 0.737 ± 0.02
HisLeu: 2.948 ± 0.042
HisMet: 0.632 ± 0.015
HisAsn: 0.876 ± 0.02
HisPro: 1.711 ± 0.03
HisGln: 1.22 ± 0.024
HisArg: 1.358 ± 0.023
HisSer: 1.426 ± 0.026
HisThr: 1.555 ± 0.027
HisVal: 1.735 ± 0.025
HisTrp: 0.443 ± 0.014
HisTyr: 0.941 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.034 ± 0.058
IleCys: 0.665 ± 0.018
IleAsp: 3.262 ± 0.037
IleGlu: 3.704 ± 0.046
IlePhe: 2.257 ± 0.034
IleGly: 4.384 ± 0.048
IleHis: 1.426 ± 0.027
IleIle: 3.471 ± 0.049
IleLys: 1.885 ± 0.032
IleLeu: 5.638 ± 0.061
IleMet: 1.166 ± 0.019
IleAsn: 2.109 ± 0.033
IlePro: 2.997 ± 0.037
IleGln: 2.542 ± 0.034
IleArg: 2.913 ± 0.033
IleSer: 3.772 ± 0.051
IleThr: 3.397 ± 0.036
IleVal: 4.284 ± 0.05
IleTrp: 0.808 ± 0.02
IleTyr: 1.812 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.919 ± 0.046
LysCys: 0.19 ± 0.009
LysAsp: 1.71 ± 0.031
LysGlu: 2.07 ± 0.034
LysPhe: 0.789 ± 0.018
LysGly: 2.128 ± 0.036
LysHis: 0.843 ± 0.019
LysIle: 1.708 ± 0.029
LysLys: 1.572 ± 0.032
LysLeu: 2.987 ± 0.032
LysMet: 0.853 ± 0.019
LysAsn: 1.208 ± 0.025
LysPro: 1.676 ± 0.03
LysGln: 1.89 ± 0.03
LysArg: 2.276 ± 0.035
LysSer: 1.737 ± 0.031
LysThr: 1.957 ± 0.03
LysVal: 2.19 ± 0.037
LysTrp: 0.367 ± 0.014
LysTyr: 0.912 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.164 ± 0.084
LeuCys: 1.21 ± 0.023
LeuAsp: 5.267 ± 0.051
LeuGlu: 5.667 ± 0.06
LeuPhe: 3.99 ± 0.048
LeuGly: 7.287 ± 0.07
LeuHis: 2.784 ± 0.035
LeuIle: 6.036 ± 0.062
LeuLys: 3.442 ± 0.046
LeuLeu: 13.041 ± 0.123
LeuMet: 2.119 ± 0.032
LeuAsn: 3.42 ± 0.04
LeuPro: 6.075 ± 0.061
LeuGln: 5.266 ± 0.054
LeuArg: 6.78 ± 0.073
LeuSer: 7.07 ± 0.062
LeuThr: 6.145 ± 0.055
LeuVal: 7.246 ± 0.063
LeuTrp: 1.439 ± 0.029
LeuTyr: 3.016 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.312 ± 0.033
MetCys: 0.175 ± 0.01
MetAsp: 1.09 ± 0.024
MetGlu: 1.225 ± 0.022
MetPhe: 0.683 ± 0.017
MetGly: 1.659 ± 0.028
MetHis: 0.622 ± 0.015
MetIle: 1.247 ± 0.02
MetLys: 0.818 ± 0.018
MetLeu: 2.763 ± 0.033
MetMet: 0.559 ± 0.018
MetAsn: 0.821 ± 0.019
MetPro: 1.341 ± 0.026
MetGln: 1.318 ± 0.024
MetArg: 1.611 ± 0.026
MetSer: 1.51 ± 0.026
MetThr: 1.369 ± 0.022
MetVal: 1.603 ± 0.025
MetTrp: 0.213 ± 0.009
MetTyr: 0.553 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.199 ± 0.043
AsnCys: 0.305 ± 0.01
AsnAsp: 1.667 ± 0.029
AsnGlu: 1.73 ± 0.028
AsnPhe: 1.162 ± 0.023
AsnGly: 2.644 ± 0.043
AsnHis: 0.746 ± 0.019
AsnIle: 2.069 ± 0.03
AsnLys: 1.079 ± 0.022
AsnLeu: 3.195 ± 0.042
AsnMet: 0.783 ± 0.017
AsnAsn: 1.433 ± 0.032
AsnPro: 2.039 ± 0.03
AsnGln: 1.567 ± 0.028
AsnArg: 1.715 ± 0.03
AsnSer: 1.889 ± 0.03
AsnThr: 2.066 ± 0.036
AsnVal: 2.258 ± 0.033
AsnTrp: 0.581 ± 0.018
AsnTyr: 1.098 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.105 ± 0.048
ProCys: 0.35 ± 0.013
ProAsp: 3.23 ± 0.038
ProGlu: 3.582 ± 0.044
ProPhe: 2.01 ± 0.032
ProGly: 3.961 ± 0.041
ProHis: 1.305 ± 0.02
ProIle: 2.619 ± 0.036
ProLys: 1.271 ± 0.027
ProLeu: 5.406 ± 0.048
ProMet: 1.054 ± 0.021
ProAsn: 1.566 ± 0.032
ProPro: 2.568 ± 0.044
ProGln: 2.533 ± 0.038
ProArg: 2.74 ± 0.04
ProSer: 3.064 ± 0.038
ProThr: 3.299 ± 0.049
ProVal: 3.704 ± 0.048
ProTrp: 0.726 ± 0.019
ProTyr: 1.585 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.428 ± 0.051
GlnCys: 0.368 ± 0.011
GlnAsp: 2.156 ± 0.03
GlnGlu: 3.136 ± 0.049
GlnPhe: 1.494 ± 0.025
GlnGly: 3.433 ± 0.038
GlnHis: 1.526 ± 0.025
GlnIle: 2.676 ± 0.032
GlnLys: 1.868 ± 0.03
GlnLeu: 5.199 ± 0.059
GlnMet: 1.225 ± 0.023
GlnAsn: 1.467 ± 0.028
GlnPro: 2.844 ± 0.038
GlnGln: 3.96 ± 0.058
GlnArg: 3.831 ± 0.04
GlnSer: 2.859 ± 0.04
GlnThr: 2.752 ± 0.035
GlnVal: 3.412 ± 0.036
GlnTrp: 0.703 ± 0.016
GlnTyr: 1.334 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.334 ± 0.058
ArgCys: 0.595 ± 0.016
ArgAsp: 2.915 ± 0.036
ArgGlu: 4.038 ± 0.05
ArgPhe: 2.351 ± 0.03
ArgGly: 3.776 ± 0.047
ArgHis: 1.716 ± 0.029
ArgIle: 3.622 ± 0.041
ArgLys: 2.146 ± 0.034
ArgLeu: 7.005 ± 0.067
ArgMet: 1.597 ± 0.026
ArgAsn: 1.803 ± 0.026
ArgPro: 2.911 ± 0.038
ArgGln: 3.647 ± 0.046
ArgArg: 4.624 ± 0.051
ArgSer: 3.574 ± 0.042
ArgThr: 3.258 ± 0.037
ArgVal: 4.117 ± 0.046
ArgTrp: 1.083 ± 0.024
ArgTyr: 2.269 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.349 ± 0.055
SerCys: 0.573 ± 0.017
SerAsp: 3.128 ± 0.036
SerGlu: 3.133 ± 0.039
SerPhe: 2.38 ± 0.033
SerGly: 4.942 ± 0.053
SerHis: 1.516 ± 0.027
SerIle: 3.547 ± 0.042
SerLys: 1.826 ± 0.03
SerLeu: 6.533 ± 0.052
SerMet: 1.623 ± 0.026
SerAsn: 2.142 ± 0.032
SerPro: 3.19 ± 0.037
SerGln: 2.858 ± 0.038
SerArg: 3.301 ± 0.037
SerSer: 4.54 ± 0.059
SerThr: 3.773 ± 0.049
SerVal: 3.836 ± 0.04
SerTrp: 1.057 ± 0.024
SerTyr: 2.041 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.023 ± 0.05
ThrCys: 0.558 ± 0.017
ThrAsp: 2.664 ± 0.035
ThrGlu: 2.601 ± 0.032
ThrPhe: 2.348 ± 0.036
ThrGly: 4.433 ± 0.05
ThrHis: 1.461 ± 0.025
ThrIle: 3.972 ± 0.049
ThrLys: 1.368 ± 0.028
ThrLeu: 6.706 ± 0.057
ThrMet: 1.318 ± 0.024
ThrAsn: 2.058 ± 0.035
ThrPro: 3.857 ± 0.053
ThrGln: 2.461 ± 0.035
ThrArg: 3.416 ± 0.039
ThrSer: 3.797 ± 0.047
ThrThr: 3.839 ± 0.049
ThrVal: 3.951 ± 0.046
ThrTrp: 0.843 ± 0.023
ThrTyr: 1.91 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.59 ± 0.052
ValCys: 0.7 ± 0.017
ValAsp: 3.439 ± 0.039
ValGlu: 3.922 ± 0.054
ValPhe: 2.389 ± 0.035
ValGly: 4.71 ± 0.052
ValHis: 1.557 ± 0.025
ValIle: 4.024 ± 0.048
ValLys: 2.078 ± 0.035
ValLeu: 7.425 ± 0.075
ValMet: 1.564 ± 0.024
ValAsn: 2.253 ± 0.029
ValPro: 3.523 ± 0.04
ValGln: 3.011 ± 0.036
ValArg: 4.158 ± 0.05
ValSer: 4.341 ± 0.048
ValThr: 4.064 ± 0.049
ValVal: 5.142 ± 0.053
ValTrp: 0.864 ± 0.021
ValTyr: 1.947 ± 0.068
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.98 ± 0.024
TrpCys: 0.177 ± 0.009
TrpAsp: 0.669 ± 0.019
TrpGlu: 0.783 ± 0.019
TrpPhe: 0.583 ± 0.017
TrpGly: 0.977 ± 0.026
TrpHis: 0.474 ± 0.016
TrpIle: 0.803 ± 0.02
TrpLys: 0.54 ± 0.015
TrpLeu: 1.917 ± 0.039
TrpMet: 0.393 ± 0.014
TrpAsn: 0.579 ± 0.016
TrpPro: 0.655 ± 0.018
TrpGln: 0.941 ± 0.023
TrpArg: 1.051 ± 0.022
TrpSer: 1.095 ± 0.025
TrpThr: 0.763 ± 0.018
TrpVal: 0.789 ± 0.021
TrpTrp: 0.284 ± 0.011
TrpTyr: 0.486 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.819 ± 0.04
TyrCys: 0.392 ± 0.013
TyrAsp: 1.62 ± 0.027
TyrGlu: 1.655 ± 0.028
TyrPhe: 1.122 ± 0.023
TyrGly: 2.304 ± 0.036
TyrHis: 0.908 ± 0.023
TyrIle: 1.635 ± 0.025
TyrLys: 0.85 ± 0.022
TyrLeu: 3.377 ± 0.036
TyrMet: 0.637 ± 0.014
TyrAsn: 1.163 ± 0.024
TyrPro: 1.614 ± 0.029
TyrGln: 1.751 ± 0.029
TyrArg: 2.128 ± 0.03
TyrSer: 1.78 ± 0.028
TyrThr: 1.933 ± 0.031
TyrVal: 2.026 ± 0.065
TyrTrp: 0.515 ± 0.014
TyrTyr: 1.117 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7362 proteins (2368895 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
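The note above can be pictured with a small sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only one plausible reading of "bootstrapping (x100) at the protein level"; the function names, the fixed seed, the use of the standard deviation as the error measure, and the toy sequences are all assumptions for this example.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from Dictyobacter aurantiacus.
toy_proteome = ["MALLA", "MAL", "MKLLLA", "MAAAL"]
point_estimate = pair_per_mille(toy_proteome, "LL")
error = bootstrap_error(toy_proteome, "LL")
print(f"LL: {point_estimate:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs within one protein together, so the error reflects variation between proteins.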

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski