Amino acid dipepetide frequency for Ferruginibacter sp. BO-59

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.725AlaAla: 5.725 ± 0.081
0.633AlaCys: 0.633 ± 0.021
3.898AlaAsp: 3.898 ± 0.065
3.878AlaGlu: 3.878 ± 0.057
3.512AlaPhe: 3.512 ± 0.06
5.343AlaGly: 5.343 ± 0.069
1.216AlaHis: 1.216 ± 0.035
5.388AlaIle: 5.388 ± 0.074
4.435AlaLys: 4.435 ± 0.058
6.056AlaLeu: 6.056 ± 0.068
1.684AlaMet: 1.684 ± 0.038
3.562AlaAsn: 3.562 ± 0.06
2.33AlaPro: 2.33 ± 0.039
2.48AlaGln: 2.48 ± 0.045
2.376AlaArg: 2.376 ± 0.044
4.569AlaSer: 4.569 ± 0.069
3.965AlaThr: 3.965 ± 0.069
4.33AlaVal: 4.33 ± 0.064
0.837AlaTrp: 0.837 ± 0.025
2.405AlaTyr: 2.405 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
0.534CysAla: 0.534 ± 0.024
0.172CysCys: 0.172 ± 0.013
0.469CysAsp: 0.469 ± 0.017
0.431CysGlu: 0.431 ± 0.017
0.476CysPhe: 0.476 ± 0.019
0.666CysGly: 0.666 ± 0.025
0.207CysHis: 0.207 ± 0.013
0.711CysIle: 0.711 ± 0.024
0.543CysLys: 0.543 ± 0.02
0.788CysLeu: 0.788 ± 0.025
0.21CysMet: 0.21 ± 0.013
0.5CysAsn: 0.5 ± 0.019
0.363CysPro: 0.363 ± 0.018
0.219CysGln: 0.219 ± 0.012
0.332CysArg: 0.332 ± 0.016
0.577CysSer: 0.577 ± 0.021
0.44CysThr: 0.44 ± 0.022
0.508CysVal: 0.508 ± 0.019
0.101CysTrp: 0.101 ± 0.009
0.361CysTyr: 0.361 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.823AspAla: 3.823 ± 0.053
0.44AspCys: 0.44 ± 0.019
2.544AspAsp: 2.544 ± 0.046
3.573AspGlu: 3.573 ± 0.058
3.327AspPhe: 3.327 ± 0.055
3.786AspGly: 3.786 ± 0.065
0.934AspHis: 0.934 ± 0.026
4.058AspIle: 4.058 ± 0.066
3.98AspLys: 3.98 ± 0.058
4.64AspLeu: 4.64 ± 0.06
1.066AspMet: 1.066 ± 0.028
2.832AspAsn: 2.832 ± 0.047
2.063AspPro: 2.063 ± 0.038
1.404AspGln: 1.404 ± 0.035
1.786AspArg: 1.786 ± 0.037
3.601AspSer: 3.601 ± 0.073
2.655AspThr: 2.655 ± 0.063
3.276AspVal: 3.276 ± 0.051
0.85AspTrp: 0.85 ± 0.024
2.579AspTyr: 2.579 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
3.742GluAla: 3.742 ± 0.055
0.389GluCys: 0.389 ± 0.016
2.668GluAsp: 2.668 ± 0.048
3.85GluGlu: 3.85 ± 0.077
2.514GluPhe: 2.514 ± 0.045
3.335GluGly: 3.335 ± 0.05
0.943GluHis: 0.943 ± 0.026
4.875GluIle: 4.875 ± 0.07
6.024GluLys: 6.024 ± 0.079
5.113GluLeu: 5.113 ± 0.074
1.66GluMet: 1.66 ± 0.029
4.144GluAsn: 4.144 ± 0.066
1.673GluPro: 1.673 ± 0.036
2.096GluGln: 2.096 ± 0.038
2.279GluArg: 2.279 ± 0.045
3.024GluSer: 3.024 ± 0.05
2.91GluThr: 2.91 ± 0.05
3.433GluVal: 3.433 ± 0.051
0.764GluTrp: 0.764 ± 0.025
2.078GluTyr: 2.078 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.37PheAla: 3.37 ± 0.05
0.565PheCys: 0.565 ± 0.021
3.043PheAsp: 3.043 ± 0.052
2.877PheGlu: 2.877 ± 0.048
2.984PhePhe: 2.984 ± 0.052
3.47PheGly: 3.47 ± 0.053
0.985PheHis: 0.985 ± 0.028
4.101PheIle: 4.101 ± 0.049
3.422PheLys: 3.422 ± 0.046
4.953PheLeu: 4.953 ± 0.07
1.126PheMet: 1.126 ± 0.028
3.071PheAsn: 3.071 ± 0.046
2.037PhePro: 2.037 ± 0.041
1.615PheGln: 1.615 ± 0.036
1.878PheArg: 1.878 ± 0.036
4.289PheSer: 4.289 ± 0.063
3.165PheThr: 3.165 ± 0.051
2.993PheVal: 2.993 ± 0.048
0.657PheTrp: 0.657 ± 0.021
2.269PheTyr: 2.269 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
4.377GlyAla: 4.377 ± 0.066
0.648GlyCys: 0.648 ± 0.027
3.468GlyAsp: 3.468 ± 0.051
3.307GlyGlu: 3.307 ± 0.043
3.649GlyPhe: 3.649 ± 0.051
4.832GlyGly: 4.832 ± 0.074
1.182GlyHis: 1.182 ± 0.031
5.66GlyIle: 5.66 ± 0.068
5.605GlyLys: 5.605 ± 0.067
5.676GlyLeu: 5.676 ± 0.061
1.678GlyMet: 1.678 ± 0.039
4.1GlyAsn: 4.1 ± 0.076
1.673GlyPro: 1.673 ± 0.036
1.983GlyGln: 1.983 ± 0.041
2.438GlyArg: 2.438 ± 0.041
4.302GlySer: 4.302 ± 0.069
3.971GlyThr: 3.971 ± 0.082
3.953GlyVal: 3.953 ± 0.056
1.08GlyTrp: 1.08 ± 0.031
2.98GlyTyr: 2.98 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
1.139HisAla: 1.139 ± 0.029
0.205HisCys: 0.205 ± 0.013
0.942HisAsp: 0.942 ± 0.026
1.009HisGlu: 1.009 ± 0.031
1.31HisPhe: 1.31 ± 0.03
1.113HisGly: 1.113 ± 0.032
0.527HisHis: 0.527 ± 0.019
1.346HisIle: 1.346 ± 0.03
1.049HisLys: 1.049 ± 0.028
1.922HisLeu: 1.922 ± 0.038
0.312HisMet: 0.312 ± 0.015
0.958HisAsn: 0.958 ± 0.026
1.083HisPro: 1.083 ± 0.028
0.742HisGln: 0.742 ± 0.023
0.765HisArg: 0.765 ± 0.023
1.213HisSer: 1.213 ± 0.025
0.976HisThr: 0.976 ± 0.028
0.972HisVal: 0.972 ± 0.026
0.299HisTrp: 0.299 ± 0.015
0.852HisTyr: 0.852 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.667IleAla: 5.667 ± 0.08
0.806IleCys: 0.806 ± 0.027
4.256IleAsp: 4.256 ± 0.055
4.534IleGlu: 4.534 ± 0.063
3.965IlePhe: 3.965 ± 0.06
4.677IleGly: 4.677 ± 0.062
1.485IleHis: 1.485 ± 0.032
6.013IleIle: 6.013 ± 0.091
5.561IleLys: 5.561 ± 0.068
6.806IleLeu: 6.806 ± 0.075
1.398IleMet: 1.398 ± 0.033
4.729IleAsn: 4.729 ± 0.063
3.449IlePro: 3.449 ± 0.049
2.454IleGln: 2.454 ± 0.044
2.893IleArg: 2.893 ± 0.044
5.87IleSer: 5.87 ± 0.078
4.632IleThr: 4.632 ± 0.059
4.453IleVal: 4.453 ± 0.066
0.769IleTrp: 0.769 ± 0.025
2.778IleTyr: 2.778 ± 0.05
0.0IleXaa: 0.0 ± 0.0
Lys
4.696LysAla: 4.696 ± 0.062
0.385LysCys: 0.385 ± 0.015
4.38LysAsp: 4.38 ± 0.058
5.574LysGlu: 5.574 ± 0.077
2.963LysPhe: 2.963 ± 0.05
4.475LysGly: 4.475 ± 0.057
1.131LysHis: 1.131 ± 0.031
6.494LysIle: 6.494 ± 0.067
6.969LysLys: 6.969 ± 0.084
6.033LysLeu: 6.033 ± 0.07
2.232LysMet: 2.232 ± 0.038
5.683LysAsn: 5.683 ± 0.057
2.813LysPro: 2.813 ± 0.043
2.597LysGln: 2.597 ± 0.044
2.833LysArg: 2.833 ± 0.046
4.451LysSer: 4.451 ± 0.058
4.095LysThr: 4.095 ± 0.051
4.318LysVal: 4.318 ± 0.063
0.946LysTrp: 0.946 ± 0.026
3.1LysTyr: 3.1 ± 0.049
0.0LysXaa: 0.0 ± 0.0
Leu
5.894LeuAla: 5.894 ± 0.073
0.802LeuCys: 0.802 ± 0.027
4.282LeuAsp: 4.282 ± 0.056
4.589LeuGlu: 4.589 ± 0.07
4.944LeuPhe: 4.944 ± 0.059
5.269LeuGly: 5.269 ± 0.068
1.852LeuHis: 1.852 ± 0.04
6.654LeuIle: 6.654 ± 0.084
7.268LeuLys: 7.268 ± 0.069
8.735LeuLeu: 8.735 ± 0.098
2.089LeuMet: 2.089 ± 0.043
5.369LeuAsn: 5.369 ± 0.086
4.165LeuPro: 4.165 ± 0.049
3.886LeuGln: 3.886 ± 0.057
3.477LeuArg: 3.477 ± 0.051
6.935LeuSer: 6.935 ± 0.081
4.746LeuThr: 4.746 ± 0.058
5.043LeuVal: 5.043 ± 0.063
1.005LeuTrp: 1.005 ± 0.03
3.256LeuTyr: 3.256 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
1.825MetAla: 1.825 ± 0.035
0.181MetCys: 0.181 ± 0.012
1.201MetAsp: 1.201 ± 0.031
1.446MetGlu: 1.446 ± 0.033
0.825MetPhe: 0.825 ± 0.026
1.378MetGly: 1.378 ± 0.028
0.48MetHis: 0.48 ± 0.017
1.613MetIle: 1.613 ± 0.032
2.207MetLys: 2.207 ± 0.04
2.062MetLeu: 2.062 ± 0.053
0.698MetMet: 0.698 ± 0.024
1.473MetAsn: 1.473 ± 0.029
1.108MetPro: 1.108 ± 0.03
1.02MetGln: 1.02 ± 0.027
0.925MetArg: 0.925 ± 0.027
1.294MetSer: 1.294 ± 0.034
1.007MetThr: 1.007 ± 0.026
1.406MetVal: 1.406 ± 0.028
0.231MetTrp: 0.231 ± 0.011
0.735MetTyr: 0.735 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
4.009AsnAla: 4.009 ± 0.057
0.504AsnCys: 0.504 ± 0.02
3.191AsnAsp: 3.191 ± 0.059
3.567AsnGlu: 3.567 ± 0.057
3.17AsnPhe: 3.17 ± 0.046
4.258AsnGly: 4.258 ± 0.077
1.152AsnHis: 1.152 ± 0.029
4.765AsnIle: 4.765 ± 0.064
4.21AsnLys: 4.21 ± 0.057
5.231AsnLeu: 5.231 ± 0.07
1.203AsnMet: 1.203 ± 0.027
3.893AsnAsn: 3.893 ± 0.067
3.005AsnPro: 3.005 ± 0.046
1.979AsnGln: 1.979 ± 0.038
2.204AsnArg: 2.204 ± 0.039
3.91AsnSer: 3.91 ± 0.06
3.123AsnThr: 3.123 ± 0.052
3.484AsnVal: 3.484 ± 0.044
0.793AsnTrp: 0.793 ± 0.025
2.771AsnTyr: 2.771 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
3.142ProAla: 3.142 ± 0.05
0.265ProCys: 0.265 ± 0.016
2.749ProAsp: 2.749 ± 0.042
2.857ProGlu: 2.857 ± 0.045
2.128ProPhe: 2.128 ± 0.044
3.14ProGly: 3.14 ± 0.055
0.744ProHis: 0.744 ± 0.02
2.227ProIle: 2.227 ± 0.04
2.303ProLys: 2.303 ± 0.044
3.413ProLeu: 3.413 ± 0.05
0.801ProMet: 0.801 ± 0.025
1.913ProAsn: 1.913 ± 0.038
1.294ProPro: 1.294 ± 0.035
1.424ProGln: 1.424 ± 0.032
1.13ProArg: 1.13 ± 0.031
2.6ProSer: 2.6 ± 0.043
1.896ProThr: 1.896 ± 0.045
3.406ProVal: 3.406 ± 0.055
0.48ProTrp: 0.48 ± 0.018
1.571ProTyr: 1.571 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
2.148GlnAla: 2.148 ± 0.04
0.195GlnCys: 0.195 ± 0.012
1.423GlnAsp: 1.423 ± 0.033
1.757GlnGlu: 1.757 ± 0.042
1.799GlnPhe: 1.799 ± 0.034
1.74GlnGly: 1.74 ± 0.037
0.706GlnHis: 0.706 ± 0.024
2.594GlnIle: 2.594 ± 0.046
3.342GlnLys: 3.342 ± 0.047
3.589GlnLeu: 3.589 ± 0.055
0.952GlnMet: 0.952 ± 0.028
2.34GlnAsn: 2.34 ± 0.039
1.384GlnPro: 1.384 ± 0.03
1.911GlnGln: 1.911 ± 0.048
1.336GlnArg: 1.336 ± 0.036
2.315GlnSer: 2.315 ± 0.043
1.92GlnThr: 1.92 ± 0.041
2.045GlnVal: 2.045 ± 0.043
0.477GlnTrp: 0.477 ± 0.018
1.404GlnTyr: 1.404 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
2.103ArgAla: 2.103 ± 0.036
0.264ArgCys: 0.264 ± 0.013
1.849ArgAsp: 1.849 ± 0.039
2.147ArgGlu: 2.147 ± 0.04
2.03ArgPhe: 2.03 ± 0.037
2.072ArgGly: 2.072 ± 0.045
0.691ArgHis: 0.691 ± 0.022
3.076ArgIle: 3.076 ± 0.054
3.098ArgLys: 3.098 ± 0.045
3.516ArgLeu: 3.516 ± 0.05
1.029ArgMet: 1.029 ± 0.026
2.415ArgAsn: 2.415 ± 0.043
1.336ArgPro: 1.336 ± 0.032
1.478ArgGln: 1.478 ± 0.032
1.6ArgArg: 1.6 ± 0.038
2.054ArgSer: 2.054 ± 0.035
1.809ArgThr: 1.809 ± 0.033
1.987ArgVal: 1.987 ± 0.043
0.56ArgTrp: 0.56 ± 0.022
1.576ArgTyr: 1.576 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
4.66SerAla: 4.66 ± 0.065
0.674SerCys: 0.674 ± 0.024
3.588SerAsp: 3.588 ± 0.052
3.527SerGlu: 3.527 ± 0.05
4.13SerPhe: 4.13 ± 0.051
5.211SerGly: 5.211 ± 0.078
1.183SerHis: 1.183 ± 0.03
5.068SerIle: 5.068 ± 0.066
4.477SerLys: 4.477 ± 0.062
6.462SerLeu: 6.462 ± 0.084
1.421SerMet: 1.421 ± 0.031
3.468SerAsn: 3.468 ± 0.053
2.612SerPro: 2.612 ± 0.046
2.245SerGln: 2.245 ± 0.043
2.393SerArg: 2.393 ± 0.046
4.719SerSer: 4.719 ± 0.065
3.489SerThr: 3.489 ± 0.063
4.453SerVal: 4.453 ± 0.059
0.932SerTrp: 0.932 ± 0.023
2.701SerTyr: 2.701 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
4.101ThrAla: 4.101 ± 0.078
0.376ThrCys: 0.376 ± 0.018
3.103ThrAsp: 3.103 ± 0.051
2.818ThrGlu: 2.818 ± 0.053
2.767ThrPhe: 2.767 ± 0.04
4.571ThrGly: 4.571 ± 0.073
0.99ThrHis: 0.99 ± 0.028
4.311ThrIle: 4.311 ± 0.064
3.391ThrLys: 3.391 ± 0.05
4.917ThrLeu: 4.917 ± 0.063
1.021ThrMet: 1.021 ± 0.03
2.987ThrAsn: 2.987 ± 0.053
2.403ThrPro: 2.403 ± 0.046
1.747ThrGln: 1.747 ± 0.028
1.724ThrArg: 1.724 ± 0.033
3.625ThrSer: 3.625 ± 0.058
3.191ThrThr: 3.191 ± 0.071
3.6ThrVal: 3.6 ± 0.063
0.662ThrTrp: 0.662 ± 0.025
2.012ThrTyr: 2.012 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
4.434ValAla: 4.434 ± 0.062
0.616ValCys: 0.616 ± 0.024
3.176ValAsp: 3.176 ± 0.048
3.149ValGlu: 3.149 ± 0.057
3.204ValPhe: 3.204 ± 0.049
3.543ValGly: 3.543 ± 0.057
1.128ValHis: 1.128 ± 0.029
4.866ValIle: 4.866 ± 0.067
4.408ValLys: 4.408 ± 0.052
5.422ValLeu: 5.422 ± 0.056
1.406ValMet: 1.406 ± 0.033
3.634ValAsn: 3.634 ± 0.052
2.501ValPro: 2.501 ± 0.048
1.935ValGln: 1.935 ± 0.041
2.054ValArg: 2.054 ± 0.041
4.498ValSer: 4.498 ± 0.058
3.537ValThr: 3.537 ± 0.059
3.779ValVal: 3.779 ± 0.058
0.721ValTrp: 0.721 ± 0.023
2.299ValTyr: 2.299 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.763TrpAla: 0.763 ± 0.025
0.112TrpCys: 0.112 ± 0.011
0.738TrpAsp: 0.738 ± 0.024
0.645TrpGlu: 0.645 ± 0.022
0.645TrpPhe: 0.645 ± 0.022
0.878TrpGly: 0.878 ± 0.027
0.289TrpHis: 0.289 ± 0.013
0.905TrpIle: 0.905 ± 0.027
1.099TrpLys: 1.099 ± 0.028
1.234TrpLeu: 1.234 ± 0.032
0.393TrpMet: 0.393 ± 0.019
0.855TrpAsn: 0.855 ± 0.024
0.442TrpPro: 0.442 ± 0.019
0.594TrpGln: 0.594 ± 0.02
0.545TrpArg: 0.545 ± 0.019
0.71TrpSer: 0.71 ± 0.025
0.654TrpThr: 0.654 ± 0.024
0.716TrpVal: 0.716 ± 0.025
0.253TrpTrp: 0.253 ± 0.015
0.523TrpTyr: 0.523 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.471TyrAla: 2.471 ± 0.051
0.376TyrCys: 0.376 ± 0.018
2.252TyrAsp: 2.252 ± 0.04
1.975TyrGlu: 1.975 ± 0.039
2.524TyrPhe: 2.524 ± 0.042
2.805TyrGly: 2.805 ± 0.048
0.909TyrHis: 0.909 ± 0.024
2.469TyrIle: 2.469 ± 0.039
2.779TyrLys: 2.779 ± 0.045
3.695TyrLeu: 3.695 ± 0.052
0.734TyrMet: 0.734 ± 0.023
2.499TyrAsn: 2.499 ± 0.044
1.75TyrPro: 1.75 ± 0.038
1.569TyrGln: 1.569 ± 0.038
1.723TyrArg: 1.723 ± 0.035
2.855TyrSer: 2.855 ± 0.043
2.135TyrThr: 2.135 ± 0.038
2.155TyrVal: 2.155 ± 0.035
0.576TyrTrp: 0.576 ± 0.021
1.871TyrTyr: 1.871 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4167 proteins (1453408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski