Amino acid dipeptide frequency for Oceaniovalibus sp. ACAM 378

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
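The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, and dipeptides are assumed to be counted as overlapping windows that never span two proteins.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: overlapping windows of length 2, counted within each
        # protein only (a window never joins the end of one protein to the
        # start of the next).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, n_letters=20):
    """Compare a frequency with the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 20 amino acids
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy input: "MAAL" gives MA, AA, AL and "AAG" gives AA, AG (5 dipeptides).
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
print(classify(16.472), classify(0.12))  # AlaAla and CysCys from the table
```

With 20 amino acids the threshold is 2.5‰, so a value such as 16.472 (AlaAla) falls on the overrepresented side and 0.12 (CysCys) on the underrepresented side.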

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
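As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and are not part of Proteome-pI.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (1% = 10 per mille)."""
    return value_per_mille / 10.0


ala_ala = 16.472  # AlaAla value taken from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # roughly 0.0165 as a fraction
print(per_mille_to_percent(ala_ala))   # roughly 1.65 percent
print(per_mille_to_percent(2.5))       # the 0.25% uniform expectation
```

In these units, the 0.25% expectation for a uniformly distributed dipeptide corresponds to 2.5‰.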

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
16.472AlaAla: 16.472 ± 0.184
1.151AlaCys: 1.151 ± 0.031
7.175AlaAsp: 7.175 ± 0.08
7.175AlaGlu: 7.175 ± 0.082
4.452AlaPhe: 4.452 ± 0.067
10.594AlaGly: 10.594 ± 0.117
2.23AlaHis: 2.23 ± 0.036
6.282AlaIle: 6.282 ± 0.069
3.5AlaLys: 3.5 ± 0.051
13.89AlaLeu: 13.89 ± 0.138
3.911AlaMet: 3.911 ± 0.049
2.781AlaAsn: 2.781 ± 0.042
5.818AlaPro: 5.818 ± 0.082
4.26AlaGln: 4.26 ± 0.058
8.742AlaArg: 8.742 ± 0.104
5.516AlaSer: 5.516 ± 0.06
6.477AlaThr: 6.477 ± 0.078
8.759AlaVal: 8.759 ± 0.093
1.488AlaTrp: 1.488 ± 0.034
2.206AlaTyr: 2.206 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.121CysAla: 1.121 ± 0.03
0.12CysCys: 0.12 ± 0.01
0.683CysAsp: 0.683 ± 0.025
0.439CysGlu: 0.439 ± 0.018
0.339CysPhe: 0.339 ± 0.015
0.934CysGly: 0.934 ± 0.024
0.298CysHis: 0.298 ± 0.014
0.446CysIle: 0.446 ± 0.017
0.206CysLys: 0.206 ± 0.011
0.871CysLeu: 0.871 ± 0.028
0.18CysMet: 0.18 ± 0.011
0.225CysAsn: 0.225 ± 0.013
0.501CysPro: 0.501 ± 0.017
0.212CysGln: 0.212 ± 0.011
0.6CysArg: 0.6 ± 0.02
0.463CysSer: 0.463 ± 0.02
0.449CysThr: 0.449 ± 0.018
0.647CysVal: 0.647 ± 0.02
0.126CysTrp: 0.126 ± 0.009
0.187CysTyr: 0.187 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
7.213AspAla: 7.213 ± 0.09
0.53AspCys: 0.53 ± 0.018
3.732AspAsp: 3.732 ± 0.064
3.293AspGlu: 3.293 ± 0.055
2.307AspPhe: 2.307 ± 0.045
5.685AspGly: 5.685 ± 0.075
1.425AspHis: 1.425 ± 0.03
3.283AspIle: 3.283 ± 0.046
1.561AspLys: 1.561 ± 0.034
6.555AspLeu: 6.555 ± 0.072
1.64AspMet: 1.64 ± 0.029
1.298AspAsn: 1.298 ± 0.032
3.885AspPro: 3.885 ± 0.055
1.792AspGln: 1.792 ± 0.037
4.602AspArg: 4.602 ± 0.057
2.502AspSer: 2.502 ± 0.048
3.331AspThr: 3.331 ± 0.051
4.264AspVal: 4.264 ± 0.053
1.141AspTrp: 1.141 ± 0.031
1.362AspTyr: 1.362 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
6.457GluAla: 6.457 ± 0.079
0.359GluCys: 0.359 ± 0.016
2.794GluAsp: 2.794 ± 0.05
2.572GluGlu: 2.572 ± 0.054
1.756GluPhe: 1.756 ± 0.033
4.014GluGly: 4.014 ± 0.054
1.047GluHis: 1.047 ± 0.027
3.538GluIle: 3.538 ± 0.05
1.883GluLys: 1.883 ± 0.045
4.757GluLeu: 4.757 ± 0.063
1.672GluMet: 1.672 ± 0.033
1.593GluAsn: 1.593 ± 0.039
2.47GluPro: 2.47 ± 0.046
2.013GluGln: 2.013 ± 0.037
4.045GluArg: 4.045 ± 0.056
2.492GluSer: 2.492 ± 0.044
3.829GluThr: 3.829 ± 0.051
3.836GluVal: 3.836 ± 0.052
0.702GluTrp: 0.702 ± 0.023
1.051GluTyr: 1.051 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.677PheAla: 4.677 ± 0.056
0.444PheCys: 0.444 ± 0.018
2.999PheAsp: 2.999 ± 0.048
1.958PheGlu: 1.958 ± 0.038
1.42PhePhe: 1.42 ± 0.033
3.765PheGly: 3.765 ± 0.05
0.823PheHis: 0.823 ± 0.022
1.713PheIle: 1.713 ± 0.041
0.909PheLys: 0.909 ± 0.027
3.571PheLeu: 3.571 ± 0.048
0.878PheMet: 0.878 ± 0.023
1.041PheAsn: 1.041 ± 0.025
1.664PhePro: 1.664 ± 0.031
1.062PheGln: 1.062 ± 0.028
2.192PheArg: 2.192 ± 0.039
2.228PheSer: 2.228 ± 0.041
2.207PheThr: 2.207 ± 0.035
2.799PheVal: 2.799 ± 0.043
0.555PheTrp: 0.555 ± 0.023
0.828PheTyr: 0.828 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
10.175GlyAla: 10.175 ± 0.084
0.906GlyCys: 0.906 ± 0.027
5.047GlyAsp: 5.047 ± 0.064
4.412GlyGlu: 4.412 ± 0.057
3.777GlyPhe: 3.777 ± 0.055
7.553GlyGly: 7.553 ± 0.105
1.957GlyHis: 1.957 ± 0.036
4.711GlyIle: 4.711 ± 0.063
2.929GlyLys: 2.929 ± 0.054
9.212GlyLeu: 9.212 ± 0.086
2.639GlyMet: 2.639 ± 0.043
2.198GlyAsn: 2.198 ± 0.052
3.744GlyPro: 3.744 ± 0.055
3.169GlyGln: 3.169 ± 0.043
5.959GlyArg: 5.959 ± 0.073
4.307GlySer: 4.307 ± 0.058
4.913GlyThr: 4.913 ± 0.069
6.56GlyVal: 6.56 ± 0.083
1.503GlyTrp: 1.503 ± 0.037
2.263GlyTyr: 2.263 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.301HisAla: 2.301 ± 0.039
0.266HisCys: 0.266 ± 0.013
1.327HisAsp: 1.327 ± 0.034
0.956HisGlu: 0.956 ± 0.026
0.791HisPhe: 0.791 ± 0.023
1.937HisGly: 1.937 ± 0.037
0.569HisHis: 0.569 ± 0.02
1.021HisIle: 1.021 ± 0.029
0.477HisLys: 0.477 ± 0.017
2.182HisLeu: 2.182 ± 0.037
0.541HisMet: 0.541 ± 0.019
0.479HisAsn: 0.479 ± 0.015
1.454HisPro: 1.454 ± 0.033
0.534HisGln: 0.534 ± 0.018
1.491HisArg: 1.491 ± 0.031
0.993HisSer: 0.993 ± 0.026
0.882HisThr: 0.882 ± 0.026
1.437HisVal: 1.437 ± 0.031
0.357HisTrp: 0.357 ± 0.014
0.489HisTyr: 0.489 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
7.728IleAla: 7.728 ± 0.084
0.602IleCys: 0.602 ± 0.021
3.741IleAsp: 3.741 ± 0.057
3.482IleGlu: 3.482 ± 0.052
1.848IlePhe: 1.848 ± 0.037
5.212IleGly: 5.212 ± 0.06
0.984IleHis: 0.984 ± 0.027
2.258IleIle: 2.258 ± 0.052
1.342IleLys: 1.342 ± 0.034
5.078IleLeu: 5.078 ± 0.075
1.132IleMet: 1.132 ± 0.029
1.377IleAsn: 1.377 ± 0.033
2.51IlePro: 2.51 ± 0.048
1.171IleGln: 1.171 ± 0.029
3.295IleArg: 3.295 ± 0.049
3.081IleSer: 3.081 ± 0.046
3.134IleThr: 3.134 ± 0.046
4.129IleVal: 4.129 ± 0.062
0.717IleTrp: 0.717 ± 0.023
1.099IleTyr: 1.099 ± 0.027
0.0IleXaa: 0.0 ± 0.0
Lys
3.404LysAla: 3.404 ± 0.058
0.183LysCys: 0.183 ± 0.011
1.567LysAsp: 1.567 ± 0.034
1.276LysGlu: 1.276 ± 0.034
0.843LysPhe: 0.843 ± 0.026
2.503LysGly: 2.503 ± 0.045
0.536LysHis: 0.536 ± 0.021
1.58LysIle: 1.58 ± 0.037
1.054LysLys: 1.054 ± 0.028
2.692LysLeu: 2.692 ± 0.05
0.799LysMet: 0.799 ± 0.026
0.75LysAsn: 0.75 ± 0.026
1.663LysPro: 1.663 ± 0.036
0.861LysGln: 0.861 ± 0.027
2.102LysArg: 2.102 ± 0.042
1.795LysSer: 1.795 ± 0.038
1.927LysThr: 1.927 ± 0.039
2.076LysVal: 2.076 ± 0.04
0.381LysTrp: 0.381 ± 0.015
0.613LysTyr: 0.613 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
12.993LeuAla: 12.993 ± 0.11
1.0LeuCys: 1.0 ± 0.028
5.975LeuAsp: 5.975 ± 0.069
4.862LeuGlu: 4.862 ± 0.059
3.688LeuPhe: 3.688 ± 0.055
8.567LeuGly: 8.567 ± 0.086
1.917LeuHis: 1.917 ± 0.04
5.857LeuIle: 5.857 ± 0.075
3.014LeuLys: 3.014 ± 0.048
9.307LeuLeu: 9.307 ± 0.116
2.653LeuMet: 2.653 ± 0.043
2.903LeuAsn: 2.903 ± 0.045
5.674LeuPro: 5.674 ± 0.076
2.595LeuGln: 2.595 ± 0.047
7.152LeuArg: 7.152 ± 0.085
7.083LeuSer: 7.083 ± 0.075
6.402LeuThr: 6.402 ± 0.066
6.779LeuVal: 6.779 ± 0.091
1.336LeuTrp: 1.336 ± 0.034
1.856LeuTyr: 1.856 ± 0.034
0.0LeuXaa: 0.0 ± 0.0
Met
3.51MetAla: 3.51 ± 0.052
0.186MetCys: 0.186 ± 0.01
1.373MetAsp: 1.373 ± 0.03
1.224MetGlu: 1.224 ± 0.031
0.895MetPhe: 0.895 ± 0.025
2.191MetGly: 2.191 ± 0.043
0.445MetHis: 0.445 ± 0.016
1.731MetIle: 1.731 ± 0.037
1.018MetLys: 1.018 ± 0.027
2.688MetLeu: 2.688 ± 0.045
0.761MetMet: 0.761 ± 0.023
0.871MetAsn: 0.871 ± 0.022
1.548MetPro: 1.548 ± 0.029
0.994MetGln: 0.994 ± 0.027
1.841MetArg: 1.841 ± 0.034
1.693MetSer: 1.693 ± 0.034
2.207MetThr: 2.207 ± 0.038
1.96MetVal: 1.96 ± 0.036
0.242MetTrp: 0.242 ± 0.012
0.372MetTyr: 0.372 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.167AsnAla: 3.167 ± 0.048
0.236AsnCys: 0.236 ± 0.014
1.535AsnAsp: 1.535 ± 0.043
1.187AsnGlu: 1.187 ± 0.032
0.965AsnPhe: 0.965 ± 0.025
2.444AsnGly: 2.444 ± 0.044
0.524AsnHis: 0.524 ± 0.018
1.394AsnIle: 1.394 ± 0.034
0.602AsnLys: 0.602 ± 0.022
2.586AsnLeu: 2.586 ± 0.043
0.668AsnMet: 0.668 ± 0.02
0.7AsnAsn: 0.7 ± 0.026
1.9AsnPro: 1.9 ± 0.033
0.656AsnGln: 0.656 ± 0.021
1.926AsnArg: 1.926 ± 0.035
1.232AsnSer: 1.232 ± 0.03
1.365AsnThr: 1.365 ± 0.031
1.827AsnVal: 1.827 ± 0.035
0.404AsnTrp: 0.404 ± 0.015
0.58AsnTyr: 0.58 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
5.935ProAla: 5.935 ± 0.079
0.368ProCys: 0.368 ± 0.017
4.198ProAsp: 4.198 ± 0.059
3.762ProGlu: 3.762 ± 0.052
1.998ProPhe: 1.998 ± 0.034
4.954ProGly: 4.954 ± 0.071
1.074ProHis: 1.074 ± 0.029
2.328ProIle: 2.328 ± 0.042
1.507ProLys: 1.507 ± 0.032
4.864ProLeu: 4.864 ± 0.059
1.365ProMet: 1.365 ± 0.035
1.292ProAsn: 1.292 ± 0.027
2.486ProPro: 2.486 ± 0.044
1.586ProGln: 1.586 ± 0.037
2.935ProArg: 2.935 ± 0.051
2.569ProSer: 2.569 ± 0.04
2.642ProThr: 2.642 ± 0.043
4.594ProVal: 4.594 ± 0.052
0.696ProTrp: 0.696 ± 0.023
1.064ProTyr: 1.064 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.758GlnAla: 3.758 ± 0.055
0.21GlnCys: 0.21 ± 0.012
1.666GlnAsp: 1.666 ± 0.032
1.308GlnGlu: 1.308 ± 0.03
1.051GlnPhe: 1.051 ± 0.029
2.606GlnGly: 2.606 ± 0.047
0.591GlnHis: 0.591 ± 0.021
2.022GlnIle: 2.022 ± 0.039
0.967GlnLys: 0.967 ± 0.028
2.734GlnLeu: 2.734 ± 0.041
1.036GlnMet: 1.036 ± 0.027
0.865GlnAsn: 0.865 ± 0.024
1.57GlnPro: 1.57 ± 0.029
1.082GlnGln: 1.082 ± 0.036
2.299GlnArg: 2.299 ± 0.042
1.872GlnSer: 1.872 ± 0.038
1.922GlnThr: 1.922 ± 0.039
2.352GlnVal: 2.352 ± 0.041
0.395GlnTrp: 0.395 ± 0.017
0.557GlnTyr: 0.557 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
8.311ArgAla: 8.311 ± 0.078
0.534ArgCys: 0.534 ± 0.021
4.538ArgAsp: 4.538 ± 0.071
3.438ArgGlu: 3.438 ± 0.05
2.781ArgPhe: 2.781 ± 0.052
4.778ArgGly: 4.778 ± 0.061
1.62ArgHis: 1.62 ± 0.031
4.168ArgIle: 4.168 ± 0.056
2.059ArgLys: 2.059 ± 0.048
7.503ArgLeu: 7.503 ± 0.078
2.102ArgMet: 2.102 ± 0.039
1.896ArgAsn: 1.896 ± 0.037
3.443ArgPro: 3.443 ± 0.056
2.369ArgGln: 2.369 ± 0.045
5.309ArgArg: 5.309 ± 0.072
3.421ArgSer: 3.421 ± 0.057
3.279ArgThr: 3.279 ± 0.046
4.733ArgVal: 4.733 ± 0.06
0.997ArgTrp: 0.997 ± 0.029
1.463ArgTyr: 1.463 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
6.098SerAla: 6.098 ± 0.065
0.426SerCys: 0.426 ± 0.016
3.454SerAsp: 3.454 ± 0.056
2.832SerGlu: 2.832 ± 0.048
2.233SerPhe: 2.233 ± 0.037
5.761SerGly: 5.761 ± 0.078
1.109SerHis: 1.109 ± 0.026
2.664SerIle: 2.664 ± 0.044
1.406SerLys: 1.406 ± 0.035
5.185SerLeu: 5.185 ± 0.054
1.393SerMet: 1.393 ± 0.031
1.368SerAsn: 1.368 ± 0.03
2.677SerPro: 2.677 ± 0.042
1.563SerGln: 1.563 ± 0.031
3.402SerArg: 3.402 ± 0.055
2.613SerSer: 2.613 ± 0.051
2.666SerThr: 2.666 ± 0.041
4.069SerVal: 4.069 ± 0.051
0.682SerTrp: 0.682 ± 0.022
1.182SerTyr: 1.182 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
6.773ThrAla: 6.773 ± 0.072
0.505ThrCys: 0.505 ± 0.018
3.324ThrAsp: 3.324 ± 0.054
3.014ThrGlu: 3.014 ± 0.047
2.113ThrPhe: 2.113 ± 0.039
5.951ThrGly: 5.951 ± 0.068
1.154ThrHis: 1.154 ± 0.029
2.882ThrIle: 2.882 ± 0.055
1.41ThrLys: 1.41 ± 0.032
6.429ThrLeu: 6.429 ± 0.064
1.401ThrMet: 1.401 ± 0.028
1.289ThrAsn: 1.289 ± 0.032
3.68ThrPro: 3.68 ± 0.054
1.533ThrGln: 1.533 ± 0.033
3.631ThrArg: 3.631 ± 0.054
2.799ThrSer: 2.799 ± 0.047
3.081ThrThr: 3.081 ± 0.054
4.554ThrVal: 4.554 ± 0.053
0.713ThrTrp: 0.713 ± 0.022
1.141ThrTyr: 1.141 ± 0.029
0.0ThrXaa: 0.0 ± 0.0
Val
9.122ValAla: 9.122 ± 0.085
0.639ValCys: 0.639 ± 0.022
3.992ValAsp: 3.992 ± 0.057
4.018ValGlu: 4.018 ± 0.049
3.046ValPhe: 3.046 ± 0.043
5.53ValGly: 5.53 ± 0.074
1.367ValHis: 1.367 ± 0.033
4.318ValIle: 4.318 ± 0.057
1.889ValLys: 1.889 ± 0.04
7.677ValLeu: 7.677 ± 0.075
2.173ValMet: 2.173 ± 0.044
2.002ValAsn: 2.002 ± 0.036
3.754ValPro: 3.754 ± 0.047
2.208ValGln: 2.208 ± 0.039
4.515ValArg: 4.515 ± 0.061
4.173ValSer: 4.173 ± 0.056
4.863ValThr: 4.863 ± 0.061
5.633ValVal: 5.633 ± 0.074
1.021ValTrp: 1.021 ± 0.028
1.417ValTyr: 1.417 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
1.397TrpAla: 1.397 ± 0.037
0.134TrpCys: 0.134 ± 0.009
0.751TrpAsp: 0.751 ± 0.025
0.591TrpGlu: 0.591 ± 0.022
0.567TrpPhe: 0.567 ± 0.02
0.978TrpGly: 0.978 ± 0.025
0.322TrpHis: 0.322 ± 0.016
0.806TrpIle: 0.806 ± 0.022
0.409TrpLys: 0.409 ± 0.018
1.685TrpLeu: 1.685 ± 0.035
0.384TrpMet: 0.384 ± 0.017
0.43TrpAsn: 0.43 ± 0.018
0.733TrpPro: 0.733 ± 0.024
0.617TrpGln: 0.617 ± 0.022
1.184TrpArg: 1.184 ± 0.026
0.847TrpSer: 0.847 ± 0.021
0.789TrpThr: 0.789 ± 0.024
0.912TrpVal: 0.912 ± 0.026
0.237TrpTrp: 0.237 ± 0.013
0.261TrpTyr: 0.261 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.263TyrAla: 2.263 ± 0.041
0.248TyrCys: 0.248 ± 0.014
1.531TyrAsp: 1.531 ± 0.035
1.08TyrGlu: 1.08 ± 0.028
0.866TyrPhe: 0.866 ± 0.026
1.98TyrGly: 1.98 ± 0.038
0.493TyrHis: 0.493 ± 0.018
0.917TyrIle: 0.917 ± 0.025
0.463TyrLys: 0.463 ± 0.017
2.147TyrLeu: 2.147 ± 0.039
0.421TyrMet: 0.421 ± 0.016
0.539TyrAsn: 0.539 ± 0.022
1.045TyrPro: 1.045 ± 0.028
0.585TyrGln: 0.585 ± 0.019
1.521TyrArg: 1.521 ± 0.033
1.103TyrSer: 1.103 ± 0.028
1.012TyrThr: 1.012 ± 0.026
1.436TyrVal: 1.436 ± 0.033
0.34TyrTrp: 0.34 ± 0.015
0.502TyrTyr: 0.502 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4789 proteins (1539245 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
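A protein-level bootstrap can be sketched as follows. This is a minimal illustration under stated assumptions rather than the procedure actually used for the table: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is assumed to be the standard deviation across replicates. The function names, the `seed` parameter, and the toy sequences are hypothetical.

```python
import random
from statistics import mean, stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    hits = total = 0
    for seq in proteins:
        # Overlapping windows of length 2 within each protein (assumption).
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Each replicate draws as many proteins as the original set contains.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return mean(estimates), stdev(estimates)


toy = ["MAAL", "AAG", "GAAK", "LLAA"]  # made-up sequences, one-letter codes
avg, err = bootstrap_error(toy, "AA")
print(f"AA: {avg:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps each protein's residues together, so the spread reflects variation between proteins.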

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski