Amino acid dipeptide frequency for Corynebacterium geronticis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
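As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the one-letter input sequences are illustrative assumptions; the exact counting procedure used for the table is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: assumes one-letter codes and overlapping windows.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome described here.
freqs = dipeptide_frequencies(["MAAL", "AAG"])
print(freqs["AA"])  # 2 of 5 pairs are "AA", i.e. 400.0 per mille
```

Dividing by the total number of pairs makes all frequencies sum to 1000‰, which matches how the values in the table are scaled.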

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
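The uniform expectation and the per mille scaling described above can be sketched as follows. The function names are hypothetical, and the simple greater-than/less-than comparison is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(value_permille, n_residue_types=20):
    """Assumed rule: compare the observed value with the uniform expectation."""
    expected = uniform_expectation_permille(n_residue_types)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"

print(uniform_expectation_permille())  # 2.5 (i.e. 0.25%)
print(classify(13.949))                # AlaAla value from the table: "over"
print(classify(0.118))                 # CysCys value from the table: "under"
```

Passing `n_residue_types=21` gives the expectation for the 441-dipeptide case that includes Xaa.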

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue; within each group the second residue follows the column order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 13.949 ± 0.185
AlaCys: 1.039 ± 0.046
AlaAsp: 6.068 ± 0.094
AlaGlu: 7.799 ± 0.116
AlaPhe: 3.8 ± 0.082
AlaGly: 9.371 ± 0.122
AlaHis: 2.501 ± 0.062
AlaIle: 5.945 ± 0.11
AlaLys: 4.463 ± 0.095
AlaLeu: 12.179 ± 0.15
AlaMet: 3.15 ± 0.069
AlaAsn: 3.093 ± 0.066
AlaPro: 4.962 ± 0.099
AlaGln: 5.217 ± 0.099
AlaArg: 6.718 ± 0.127
AlaSer: 6.326 ± 0.107
AlaThr: 6.408 ± 0.098
AlaVal: 9.199 ± 0.126
AlaTrp: 1.593 ± 0.052
AlaTyr: 2.266 ± 0.062
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.938 ± 0.037
CysCys: 0.118 ± 0.014
CysAsp: 0.485 ± 0.025
CysGlu: 0.506 ± 0.031
CysPhe: 0.301 ± 0.022
CysGly: 0.868 ± 0.043
CysHis: 0.197 ± 0.019
CysIle: 0.383 ± 0.021
CysLys: 0.206 ± 0.018
CysLeu: 0.611 ± 0.031
CysMet: 0.164 ± 0.016
CysAsn: 0.197 ± 0.017
CysPro: 0.409 ± 0.027
CysGln: 0.26 ± 0.022
CysArg: 0.403 ± 0.027
CysSer: 0.606 ± 0.033
CysThr: 0.498 ± 0.03
CysVal: 0.688 ± 0.034
CysTrp: 0.114 ± 0.012
CysTyr: 0.133 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.285 ± 0.107
AspCys: 0.393 ± 0.026
AspAsp: 2.964 ± 0.079
AspGlu: 3.952 ± 0.093
AspPhe: 2.006 ± 0.053
AspGly: 4.424 ± 0.091
AspHis: 1.382 ± 0.047
AspIle: 2.754 ± 0.07
AspLys: 1.538 ± 0.053
AspLeu: 5.327 ± 0.082
AspMet: 1.074 ± 0.04
AspAsn: 1.408 ± 0.053
AspPro: 3.654 ± 0.072
AspGln: 2.003 ± 0.052
AspArg: 3.141 ± 0.058
AspSer: 2.634 ± 0.067
AspThr: 2.827 ± 0.072
AspVal: 4.762 ± 0.088
AspTrp: 0.731 ± 0.036
AspTyr: 1.417 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.815 ± 0.124
GluCys: 0.416 ± 0.028
GluAsp: 3.737 ± 0.09
GluGlu: 4.48 ± 0.091
GluPhe: 2.032 ± 0.054
GluGly: 4.789 ± 0.084
GluHis: 2.193 ± 0.054
GluIle: 3.23 ± 0.073
GluLys: 2.406 ± 0.065
GluLeu: 7.079 ± 0.111
GluMet: 1.344 ± 0.046
GluAsn: 1.759 ± 0.044
GluPro: 2.824 ± 0.083
GluGln: 3.699 ± 0.078
GluArg: 4.439 ± 0.098
GluSer: 3.204 ± 0.067
GluThr: 2.796 ± 0.057
GluVal: 5.429 ± 0.101
GluTrp: 0.725 ± 0.032
GluTyr: 1.465 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.127 ± 0.079
PheCys: 0.305 ± 0.024
PheAsp: 2.466 ± 0.063
PheGlu: 2.174 ± 0.063
PhePhe: 1.337 ± 0.053
PheGly: 3.679 ± 0.087
PheHis: 0.761 ± 0.038
PheIle: 1.671 ± 0.055
PheLys: 0.891 ± 0.035
PheLeu: 3.036 ± 0.072
PheMet: 0.684 ± 0.031
PheAsn: 0.999 ± 0.042
PhePro: 1.493 ± 0.051
PheGln: 1.125 ± 0.04
PheArg: 1.688 ± 0.055
PheSer: 2.275 ± 0.062
PheThr: 1.723 ± 0.05
PheVal: 2.628 ± 0.066
PheTrp: 0.485 ± 0.029
PheTyr: 0.758 ± 0.034
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.889 ± 0.113
GlyCys: 0.697 ± 0.034
GlyAsp: 4.154 ± 0.081
GlyGlu: 5.27 ± 0.096
GlyPhe: 3.181 ± 0.072
GlyGly: 6.398 ± 0.127
GlyHis: 1.828 ± 0.053
GlyIle: 4.624 ± 0.088
GlyLys: 3.318 ± 0.072
GlyLeu: 7.749 ± 0.117
GlyMet: 2.311 ± 0.063
GlyAsn: 2.332 ± 0.064
GlyPro: 2.953 ± 0.064
GlyGln: 2.862 ± 0.065
GlyArg: 4.463 ± 0.091
GlySer: 4.814 ± 0.09
GlyThr: 4.817 ± 0.09
GlyVal: 7.333 ± 0.105
GlyTrp: 1.297 ± 0.044
GlyTyr: 2.045 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.52 ± 0.065
HisCys: 0.238 ± 0.017
HisAsp: 1.305 ± 0.041
HisGlu: 1.427 ± 0.048
HisPhe: 0.849 ± 0.042
HisGly: 1.876 ± 0.061
HisHis: 0.706 ± 0.032
HisIle: 1.141 ± 0.044
HisLys: 0.606 ± 0.028
HisLeu: 2.2 ± 0.064
HisMet: 0.58 ± 0.03
HisAsn: 0.671 ± 0.033
HisPro: 1.553 ± 0.047
HisGln: 0.881 ± 0.037
HisArg: 1.591 ± 0.056
HisSer: 1.283 ± 0.046
HisThr: 1.363 ± 0.05
HisVal: 1.717 ± 0.056
HisTrp: 0.349 ± 0.026
HisTyr: 0.567 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.114 ± 0.117
IleCys: 0.479 ± 0.026
IleAsp: 3.632 ± 0.073
IleGlu: 3.498 ± 0.073
IlePhe: 1.734 ± 0.056
IleGly: 4.563 ± 0.096
IleHis: 1.052 ± 0.045
IleIle: 2.476 ± 0.066
IleLys: 1.446 ± 0.05
IleLeu: 3.968 ± 0.089
IleMet: 0.92 ± 0.038
IleAsn: 1.574 ± 0.05
IlePro: 2.661 ± 0.067
IleGln: 1.463 ± 0.051
IleArg: 2.931 ± 0.062
IleSer: 2.929 ± 0.064
IleThr: 3.1 ± 0.069
IleVal: 4.151 ± 0.083
IleTrp: 0.495 ± 0.03
IleTyr: 0.958 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.135 ± 0.086
LysCys: 0.156 ± 0.017
LysAsp: 1.895 ± 0.056
LysGlu: 2.237 ± 0.062
LysPhe: 0.934 ± 0.036
LysGly: 2.411 ± 0.056
LysHis: 0.844 ± 0.033
LysIle: 1.815 ± 0.051
LysLys: 1.723 ± 0.063
LysLeu: 3.406 ± 0.078
LysMet: 0.785 ± 0.031
LysAsn: 1.126 ± 0.041
LysPro: 1.92 ± 0.068
LysGln: 1.576 ± 0.05
LysArg: 2.421 ± 0.063
LysSer: 1.74 ± 0.046
LysThr: 1.812 ± 0.054
LysVal: 2.951 ± 0.076
LysTrp: 0.38 ± 0.026
LysTyr: 0.672 ± 0.034
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.897 ± 0.153
LeuCys: 0.88 ± 0.04
LeuAsp: 5.872 ± 0.104
LeuGlu: 6.005 ± 0.118
LeuPhe: 3.09 ± 0.077
LeuGly: 8.744 ± 0.138
LeuHis: 2.259 ± 0.063
LeuIle: 5.025 ± 0.1
LeuLys: 3.192 ± 0.069
LeuLeu: 10.01 ± 0.155
LeuMet: 2.162 ± 0.063
LeuAsn: 2.81 ± 0.067
LeuPro: 5.086 ± 0.087
LeuGln: 3.536 ± 0.075
LeuArg: 6.573 ± 0.105
LeuSer: 6.341 ± 0.093
LeuThr: 4.642 ± 0.093
LeuVal: 7.638 ± 0.117
LeuTrp: 1.264 ± 0.047
LeuTyr: 1.734 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.644 ± 0.062
MetCys: 0.184 ± 0.015
MetAsp: 1.083 ± 0.039
MetGlu: 1.125 ± 0.041
MetPhe: 0.836 ± 0.038
MetGly: 1.775 ± 0.051
MetHis: 0.574 ± 0.03
MetIle: 1.235 ± 0.038
MetLys: 0.916 ± 0.041
MetLeu: 2.63 ± 0.059
MetMet: 0.519 ± 0.033
MetAsn: 0.817 ± 0.036
MetPro: 1.23 ± 0.046
MetGln: 0.916 ± 0.031
MetArg: 1.557 ± 0.051
MetSer: 1.746 ± 0.053
MetThr: 1.328 ± 0.047
MetVal: 1.816 ± 0.046
MetTrp: 0.307 ± 0.021
MetTyr: 0.374 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.311 ± 0.066
AsnCys: 0.199 ± 0.016
AsnAsp: 1.571 ± 0.052
AsnGlu: 1.707 ± 0.055
AsnPhe: 1.001 ± 0.032
AsnGly: 2.143 ± 0.062
AsnHis: 0.584 ± 0.029
AsnIle: 1.547 ± 0.047
AsnLys: 0.999 ± 0.035
AsnLeu: 2.77 ± 0.057
AsnMet: 0.577 ± 0.032
AsnAsn: 0.986 ± 0.042
AsnPro: 2.143 ± 0.054
AsnGln: 1.004 ± 0.041
AsnArg: 1.648 ± 0.058
AsnSer: 1.483 ± 0.053
AsnThr: 1.724 ± 0.056
AsnVal: 2.338 ± 0.061
AsnTrp: 0.383 ± 0.024
AsnTyr: 0.681 ± 0.033
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.277 ± 0.115
ProCys: 0.267 ± 0.022
ProAsp: 2.859 ± 0.068
ProGlu: 4.566 ± 0.091
ProPhe: 1.642 ± 0.05
ProGly: 4.215 ± 0.089
ProHis: 1.106 ± 0.042
ProIle: 2.197 ± 0.061
ProLys: 2.088 ± 0.06
ProLeu: 4.446 ± 0.082
ProMet: 1.124 ± 0.041
ProAsn: 1.636 ± 0.05
ProPro: 1.756 ± 0.062
ProGln: 1.952 ± 0.056
ProArg: 2.601 ± 0.063
ProSer: 2.916 ± 0.072
ProThr: 2.953 ± 0.071
ProVal: 4.078 ± 0.074
ProTrp: 0.763 ± 0.035
ProTyr: 0.986 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.87 ± 0.104
GlnCys: 0.248 ± 0.02
GlnAsp: 1.768 ± 0.053
GlnGlu: 2.383 ± 0.062
GlnPhe: 1.115 ± 0.034
GlnGly: 2.826 ± 0.068
GlnHis: 1.105 ± 0.045
GlnIle: 1.873 ± 0.054
GlnLys: 1.286 ± 0.05
GlnLeu: 4.07 ± 0.102
GlnMet: 0.947 ± 0.04
GlnAsn: 0.865 ± 0.036
GlnPro: 2.016 ± 0.061
GlnGln: 2.322 ± 0.072
GlnArg: 3.482 ± 0.076
GlnSer: 2.009 ± 0.056
GlnThr: 1.727 ± 0.056
GlnVal: 2.897 ± 0.068
GlnTrp: 0.71 ± 0.036
GlnTyr: 0.774 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.459 ± 0.107
ArgCys: 0.508 ± 0.025
ArgAsp: 3.398 ± 0.067
ArgGlu: 4.627 ± 0.094
ArgPhe: 2.234 ± 0.056
ArgGly: 4.586 ± 0.091
ArgHis: 1.427 ± 0.045
ArgIle: 3.257 ± 0.078
ArgLys: 2.389 ± 0.066
ArgLeu: 5.432 ± 0.096
ArgMet: 1.674 ± 0.052
ArgAsn: 1.917 ± 0.059
ArgPro: 2.792 ± 0.064
ArgGln: 2.288 ± 0.064
ArgArg: 4.614 ± 0.104
ArgSer: 3.556 ± 0.074
ArgThr: 3.26 ± 0.068
ArgVal: 4.412 ± 0.086
ArgTrp: 0.985 ± 0.032
ArgTyr: 1.593 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.348 ± 0.103
SerCys: 0.415 ± 0.023
SerAsp: 3.089 ± 0.07
SerGlu: 3.585 ± 0.085
SerPhe: 2.082 ± 0.054
SerGly: 5.083 ± 0.093
SerHis: 1.113 ± 0.042
SerIle: 2.905 ± 0.07
SerLys: 2.322 ± 0.064
SerLeu: 5.47 ± 0.109
SerMet: 1.642 ± 0.051
SerAsn: 1.851 ± 0.055
SerPro: 2.655 ± 0.065
SerGln: 2.158 ± 0.057
SerArg: 3.132 ± 0.066
SerSer: 3.612 ± 0.094
SerThr: 3.604 ± 0.071
SerVal: 4.446 ± 0.096
SerTrp: 0.824 ± 0.035
SerTyr: 1.23 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.515 ± 0.092
ThrCys: 0.451 ± 0.026
ThrAsp: 2.57 ± 0.071
ThrGlu: 2.918 ± 0.065
ThrPhe: 1.948 ± 0.059
ThrGly: 4.485 ± 0.085
ThrHis: 1.204 ± 0.043
ThrIle: 2.907 ± 0.066
ThrLys: 1.93 ± 0.059
ThrLeu: 5.691 ± 0.094
ThrMet: 1.351 ± 0.051
ThrAsn: 1.482 ± 0.046
ThrPro: 3.508 ± 0.069
ThrGln: 1.92 ± 0.064
ThrArg: 2.845 ± 0.072
ThrSer: 3.328 ± 0.078
ThrThr: 3.251 ± 0.075
ThrVal: 4.633 ± 0.092
ThrTrp: 0.827 ± 0.032
ThrTyr: 1.252 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.428 ± 0.126
ValCys: 0.747 ± 0.033
ValAsp: 4.949 ± 0.09
ValGlu: 5.476 ± 0.124
ValPhe: 2.747 ± 0.067
ValGly: 6.188 ± 0.093
ValHis: 1.818 ± 0.054
ValIle: 4.379 ± 0.084
ValLys: 2.238 ± 0.072
ValLeu: 8.847 ± 0.135
ValMet: 1.847 ± 0.055
ValAsn: 2.184 ± 0.056
ValPro: 4.062 ± 0.077
ValGln: 2.631 ± 0.063
ValArg: 4.706 ± 0.084
ValSer: 4.687 ± 0.084
ValThr: 4.351 ± 0.09
ValVal: 7.367 ± 0.114
ValTrp: 0.97 ± 0.038
ValTyr: 1.533 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.366 ± 0.045
TrpCys: 0.142 ± 0.014
TrpAsp: 0.622 ± 0.032
TrpGlu: 0.763 ± 0.033
TrpPhe: 0.583 ± 0.033
TrpGly: 0.957 ± 0.037
TrpHis: 0.354 ± 0.021
TrpIle: 0.75 ± 0.036
TrpLys: 0.462 ± 0.028
TrpLeu: 1.644 ± 0.054
TrpMet: 0.399 ± 0.023
TrpAsn: 0.463 ± 0.027
TrpPro: 0.612 ± 0.033
TrpGln: 0.599 ± 0.028
TrpArg: 0.989 ± 0.04
TrpSer: 0.728 ± 0.034
TrpThr: 0.66 ± 0.033
TrpVal: 1.083 ± 0.044
TrpTrp: 0.345 ± 0.027
TrpTyr: 0.32 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.158 ± 0.059
TyrCys: 0.202 ± 0.017
TyrAsp: 1.191 ± 0.042
TyrGlu: 1.368 ± 0.042
TyrPhe: 0.856 ± 0.04
TyrGly: 1.955 ± 0.054
TyrHis: 0.415 ± 0.026
TyrIle: 1.031 ± 0.039
TyrLys: 0.517 ± 0.029
TyrLeu: 2.194 ± 0.063
TyrMet: 0.356 ± 0.021
TyrAsn: 0.628 ± 0.031
TyrPro: 1.14 ± 0.041
TyrGln: 0.88 ± 0.035
TyrArg: 1.467 ± 0.048
TyrSer: 1.35 ± 0.044
TyrThr: 1.172 ± 0.042
TyrVal: 1.587 ± 0.048
TyrTrp: 0.294 ± 0.02
TyrTyr: 0.542 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2094 proteins (684442 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
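A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function names are hypothetical, and using the sample standard deviation as the error measure is an assumption; the note above does not specify which statistic is reported.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide (overlapping windows, one-letter codes)."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Assumed error estimate: standard deviation over bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not taken from the proteome described here.
toy = ["MAAL", "AAG", "GGK", "LLAA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps pairs from the same sequence together, so the estimate reflects variation between proteins.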

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski