Amino acid dipeptide frequency for Ichthyobacterium seriolicida

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
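The uniform expectation described above can be worked out in a few lines. This is only a sketch: the names `expected_400`, `expected_441` and `classify` are hypothetical, and it is an assumption that the red/blue marking simply compares each value against the 400-dipeptide baseline.

```python
# Uniform expectation for a dipeptide, expressed in per mille (‰).
expected_400 = 1000 / 20**2   # 20 standard amino acids -> 2.5 ‰ (0.25%)
expected_441 = 1000 / 21**2   # with Xaa as a 21st symbol -> about 2.27 ‰

def classify(freq_permille, expected=expected_400):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison; the exact
    rule used for colouring the table is not stated on the page.
    """
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

print(classify(8.662))  # LysLys value from the table
print(classify(0.062))  # TrpTrp value from the table
```

With the 2.5‰ baseline, LysLys (8.662‰) falls above the expectation and TrpTrp (0.062‰) falls below it.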

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
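As an illustration of how such per mille values could be obtained, here is a minimal sketch that counts dipeptides in a list of protein sequences given in one-letter code. The function name `dipeptide_permille` is hypothetical, and several choices are assumptions rather than the database's documented method: overlapping windows of two residues, no dipeptides spanning two proteins, and any non-standard letter mapped to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for the given sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, never crossing a protein boundary.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

# Toy input, not real proteome data.
freqs = dipeptide_permille(["MKKLA", "AKKU"])
fraction_kk = freqs["KK"] * 1e-3  # per mille back to a plain fraction
```

In this toy input there are seven dipeptides in total, two of which are KK, so `freqs["KK"]` is 2000/7 ‰ and `fraction_kk` is 2/7.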

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.175 ± 0.389 | 0.422 ± 0.029 | 3.074 ± 0.127 | 3.252 ± 0.163 | 2.136 ± 0.07 | 3.254 ± 0.16 | 0.795 ± 0.036 | 4.446 ± 0.118 | 4.748 ± 0.24 | 4.406 ± 0.113 | 0.93 ± 0.049 | 3.493 ± 0.102 | 1.581 ± 0.062 | 2.454 ± 0.134 | 1.59 ± 0.056 | 4.138 ± 0.125 | 4.231 ± 0.228 | 3.425 ± 0.111 | 0.297 ± 0.027 | 1.615 ± 0.057 | 0.002 ± 0.002
Cys | 0.415 ± 0.028 | 0.102 ± 0.015 | 0.608 ± 0.034 | 0.611 ± 0.033 | 0.438 ± 0.03 | 0.484 ± 0.031 | 0.138 ± 0.019 | 0.673 ± 0.031 | 0.54 ± 0.032 | 0.573 ± 0.038 | 0.158 ± 0.019 | 0.446 ± 0.03 | 0.229 ± 0.021 | 0.187 ± 0.018 | 0.218 ± 0.024 | 0.709 ± 0.039 | 0.411 ± 0.028 | 0.52 ± 0.035 | 0.058 ± 0.012 | 0.3 ± 0.026 | 0.002 ± 0.002
Asp | 3.083 ± 0.145 | 0.44 ± 0.031 | 3.038 ± 0.076 | 3.216 ± 0.094 | 3.14 ± 0.09 | 3.962 ± 0.12 | 0.926 ± 0.041 | 6.814 ± 0.132 | 5.737 ± 0.13 | 5.226 ± 0.097 | 1.224 ± 0.055 | 4.135 ± 0.123 | 1.59 ± 0.055 | 1.444 ± 0.078 | 2.181 ± 0.074 | 4.593 ± 0.107 | 3.163 ± 0.087 | 3.749 ± 0.094 | 0.564 ± 0.034 | 2.527 ± 0.072 | 0.0 ± 0.0
Glu | 3.536 ± 0.196 | 0.38 ± 0.026 | 3.74 ± 0.1 | 4.078 ± 0.119 | 2.532 ± 0.071 | 3.878 ± 0.09 | 0.811 ± 0.044 | 6.188 ± 0.125 | 6.596 ± 0.13 | 5.603 ± 0.118 | 1.322 ± 0.05 | 4.89 ± 0.093 | 1.184 ± 0.049 | 1.703 ± 0.069 | 2.168 ± 0.084 | 4.273 ± 0.09 | 2.641 ± 0.085 | 4.155 ± 0.086 | 0.4 ± 0.031 | 2.605 ± 0.084 | 0.0 ± 0.0
Phe | 1.694 ± 0.055 | 0.548 ± 0.032 | 2.619 ± 0.071 | 2.734 ± 0.078 | 2.141 ± 0.065 | 2.507 ± 0.078 | 0.591 ± 0.034 | 3.98 ± 0.098 | 4.484 ± 0.099 | 4.042 ± 0.112 | 0.96 ± 0.045 | 2.972 ± 0.093 | 1.492 ± 0.047 | 0.94 ± 0.045 | 1.528 ± 0.062 | 4.96 ± 0.128 | 3.071 ± 0.115 | 2.434 ± 0.066 | 0.335 ± 0.024 | 1.955 ± 0.064 | 0.004 ± 0.002
Gly | 3.967 ± 0.152 | 0.526 ± 0.035 | 4.253 ± 0.119 | 4.202 ± 0.169 | 2.739 ± 0.085 | 3.829 ± 0.103 | 1.004 ± 0.042 | 5.594 ± 0.108 | 5.714 ± 0.149 | 5.011 ± 0.101 | 1.468 ± 0.06 | 3.785 ± 0.129 | 2.19 ± 0.303 | 1.559 ± 0.062 | 2.108 ± 0.078 | 4.706 ± 0.122 | 4.087 ± 0.134 | 4.209 ± 0.107 | 0.464 ± 0.032 | 2.483 ± 0.077 | 0.0 ± 0.0
His | 0.64 ± 0.036 | 0.151 ± 0.016 | 0.862 ± 0.041 | 0.733 ± 0.036 | 0.917 ± 0.039 | 0.775 ± 0.036 | 0.236 ± 0.02 | 1.446 ± 0.058 | 1.204 ± 0.046 | 1.373 ± 0.054 | 0.32 ± 0.024 | 0.982 ± 0.041 | 0.678 ± 0.038 | 0.457 ± 0.031 | 0.691 ± 0.038 | 1.335 ± 0.045 | 0.899 ± 0.044 | 0.697 ± 0.038 | 0.115 ± 0.016 | 0.726 ± 0.032 | 0.002 ± 0.002
Ile | 4.491 ± 0.102 | 0.722 ± 0.037 | 6.126 ± 0.141 | 6.214 ± 0.131 | 3.807 ± 0.089 | 5.75 ± 0.194 | 1.286 ± 0.041 | 7.069 ± 0.123 | 7.616 ± 0.135 | 6.869 ± 0.169 | 1.433 ± 0.056 | 5.084 ± 0.101 | 3.225 ± 0.082 | 1.883 ± 0.064 | 2.932 ± 0.085 | 9.392 ± 0.153 | 6.672 ± 0.179 | 4.888 ± 0.1 | 0.524 ± 0.032 | 2.976 ± 0.078 | 0.0 ± 0.0
Lys | 4.884 ± 0.202 | 0.487 ± 0.029 | 5.284 ± 0.11 | 6.616 ± 0.14 | 3.332 ± 0.099 | 5.812 ± 0.153 | 1.319 ± 0.059 | 8.162 ± 0.13 | 8.662 ± 0.157 | 6.85 ± 0.146 | 1.832 ± 0.058 | 6.699 ± 0.129 | 2.148 ± 0.061 | 2.199 ± 0.072 | 3.44 ± 0.088 | 6.312 ± 0.113 | 5.297 ± 0.113 | 6.13 ± 0.118 | 0.846 ± 0.042 | 3.562 ± 0.105 | 0.002 ± 0.002
Leu | 3.678 ± 0.091 | 0.695 ± 0.039 | 5.175 ± 0.116 | 5.53 ± 0.126 | 3.825 ± 0.104 | 4.975 ± 0.107 | 1.281 ± 0.052 | 5.821 ± 0.149 | 8.398 ± 0.121 | 6.823 ± 0.147 | 1.504 ± 0.06 | 5.346 ± 0.143 | 2.712 ± 0.072 | 1.804 ± 0.064 | 2.845 ± 0.078 | 8.446 ± 0.18 | 5.259 ± 0.153 | 4.073 ± 0.103 | 0.422 ± 0.03 | 2.778 ± 0.069 | 0.002 ± 0.002
Met | 1.022 ± 0.055 | 0.231 ± 0.018 | 0.93 ± 0.047 | 1.137 ± 0.052 | 1.33 ± 0.074 | 1.373 ± 0.056 | 0.286 ± 0.022 | 1.675 ± 0.068 | 1.786 ± 0.062 | 1.623 ± 0.058 | 0.44 ± 0.027 | 1.101 ± 0.046 | 0.7 ± 0.045 | 0.411 ± 0.029 | 0.884 ± 0.039 | 1.73 ± 0.069 | 1.03 ± 0.045 | 1.013 ± 0.048 | 0.126 ± 0.016 | 0.684 ± 0.04 | 0.004 ± 0.003
Asn | 3.256 ± 0.115 | 0.417 ± 0.029 | 3.191 ± 0.085 | 3.196 ± 0.069 | 2.881 ± 0.075 | 3.765 ± 0.092 | 0.97 ± 0.049 | 6.661 ± 0.143 | 5.199 ± 0.107 | 5.133 ± 0.102 | 1.599 ± 0.061 | 4.367 ± 0.131 | 2.441 ± 0.07 | 1.601 ± 0.071 | 2.241 ± 0.067 | 5.515 ± 0.138 | 5.124 ± 0.167 | 3.789 ± 0.094 | 0.595 ± 0.035 | 2.43 ± 0.083 | 0.0 ± 0.0
Pro | 2.063 ± 0.118 | 0.251 ± 0.023 | 1.826 ± 0.053 | 1.841 ± 0.061 | 1.484 ± 0.055 | 1.752 ± 0.121 | 0.675 ± 0.034 | 2.539 ± 0.07 | 2.385 ± 0.065 | 2.19 ± 0.071 | 0.58 ± 0.038 | 2.141 ± 0.059 | 0.784 ± 0.05 | 1.086 ± 0.07 | 0.806 ± 0.04 | 2.827 ± 0.096 | 2.434 ± 0.106 | 2.025 ± 0.068 | 0.222 ± 0.019 | 1.27 ± 0.048 | 0.0 ± 0.0
Gln | 1.604 ± 0.13 | 0.146 ± 0.017 | 1.601 ± 0.069 | 1.65 ± 0.061 | 0.893 ± 0.044 | 2.428 ± 0.216 | 0.426 ± 0.023 | 2.343 ± 0.073 | 2.292 ± 0.093 | 2.105 ± 0.073 | 0.613 ± 0.037 | 1.694 ± 0.058 | 0.548 ± 0.034 | 0.842 ± 0.047 | 0.995 ± 0.045 | 1.555 ± 0.063 | 1.472 ± 0.084 | 1.288 ± 0.049 | 0.182 ± 0.019 | 0.826 ± 0.039 | 0.0 ± 0.0
Arg | 1.768 ± 0.062 | 0.206 ± 0.019 | 2.507 ± 0.077 | 2.659 ± 0.078 | 1.464 ± 0.05 | 2.523 ± 0.114 | 0.564 ± 0.034 | 3.24 ± 0.08 | 2.785 ± 0.072 | 2.632 ± 0.089 | 0.74 ± 0.036 | 1.948 ± 0.063 | 0.817 ± 0.043 | 0.717 ± 0.043 | 1.11 ± 0.058 | 2.196 ± 0.066 | 1.352 ± 0.054 | 2.399 ± 0.066 | 0.247 ± 0.024 | 1.33 ± 0.051 | 0.0 ± 0.0
Ser | 4.768 ± 0.163 | 0.895 ± 0.041 | 4.979 ± 0.113 | 4.942 ± 0.091 | 4.944 ± 0.118 | 5.472 ± 0.141 | 1.441 ± 0.055 | 7.649 ± 0.137 | 7.172 ± 0.129 | 6.865 ± 0.131 | 1.779 ± 0.065 | 5.537 ± 0.146 | 2.628 ± 0.087 | 2.023 ± 0.062 | 2.403 ± 0.075 | 7.42 ± 0.186 | 5.001 ± 0.133 | 5.217 ± 0.109 | 0.651 ± 0.044 | 2.783 ± 0.087 | 0.002 ± 0.002
Thr | 4.542 ± 0.306 | 0.349 ± 0.026 | 4.218 ± 0.117 | 3.707 ± 0.095 | 2.321 ± 0.062 | 5.246 ± 0.184 | 0.953 ± 0.039 | 5.917 ± 0.196 | 5.201 ± 0.134 | 4.709 ± 0.1 | 0.744 ± 0.037 | 3.362 ± 0.104 | 2.792 ± 0.116 | 1.584 ± 0.063 | 1.615 ± 0.052 | 5.266 ± 0.155 | 4.127 ± 0.159 | 5.228 ± 0.24 | 0.22 ± 0.02 | 2.327 ± 0.082 | 0.0 ± 0.0
Val | 2.93 ± 0.083 | 0.537 ± 0.031 | 3.887 ± 0.104 | 3.727 ± 0.085 | 3.3 ± 0.088 | 3.5 ± 0.104 | 0.837 ± 0.035 | 5.304 ± 0.105 | 5.359 ± 0.091 | 5.737 ± 0.153 | 0.913 ± 0.048 | 3.292 ± 0.082 | 2.212 ± 0.079 | 1.246 ± 0.055 | 1.75 ± 0.057 | 5.204 ± 0.122 | 4.709 ± 0.199 | 3.933 ± 0.104 | 0.327 ± 0.026 | 2.745 ± 0.074 | 0.002 ± 0.002
Trp | 0.318 ± 0.028 | 0.036 ± 0.008 | 0.5 ± 0.036 | 0.46 ± 0.033 | 0.307 ± 0.028 | 0.535 ± 0.036 | 0.184 ± 0.019 | 0.52 ± 0.03 | 0.555 ± 0.036 | 0.631 ± 0.04 | 0.207 ± 0.022 | 0.591 ± 0.054 | 0.109 ± 0.015 | 0.184 ± 0.017 | 0.226 ± 0.022 | 0.575 ± 0.037 | 0.335 ± 0.025 | 0.377 ± 0.027 | 0.062 ± 0.012 | 0.249 ± 0.024 | 0.002 ± 0.002
Tyr | 1.624 ± 0.05 | 0.282 ± 0.026 | 2.332 ± 0.078 | 2.177 ± 0.065 | 2.057 ± 0.064 | 2.077 ± 0.073 | 0.518 ± 0.035 | 2.918 ± 0.081 | 3.442 ± 0.07 | 2.99 ± 0.092 | 0.744 ± 0.037 | 2.503 ± 0.073 | 1.119 ± 0.049 | 1.175 ± 0.044 | 1.472 ± 0.057 | 3.412 ± 0.083 | 3.114 ± 0.146 | 1.903 ± 0.068 | 0.307 ± 0.026 | 1.548 ± 0.07 | 0.0 ± 0.0
Xaa | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.004 ± 0.002 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.005 ± 0.004 | 0.002 ± 0.002 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002
Statistics based on 1467 proteins (549746 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
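A protein-level bootstrap of this kind might look roughly like the sketch below. It resamples whole proteins with replacement, recomputes the dipeptide frequency for each replicate, and reports the spread. The names `pair_counts` and `bootstrap_dipeptide_error` are hypothetical, and taking the sample standard deviation over 100 replicates as "the error" is an assumption; the page does not say which statistic is reported.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_dipeptide_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(sequences) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing keys
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input, not real proteome data.
mean_kk, err_kk = bootstrap_dipeptide_error(["MKKLA", "AKKU", "MAAAK"], "KK")
```

Resampling at the protein level keeps dipeptides from the same protein together, so a few long or compositionally unusual proteins show up as a larger spread.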

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski