Amino acid dipepetide frequency for Citrobacter phage vB_CfrM_CfP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.65AlaAla: 4.65 ± 0.338
0.55AlaCys: 0.55 ± 0.09
4.011AlaAsp: 4.011 ± 0.308
4.863AlaGlu: 4.863 ± 0.302
2.556AlaPhe: 2.556 ± 0.225
4.703AlaGly: 4.703 ± 0.442
1.225AlaHis: 1.225 ± 0.152
4.81AlaIle: 4.81 ± 0.333
5.076AlaLys: 5.076 ± 0.319
6.176AlaLeu: 6.176 ± 0.371
2.662AlaMet: 2.662 ± 0.226
3.816AlaAsn: 3.816 ± 0.258
2.272AlaPro: 2.272 ± 0.215
2.201AlaGln: 2.201 ± 0.196
3.124AlaArg: 3.124 ± 0.282
3.549AlaSer: 3.549 ± 0.255
4.366AlaThr: 4.366 ± 0.523
5.182AlaVal: 5.182 ± 0.289
1.1AlaTrp: 1.1 ± 0.142
3.053AlaTyr: 3.053 ± 0.223
0.0AlaXaa: 0.0 ± 0.0
Cys
1.012CysAla: 1.012 ± 0.131
0.142CysCys: 0.142 ± 0.043
0.745CysAsp: 0.745 ± 0.125
0.763CysGlu: 0.763 ± 0.129
0.674CysPhe: 0.674 ± 0.112
0.994CysGly: 0.994 ± 0.16
0.248CysHis: 0.248 ± 0.061
0.674CysIle: 0.674 ± 0.121
0.905CysLys: 0.905 ± 0.131
0.657CysLeu: 0.657 ± 0.108
0.426CysMet: 0.426 ± 0.085
0.586CysAsn: 0.586 ± 0.09
0.515CysPro: 0.515 ± 0.125
0.266CysGln: 0.266 ± 0.074
0.426CysArg: 0.426 ± 0.098
0.621CysSer: 0.621 ± 0.087
0.497CysThr: 0.497 ± 0.093
0.852CysVal: 0.852 ± 0.136
0.142CysTrp: 0.142 ± 0.048
0.444CysTyr: 0.444 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
4.756AspAla: 4.756 ± 0.279
0.657AspCys: 0.657 ± 0.111
4.472AspAsp: 4.472 ± 0.283
4.597AspGlu: 4.597 ± 0.282
2.893AspPhe: 2.893 ± 0.268
4.685AspGly: 4.685 ± 0.34
1.136AspHis: 1.136 ± 0.154
4.845AspIle: 4.845 ± 0.256
4.206AspLys: 4.206 ± 0.35
5.573AspLeu: 5.573 ± 0.308
1.722AspMet: 1.722 ± 0.182
3.283AspAsn: 3.283 ± 0.225
2.804AspPro: 2.804 ± 0.205
1.686AspGln: 1.686 ± 0.179
2.911AspArg: 2.911 ± 0.224
4.064AspSer: 4.064 ± 0.254
3.709AspThr: 3.709 ± 0.269
4.366AspVal: 4.366 ± 0.334
1.029AspTrp: 1.029 ± 0.149
3.354AspTyr: 3.354 ± 0.288
0.0AspXaa: 0.0 ± 0.0
Glu
5.058GluAla: 5.058 ± 0.346
1.047GluCys: 1.047 ± 0.151
3.479GluAsp: 3.479 ± 0.256
5.04GluGlu: 5.04 ± 0.344
2.928GluPhe: 2.928 ± 0.193
3.23GluGly: 3.23 ± 0.265
1.189GluHis: 1.189 ± 0.144
5.839GluIle: 5.839 ± 0.346
5.715GluLys: 5.715 ± 0.345
6.407GluLeu: 6.407 ± 0.323
2.289GluMet: 2.289 ± 0.22
3.691GluAsn: 3.691 ± 0.253
1.846GluPro: 1.846 ± 0.22
2.662GluGln: 2.662 ± 0.225
3.159GluArg: 3.159 ± 0.242
3.603GluSer: 3.603 ± 0.315
3.762GluThr: 3.762 ± 0.246
4.029GluVal: 4.029 ± 0.283
1.136GluTrp: 1.136 ± 0.138
2.733GluTyr: 2.733 ± 0.236
0.0GluXaa: 0.0 ± 0.0
Phe
2.644PheAla: 2.644 ± 0.211
0.479PheCys: 0.479 ± 0.087
3.674PheAsp: 3.674 ± 0.259
2.928PheGlu: 2.928 ± 0.269
1.384PhePhe: 1.384 ± 0.172
2.627PheGly: 2.627 ± 0.241
0.55PheHis: 0.55 ± 0.106
2.485PheIle: 2.485 ± 0.226
2.893PheLys: 2.893 ± 0.224
2.573PheLeu: 2.573 ± 0.202
1.242PheMet: 1.242 ± 0.127
2.431PheAsn: 2.431 ± 0.2
1.402PhePro: 1.402 ± 0.165
1.438PheGln: 1.438 ± 0.15
2.005PheArg: 2.005 ± 0.204
2.715PheSer: 2.715 ± 0.221
2.538PheThr: 2.538 ± 0.215
3.124PheVal: 3.124 ± 0.228
0.586PheTrp: 0.586 ± 0.094
1.739PheTyr: 1.739 ± 0.164
0.0PheXaa: 0.0 ± 0.0
Gly
4.348GlyAla: 4.348 ± 0.421
0.923GlyCys: 0.923 ± 0.129
4.472GlyAsp: 4.472 ± 0.29
3.833GlyGlu: 3.833 ± 0.257
3.248GlyPhe: 3.248 ± 0.307
4.029GlyGly: 4.029 ± 0.378
1.26GlyHis: 1.26 ± 0.152
4.135GlyIle: 4.135 ± 0.294
4.898GlyLys: 4.898 ± 0.335
4.543GlyLeu: 4.543 ± 0.286
1.686GlyMet: 1.686 ± 0.165
3.301GlyAsn: 3.301 ± 0.366
0.887GlyPro: 0.887 ± 0.103
1.509GlyGln: 1.509 ± 0.204
2.502GlyArg: 2.502 ± 0.209
4.401GlySer: 4.401 ± 0.307
3.709GlyThr: 3.709 ± 0.426
4.65GlyVal: 4.65 ± 0.281
0.852GlyTrp: 0.852 ± 0.13
3.177GlyTyr: 3.177 ± 0.195
0.0GlyXaa: 0.0 ± 0.0
His
1.242HisAla: 1.242 ± 0.153
0.248HisCys: 0.248 ± 0.068
1.029HisAsp: 1.029 ± 0.192
1.225HisGlu: 1.225 ± 0.163
0.87HisPhe: 0.87 ± 0.115
1.012HisGly: 1.012 ± 0.15
0.39HisHis: 0.39 ± 0.096
1.509HisIle: 1.509 ± 0.171
1.047HisLys: 1.047 ± 0.155
1.402HisLeu: 1.402 ± 0.156
0.532HisMet: 0.532 ± 0.097
0.816HisAsn: 0.816 ± 0.119
1.029HisPro: 1.029 ± 0.13
0.603HisGln: 0.603 ± 0.092
0.994HisArg: 0.994 ± 0.129
0.799HisSer: 0.799 ± 0.105
0.923HisThr: 0.923 ± 0.126
1.349HisVal: 1.349 ± 0.14
0.213HisTrp: 0.213 ± 0.073
0.763HisTyr: 0.763 ± 0.116
0.0HisXaa: 0.0 ± 0.0
Ile
5.165IleAla: 5.165 ± 0.347
0.586IleCys: 0.586 ± 0.097
4.952IleAsp: 4.952 ± 0.283
4.863IleGlu: 4.863 ± 0.273
2.165IlePhe: 2.165 ± 0.153
3.514IleGly: 3.514 ± 0.246
1.136IleHis: 1.136 ± 0.15
3.709IleIle: 3.709 ± 0.248
4.934IleLys: 4.934 ± 0.363
4.668IleLeu: 4.668 ± 0.279
1.81IleMet: 1.81 ± 0.195
3.798IleAsn: 3.798 ± 0.274
2.627IlePro: 2.627 ± 0.243
1.792IleGln: 1.792 ± 0.153
3.958IleArg: 3.958 ± 0.257
3.532IleSer: 3.532 ± 0.267
4.721IleThr: 4.721 ± 0.276
4.756IleVal: 4.756 ± 0.302
0.763IleTrp: 0.763 ± 0.126
2.467IleTyr: 2.467 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
5.786LysAla: 5.786 ± 0.317
0.728LysCys: 0.728 ± 0.126
4.827LysAsp: 4.827 ± 0.336
5.2LysGlu: 5.2 ± 0.317
2.928LysPhe: 2.928 ± 0.24
3.816LysGly: 3.816 ± 0.266
1.686LysHis: 1.686 ± 0.217
4.632LysIle: 4.632 ± 0.28
4.685LysLys: 4.685 ± 0.405
5.999LysLeu: 5.999 ± 0.374
2.13LysMet: 2.13 ± 0.214
3.904LysAsn: 3.904 ± 0.263
2.627LysPro: 2.627 ± 0.226
2.946LysGln: 2.946 ± 0.264
3.958LysArg: 3.958 ± 0.335
3.372LysSer: 3.372 ± 0.259
4.472LysThr: 4.472 ± 0.294
4.827LysVal: 4.827 ± 0.33
1.171LysTrp: 1.171 ± 0.148
3.23LysTyr: 3.23 ± 0.29
0.0LysXaa: 0.0 ± 0.0
Leu
5.555LeuAla: 5.555 ± 0.384
1.065LeuCys: 1.065 ± 0.154
5.431LeuAsp: 5.431 ± 0.312
4.81LeuGlu: 4.81 ± 0.335
3.212LeuPhe: 3.212 ± 0.259
4.029LeuGly: 4.029 ± 0.226
1.42LeuHis: 1.42 ± 0.156
4.845LeuIle: 4.845 ± 0.321
5.981LeuLys: 5.981 ± 0.393
4.863LeuLeu: 4.863 ± 0.282
2.52LeuMet: 2.52 ± 0.21
4.401LeuAsn: 4.401 ± 0.248
3.62LeuPro: 3.62 ± 0.292
2.467LeuGln: 2.467 ± 0.242
4.029LeuArg: 4.029 ± 0.235
5.165LeuSer: 5.165 ± 0.274
4.082LeuThr: 4.082 ± 0.308
5.094LeuVal: 5.094 ± 0.281
0.816LeuTrp: 0.816 ± 0.111
3.23LeuTyr: 3.23 ± 0.257
0.0LeuXaa: 0.0 ± 0.0
Met
1.739MetAla: 1.739 ± 0.188
0.319MetCys: 0.319 ± 0.078
1.739MetAsp: 1.739 ± 0.182
1.863MetGlu: 1.863 ± 0.179
1.349MetPhe: 1.349 ± 0.157
1.615MetGly: 1.615 ± 0.158
0.479MetHis: 0.479 ± 0.093
2.023MetIle: 2.023 ± 0.203
2.946MetLys: 2.946 ± 0.266
2.112MetLeu: 2.112 ± 0.214
0.887MetMet: 0.887 ± 0.131
2.059MetAsn: 2.059 ± 0.173
0.763MetPro: 0.763 ± 0.102
1.313MetGln: 1.313 ± 0.148
1.491MetArg: 1.491 ± 0.174
1.917MetSer: 1.917 ± 0.182
1.722MetThr: 1.722 ± 0.177
1.509MetVal: 1.509 ± 0.137
0.284MetTrp: 0.284 ± 0.073
1.118MetTyr: 1.118 ± 0.147
0.0MetXaa: 0.0 ± 0.0
Asn
4.401AsnAla: 4.401 ± 0.319
0.586AsnCys: 0.586 ± 0.09
3.177AsnAsp: 3.177 ± 0.27
4.011AsnGlu: 4.011 ± 0.232
2.005AsnPhe: 2.005 ± 0.188
4.259AsnGly: 4.259 ± 0.324
0.941AsnHis: 0.941 ± 0.142
3.301AsnIle: 3.301 ± 0.207
3.461AsnLys: 3.461 ± 0.27
4.614AsnLeu: 4.614 ± 0.28
1.58AsnMet: 1.58 ± 0.174
2.982AsnAsn: 2.982 ± 0.289
2.715AsnPro: 2.715 ± 0.262
1.722AsnGln: 1.722 ± 0.183
2.538AsnArg: 2.538 ± 0.213
2.911AsnSer: 2.911 ± 0.188
3.319AsnThr: 3.319 ± 0.288
3.851AsnVal: 3.851 ± 0.268
0.479AsnTrp: 0.479 ± 0.088
1.81AsnTyr: 1.81 ± 0.199
0.0AsnXaa: 0.0 ± 0.0
Pro
2.591ProAla: 2.591 ± 0.243
0.461ProCys: 0.461 ± 0.087
3.035ProAsp: 3.035 ± 0.227
2.804ProGlu: 2.804 ± 0.25
1.615ProPhe: 1.615 ± 0.195
1.934ProGly: 1.934 ± 0.222
0.621ProHis: 0.621 ± 0.121
1.934ProIle: 1.934 ± 0.197
2.343ProLys: 2.343 ± 0.213
2.715ProLeu: 2.715 ± 0.226
0.568ProMet: 0.568 ± 0.1
1.722ProAsn: 1.722 ± 0.162
1.012ProPro: 1.012 ± 0.139
0.994ProGln: 0.994 ± 0.11
1.58ProArg: 1.58 ± 0.16
2.112ProSer: 2.112 ± 0.173
2.627ProThr: 2.627 ± 0.201
2.999ProVal: 2.999 ± 0.243
0.515ProTrp: 0.515 ± 0.085
1.97ProTyr: 1.97 ± 0.177
0.0ProXaa: 0.0 ± 0.0
Gln
2.414GlnAla: 2.414 ± 0.239
0.39GlnCys: 0.39 ± 0.076
1.633GlnAsp: 1.633 ± 0.218
1.97GlnGlu: 1.97 ± 0.212
1.278GlnPhe: 1.278 ± 0.151
1.828GlnGly: 1.828 ± 0.173
0.603GlnHis: 0.603 ± 0.118
2.325GlnIle: 2.325 ± 0.218
2.449GlnLys: 2.449 ± 0.252
2.609GlnLeu: 2.609 ± 0.244
1.171GlnMet: 1.171 ± 0.143
1.58GlnAsn: 1.58 ± 0.159
1.1GlnPro: 1.1 ± 0.134
1.402GlnGln: 1.402 ± 0.198
1.704GlnArg: 1.704 ± 0.207
1.863GlnSer: 1.863 ± 0.185
2.218GlnThr: 2.218 ± 0.161
2.147GlnVal: 2.147 ± 0.2
0.373GlnTrp: 0.373 ± 0.093
1.722GlnTyr: 1.722 ± 0.176
0.0GlnXaa: 0.0 ± 0.0
Arg
2.911ArgAla: 2.911 ± 0.24
0.408ArgCys: 0.408 ± 0.083
3.159ArgAsp: 3.159 ± 0.229
3.585ArgGlu: 3.585 ± 0.32
1.846ArgPhe: 1.846 ± 0.19
3.408ArgGly: 3.408 ± 0.217
0.657ArgHis: 0.657 ± 0.109
3.39ArgIle: 3.39 ± 0.242
3.603ArgLys: 3.603 ± 0.301
3.691ArgLeu: 3.691 ± 0.244
1.438ArgMet: 1.438 ± 0.153
2.875ArgAsn: 2.875 ± 0.24
1.455ArgPro: 1.455 ± 0.158
1.491ArgGln: 1.491 ± 0.168
2.36ArgArg: 2.36 ± 0.216
2.822ArgSer: 2.822 ± 0.217
2.644ArgThr: 2.644 ± 0.207
3.408ArgVal: 3.408 ± 0.246
0.87ArgTrp: 0.87 ± 0.136
2.041ArgTyr: 2.041 ± 0.217
0.0ArgXaa: 0.0 ± 0.0
Ser
3.656SerAla: 3.656 ± 0.213
0.621SerCys: 0.621 ± 0.098
3.603SerAsp: 3.603 ± 0.268
3.461SerGlu: 3.461 ± 0.277
2.928SerPhe: 2.928 ± 0.216
4.756SerGly: 4.756 ± 0.325
1.136SerHis: 1.136 ± 0.131
3.727SerIle: 3.727 ± 0.251
4.419SerLys: 4.419 ± 0.283
4.188SerLeu: 4.188 ± 0.275
1.704SerMet: 1.704 ± 0.184
2.982SerAsn: 2.982 ± 0.2
2.165SerPro: 2.165 ± 0.215
1.97SerGln: 1.97 ± 0.206
2.84SerArg: 2.84 ± 0.217
3.266SerSer: 3.266 ± 0.251
2.928SerThr: 2.928 ± 0.248
4.739SerVal: 4.739 ± 0.289
0.657SerTrp: 0.657 ± 0.107
2.005SerTyr: 2.005 ± 0.182
0.0SerXaa: 0.0 ± 0.0
Thr
3.904ThrAla: 3.904 ± 0.336
0.55ThrCys: 0.55 ± 0.099
3.603ThrAsp: 3.603 ± 0.294
3.833ThrGlu: 3.833 ± 0.232
2.485ThrPhe: 2.485 ± 0.242
4.508ThrGly: 4.508 ± 0.357
1.118ThrHis: 1.118 ± 0.145
3.869ThrIle: 3.869 ± 0.254
3.993ThrLys: 3.993 ± 0.259
5.324ThrLeu: 5.324 ± 0.376
1.349ThrMet: 1.349 ± 0.142
2.733ThrAsn: 2.733 ± 0.291
2.875ThrPro: 2.875 ± 0.237
2.005ThrGln: 2.005 ± 0.199
2.627ThrArg: 2.627 ± 0.193
3.301ThrSer: 3.301 ± 0.264
3.585ThrThr: 3.585 ± 0.369
4.792ThrVal: 4.792 ± 0.386
0.763ThrTrp: 0.763 ± 0.13
2.343ThrTyr: 2.343 ± 0.189
0.0ThrXaa: 0.0 ± 0.0
Val
4.49ValAla: 4.49 ± 0.29
1.047ValCys: 1.047 ± 0.163
5.573ValAsp: 5.573 ± 0.318
5.182ValGlu: 5.182 ± 0.324
2.964ValPhe: 2.964 ± 0.195
3.833ValGly: 3.833 ± 0.224
1.047ValHis: 1.047 ± 0.136
4.384ValIle: 4.384 ± 0.328
5.129ValLys: 5.129 ± 0.321
4.685ValLeu: 4.685 ± 0.298
1.792ValMet: 1.792 ± 0.175
4.206ValAsn: 4.206 ± 0.261
2.662ValPro: 2.662 ± 0.213
2.289ValGln: 2.289 ± 0.253
3.159ValArg: 3.159 ± 0.235
4.455ValSer: 4.455 ± 0.26
4.313ValThr: 4.313 ± 0.372
4.987ValVal: 4.987 ± 0.373
0.87ValTrp: 0.87 ± 0.146
3.603ValTyr: 3.603 ± 0.267
0.0ValXaa: 0.0 ± 0.0
Trp
0.781TrpAla: 0.781 ± 0.099
0.142TrpCys: 0.142 ± 0.05
1.047TrpAsp: 1.047 ± 0.124
1.012TrpGlu: 1.012 ± 0.139
0.568TrpPhe: 0.568 ± 0.092
0.852TrpGly: 0.852 ± 0.131
0.337TrpHis: 0.337 ± 0.081
0.639TrpIle: 0.639 ± 0.109
1.331TrpLys: 1.331 ± 0.151
0.976TrpLeu: 0.976 ± 0.146
0.568TrpMet: 0.568 ± 0.097
0.763TrpAsn: 0.763 ± 0.099
0.213TrpPro: 0.213 ± 0.057
0.515TrpGln: 0.515 ± 0.086
0.621TrpArg: 0.621 ± 0.115
0.515TrpSer: 0.515 ± 0.093
0.728TrpThr: 0.728 ± 0.119
0.905TrpVal: 0.905 ± 0.137
0.195TrpTrp: 0.195 ± 0.053
0.71TrpTyr: 0.71 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.84TyrAla: 2.84 ± 0.273
0.639TyrCys: 0.639 ± 0.099
3.088TyrAsp: 3.088 ± 0.224
3.017TyrGlu: 3.017 ± 0.269
1.562TyrPhe: 1.562 ± 0.151
2.733TyrGly: 2.733 ± 0.224
0.816TyrHis: 0.816 ± 0.13
2.84TyrIle: 2.84 ± 0.213
2.911TyrLys: 2.911 ± 0.254
2.857TyrLeu: 2.857 ± 0.231
1.154TyrMet: 1.154 ± 0.145
2.733TyrAsn: 2.733 ± 0.242
1.562TyrPro: 1.562 ± 0.166
1.509TyrGln: 1.509 ± 0.165
2.076TyrArg: 2.076 ± 0.206
2.769TyrSer: 2.769 ± 0.247
2.698TyrThr: 2.698 ± 0.214
3.106TyrVal: 3.106 ± 0.279
0.603TyrTrp: 0.603 ± 0.103
1.775TyrTyr: 1.775 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 272 proteins (56347 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski