Amino acid dipeptide frequency for Campylobacter virus NCTC12673

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
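As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the one-letter alphabet, the mapping of non-standard residues to `X`, and the normalization by the total number of dipeptides are all assumptions made for this example, and the database's exact normalization may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Assumptions of this sketch: pairs overlap (positions i and i+1),
    pairs never span two proteins, and frequencies are normalized by
    the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not real Campylobacter virus sequences.
toy_proteins = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteins)

# Uniform expectation for the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

Comparing each entry of `freqs` with `uniform_expectation` gives the over- or underrepresentation that the colouring in the table is meant to convey.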

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
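The unit conversion can be illustrated with a single table entry. This short Python sketch converts the IleLys value (10.295‰) to a fraction and compares it with the uniform expectation of 1/400; the helper name `per_mille_to_fraction` is made up for this example.

```python
def per_mille_to_fraction(value_per_mille: float) -> float:
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# IleLys value taken from the table (in per mille).
ile_lys_fraction = per_mille_to_fraction(10.295)

# Uniform expectation for one of the 400 standard dipeptides.
uniform_fraction = 1 / 400

# Ratio of observed to uniformly expected frequency (about 4.1).
ile_lys_ratio = ile_lys_fraction / uniform_fraction
```

A ratio above 1 indicates a dipeptide that is more frequent than the uniform expectation, and a ratio below 1 one that is less frequent.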

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).
1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.985 ± 0.166 | 0.369 ± 0.082 | 1.995 ± 0.269 | 1.946 ± 0.271 | 1.699 ± 0.245 | 1.601 ± 0.245 | 0.345 ± 0.081 | 3.227 ± 0.278 | 2.931 ± 0.318 | 3.325 ± 0.337 | 0.813 ± 0.154 | 2.562 ± 0.295 | 0.739 ± 0.123 | 0.911 ± 0.149 | 1.281 ± 0.158 | 2.266 ± 0.21 | 1.773 ± 0.282 | 2.094 ± 0.275 | 0.197 ± 0.072 | 1.502 ± 0.159 | 0.0 ± 0.0
Cys | 0.517 ± 0.11 | 0.246 ± 0.085 | 1.33 ± 0.219 | 1.182 ± 0.244 | 0.493 ± 0.108 | 1.207 ± 0.144 | 0.148 ± 0.06 | 1.527 ± 0.211 | 2.118 ± 0.301 | 1.33 ± 0.188 | 0.345 ± 0.115 | 2.044 ± 0.251 | 1.502 ± 0.335 | 0.369 ± 0.121 | 0.419 ± 0.088 | 1.084 ± 0.173 | 0.739 ± 0.152 | 0.985 ± 0.157 | 0.025 ± 0.022 | 1.158 ± 0.198 | 0.0 ± 0.0
Asp | 1.995 ± 0.242 | 1.01 ± 0.164 | 5.049 ± 0.54 | 4.951 ± 0.381 | 3.867 ± 0.302 | 3.005 ± 0.256 | 0.493 ± 0.121 | 7.118 ± 0.506 | 6.157 ± 0.36 | 5.172 ± 0.393 | 1.305 ± 0.169 | 5.542 ± 0.432 | 1.527 ± 0.193 | 1.059 ± 0.177 | 1.773 ± 0.201 | 3.448 ± 0.293 | 2.34 ± 0.268 | 2.709 ± 0.281 | 0.714 ± 0.118 | 3.571 ± 0.272 | 0.0 ± 0.0
Glu | 2.266 ± 0.227 | 1.601 ± 0.282 | 3.054 ± 0.337 | 3.497 ± 0.354 | 3.842 ± 0.332 | 2.094 ± 0.246 | 0.985 ± 0.138 | 6.305 ± 0.415 | 7.414 ± 0.496 | 7.685 ± 0.447 | 1.207 ± 0.156 | 6.355 ± 0.422 | 1.207 ± 0.181 | 1.379 ± 0.192 | 1.773 ± 0.237 | 4.754 ± 0.312 | 3.054 ± 0.32 | 4.557 ± 0.323 | 0.616 ± 0.117 | 4.458 ± 0.338 | 0.0 ± 0.0
Phe | 1.724 ± 0.224 | 0.862 ± 0.148 | 3.965 ± 0.285 | 4.039 ± 0.29 | 1.724 ± 0.214 | 2.488 ± 0.192 | 0.69 ± 0.134 | 4.286 ± 0.323 | 6.576 ± 0.483 | 3.497 ± 0.317 | 1.256 ± 0.162 | 3.965 ± 0.317 | 0.936 ± 0.15 | 0.961 ± 0.176 | 1.453 ± 0.197 | 3.374 ± 0.27 | 3.424 ± 0.301 | 2.34 ± 0.235 | 0.222 ± 0.07 | 2.463 ± 0.258 | 0.0 ± 0.0
Gly | 2.241 ± 0.263 | 0.665 ± 0.121 | 3.374 ± 0.293 | 2.192 ± 0.227 | 3.029 ± 0.333 | 2.438 ± 0.269 | 1.699 ± 0.379 | 4.261 ± 0.358 | 4.877 ± 0.316 | 3.596 ± 0.275 | 0.985 ± 0.157 | 4.015 ± 0.261 | 0.419 ± 0.092 | 1.355 ± 0.162 | 1.379 ± 0.218 | 4.409 ± 0.361 | 2.759 ± 0.291 | 2.438 ± 0.244 | 0.222 ± 0.073 | 3.473 ± 0.368 | 0.0 ± 0.0
His | 0.419 ± 0.125 | 0.222 ± 0.065 | 0.739 ± 0.154 | 0.64 ± 0.144 | 0.739 ± 0.169 | 0.566 ± 0.105 | 0.246 ± 0.079 | 2.094 ± 0.31 | 1.699 ± 0.241 | 1.626 ± 0.23 | 0.197 ± 0.072 | 1.158 ± 0.22 | 0.369 ± 0.103 | 0.222 ± 0.066 | 0.345 ± 0.087 | 1.158 ± 0.185 | 1.108 ± 0.193 | 1.01 ± 0.204 | 0.099 ± 0.042 | 1.01 ± 0.151 | 0.0 ± 0.0
Ile | 2.635 ± 0.291 | 2.044 ± 0.273 | 6.428 ± 0.523 | 6.478 ± 0.394 | 4.261 ± 0.331 | 3.596 ± 0.27 | 1.084 ± 0.154 | 8.35 ± 0.491 | 10.295 ± 0.52 | 8.103 ± 0.574 | 2.118 ± 0.2 | 8.645 ± 0.449 | 3.005 ± 0.273 | 2.98 ± 0.302 | 2.291 ± 0.222 | 7.02 ± 0.406 | 5.492 ± 0.417 | 4.384 ± 0.396 | 0.887 ± 0.13 | 3.497 ± 0.286 | 0.0 ± 0.0
Lys | 2.906 ± 0.364 | 2.635 ± 0.518 | 7.044 ± 0.399 | 8.128 ± 0.559 | 4.335 ± 0.369 | 4.901 ± 0.437 | 2.167 ± 0.296 | 9.088 ± 0.512 | 7.537 ± 0.463 | 9.433 ± 0.486 | 2.192 ± 0.235 | 9.901 ± 0.512 | 2.414 ± 0.293 | 3.448 ± 0.3 | 2.808 ± 0.265 | 5.837 ± 0.367 | 5.295 ± 0.422 | 4.63 ± 0.373 | 0.911 ± 0.152 | 6.256 ± 0.463 | 0.0 ± 0.0
Leu | 3.054 ± 0.295 | 1.921 ± 0.252 | 5.714 ± 0.384 | 6.995 ± 0.468 | 3.079 ± 0.266 | 5.369 ± 0.483 | 1.453 ± 0.181 | 6.428 ± 0.395 | 10.443 ± 0.634 | 7.512 ± 0.528 | 2.118 ± 0.242 | 7.241 ± 0.462 | 2.931 ± 0.274 | 2.931 ± 0.298 | 2.291 ± 0.209 | 5.936 ± 0.386 | 3.965 ± 0.353 | 3.473 ± 0.295 | 0.468 ± 0.111 | 4.089 ± 0.262 | 0.0 ± 0.0
Met | 1.256 ± 0.181 | 0.566 ± 0.113 | 1.182 ± 0.174 | 1.231 ± 0.188 | 1.429 ± 0.2 | 0.961 ± 0.123 | 0.222 ± 0.078 | 1.453 ± 0.175 | 2.562 ± 0.245 | 2.291 ± 0.282 | 0.345 ± 0.089 | 1.601 ± 0.191 | 0.493 ± 0.121 | 0.468 ± 0.112 | 0.542 ± 0.116 | 1.601 ± 0.199 | 0.813 ± 0.128 | 0.961 ± 0.154 | 0.197 ± 0.069 | 1.01 ± 0.144 | 0.0 ± 0.0
Asn | 2.167 ± 0.244 | 1.453 ± 0.222 | 4.63 ± 0.392 | 5.345 ± 0.329 | 4.162 ± 0.294 | 5.616 ± 0.476 | 1.65 ± 0.216 | 10.591 ± 0.53 | 8.67 ± 0.523 | 7.34 ± 0.464 | 2.143 ± 0.228 | 7.463 ± 0.524 | 1.97 ± 0.278 | 2.069 ± 0.268 | 2.143 ± 0.256 | 5.345 ± 0.341 | 4.286 ± 0.361 | 4.507 ± 0.313 | 0.443 ± 0.103 | 4.31 ± 0.377 | 0.0 ± 0.0
Pro | 0.591 ± 0.118 | 0.222 ± 0.071 | 1.33 ± 0.207 | 2.069 ± 0.205 | 1.207 ± 0.149 | 1.305 ± 0.133 | 0.369 ± 0.096 | 2.611 ± 0.242 | 2.66 ± 0.258 | 1.921 ± 0.235 | 0.32 ± 0.09 | 2.266 ± 0.262 | 0.64 ± 0.139 | 0.69 ± 0.14 | 0.665 ± 0.113 | 2.66 ± 0.279 | 1.502 ± 0.175 | 1.33 ± 0.21 | 0.148 ± 0.052 | 1.281 ± 0.191 | 0.0 ± 0.0
Gln | 1.108 ± 0.183 | 0.517 ± 0.104 | 1.256 ± 0.167 | 1.872 ± 0.219 | 1.675 ± 0.202 | 1.626 ± 0.215 | 0.32 ± 0.088 | 1.675 ± 0.224 | 2.266 ± 0.26 | 2.857 ± 0.233 | 0.517 ± 0.119 | 2.118 ± 0.217 | 0.64 ± 0.122 | 1.231 ± 0.18 | 0.739 ± 0.119 | 1.478 ± 0.166 | 1.33 ± 0.178 | 1.379 ± 0.204 | 0.246 ± 0.082 | 1.429 ± 0.174 | 0.0 ± 0.0
Arg | 1.01 ± 0.181 | 0.32 ± 0.078 | 1.355 ± 0.181 | 2.02 ± 0.229 | 1.379 ± 0.173 | 1.429 ± 0.146 | 0.345 ± 0.089 | 2.094 ± 0.238 | 2.857 ± 0.261 | 2.709 ± 0.213 | 0.616 ± 0.126 | 2.094 ± 0.212 | 0.542 ± 0.107 | 0.961 ± 0.151 | 0.764 ± 0.175 | 1.453 ± 0.187 | 1.552 ± 0.194 | 1.626 ± 0.165 | 0.296 ± 0.08 | 1.379 ± 0.208 | 0.0 ± 0.0
Ser | 2.241 ± 0.294 | 0.788 ± 0.141 | 4.458 ± 0.345 | 4.409 ± 0.342 | 4.064 ± 0.298 | 3.374 ± 0.396 | 1.084 ± 0.161 | 7.315 ± 0.443 | 7.167 ± 0.466 | 6.527 ± 0.379 | 1.527 ± 0.172 | 5.492 ± 0.384 | 1.453 ± 0.187 | 1.355 ± 0.182 | 1.897 ± 0.21 | 4.778 ± 0.486 | 3.35 ± 0.315 | 3.571 ± 0.361 | 0.665 ± 0.127 | 3.276 ± 0.281 | 0.0 ± 0.0
Thr | 1.527 ± 0.2 | 1.01 ± 0.196 | 3.103 ± 0.308 | 3.497 ± 0.259 | 3.029 ± 0.292 | 2.956 ± 0.301 | 0.616 ± 0.118 | 4.704 ± 0.397 | 4.384 ± 0.345 | 4.286 ± 0.357 | 0.665 ± 0.128 | 4.236 ± 0.323 | 2.315 ± 0.306 | 1.601 ± 0.278 | 1.33 ± 0.161 | 3.276 ± 0.266 | 2.857 ± 0.291 | 2.956 ± 0.3 | 0.468 ± 0.115 | 2.857 ± 0.27 | 0.0 ± 0.0
Val | 2.069 ± 0.275 | 1.01 ± 0.186 | 2.931 ± 0.347 | 3.842 ± 0.295 | 3.005 ± 0.308 | 2.537 ± 0.261 | 0.419 ± 0.104 | 4.951 ± 0.371 | 5.714 ± 0.442 | 3.818 ± 0.347 | 0.887 ± 0.152 | 3.694 ± 0.295 | 1.281 ± 0.165 | 0.911 ± 0.145 | 1.355 ± 0.149 | 3.842 ± 0.365 | 2.759 ± 0.338 | 2.882 ± 0.264 | 0.837 ± 0.156 | 2.463 ± 0.234 | 0.0 ± 0.0
Trp | 0.32 ± 0.077 | 0.246 ± 0.083 | 0.739 ± 0.112 | 0.887 ± 0.13 | 0.32 ± 0.085 | 0.32 ± 0.086 | 0.369 ± 0.085 | 0.665 ± 0.139 | 0.468 ± 0.099 | 0.517 ± 0.132 | 0.271 ± 0.085 | 0.739 ± 0.114 | 0.025 ± 0.025 | 0.123 ± 0.053 | 0.197 ± 0.065 | 0.468 ± 0.104 | 0.468 ± 0.106 | 0.591 ± 0.119 | 0.025 ± 0.026 | 0.493 ± 0.11 | 0.0 ± 0.0
Tyr | 1.527 ± 0.236 | 1.059 ± 0.167 | 3.128 ± 0.279 | 2.882 ± 0.25 | 2.931 ± 0.308 | 2.438 ± 0.259 | 0.961 ± 0.141 | 4.852 ± 0.364 | 5.074 ± 0.339 | 3.867 ± 0.345 | 1.33 ± 0.166 | 5.172 ± 0.37 | 1.379 ± 0.175 | 1.256 ± 0.158 | 1.355 ± 0.151 | 4.606 ± 0.365 | 2.808 ± 0.25 | 2.783 ± 0.285 | 0.566 ± 0.107 | 2.906 ± 0.252 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 166 proteins (40602 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
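A protein-level bootstrap of this kind can be sketched as follows in Python. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the error is taken as the standard deviation across resamples. The source states only that bootstrapping (x100) at the protein level was used, so the choice of standard deviation, the function names, and the toy sequences are hypothetical.

```python
import random
import statistics


def lys_lys_per_mille(proteins):
    """Example statistic: LysLys (KK) frequency in per mille."""
    pairs = [seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)]
    return 1000.0 * pairs.count("KK") / len(pairs) if pairs else 0.0


def bootstrap_error(proteins, statistic, n_resamples=100, seed=0):
    """Estimate the error of `statistic` by resampling whole proteins.

    Assumption of this sketch: the error is the standard deviation of
    the statistic over the bootstrap resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(statistic(resample))
    return statistics.stdev(estimates)


# Hypothetical toy input, not real Campylobacter virus sequences.
toy_proteins = ["MKKLA", "MKA", "KKKK", "ALKKA"]
error = bootstrap_error(toy_proteins, lys_lys_per_mille)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.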

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski