Amino acid dipepetide frequency for Vibrio phage vB_VpaS_MAR10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.285AlaAla: 7.285 ± 0.892
0.708AlaCys: 0.708 ± 0.165
5.162AlaAsp: 5.162 ± 0.55
5.079AlaGlu: 5.079 ± 0.538
3.247AlaPhe: 3.247 ± 0.387
5.412AlaGly: 5.412 ± 0.48
1.748AlaHis: 1.748 ± 0.286
4.33AlaIle: 4.33 ± 0.514
4.288AlaLys: 4.288 ± 0.448
7.285AlaLeu: 7.285 ± 0.522
2.206AlaMet: 2.206 ± 0.428
3.414AlaAsn: 3.414 ± 0.44
2.456AlaPro: 2.456 ± 0.38
2.664AlaGln: 2.664 ± 0.391
3.955AlaArg: 3.955 ± 0.424
4.413AlaSer: 4.413 ± 0.472
5.703AlaThr: 5.703 ± 0.56
5.828AlaVal: 5.828 ± 0.519
0.833AlaTrp: 0.833 ± 0.175
2.248AlaTyr: 2.248 ± 0.338
0.0AlaXaa: 0.0 ± 0.0
Cys
1.041CysAla: 1.041 ± 0.244
0.375CysCys: 0.375 ± 0.126
1.082CysAsp: 1.082 ± 0.226
1.041CysGlu: 1.041 ± 0.215
0.375CysPhe: 0.375 ± 0.14
0.791CysGly: 0.791 ± 0.193
0.208CysHis: 0.208 ± 0.087
0.749CysIle: 0.749 ± 0.153
0.541CysLys: 0.541 ± 0.133
1.374CysLeu: 1.374 ± 0.246
0.375CysMet: 0.375 ± 0.138
0.375CysAsn: 0.375 ± 0.111
0.458CysPro: 0.458 ± 0.144
0.291CysGln: 0.291 ± 0.087
0.749CysArg: 0.749 ± 0.165
0.666CysSer: 0.666 ± 0.171
0.333CysThr: 0.333 ± 0.111
0.624CysVal: 0.624 ± 0.144
0.375CysTrp: 0.375 ± 0.126
0.5CysTyr: 0.5 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
4.871AspAla: 4.871 ± 0.409
0.5AspCys: 0.5 ± 0.119
4.996AspAsp: 4.996 ± 0.595
4.954AspGlu: 4.954 ± 0.561
3.33AspPhe: 3.33 ± 0.341
4.579AspGly: 4.579 ± 0.458
1.291AspHis: 1.291 ± 0.204
3.705AspIle: 3.705 ± 0.395
4.038AspLys: 4.038 ± 0.469
6.245AspLeu: 6.245 ± 0.519
1.957AspMet: 1.957 ± 0.287
2.539AspAsn: 2.539 ± 0.369
2.956AspPro: 2.956 ± 0.377
2.04AspGln: 2.04 ± 0.376
3.414AspArg: 3.414 ± 0.401
3.164AspSer: 3.164 ± 0.334
3.247AspThr: 3.247 ± 0.429
3.705AspVal: 3.705 ± 0.404
1.415AspTrp: 1.415 ± 0.244
2.331AspTyr: 2.331 ± 0.312
0.0AspXaa: 0.0 ± 0.0
Glu
5.703GluAla: 5.703 ± 0.501
0.874GluCys: 0.874 ± 0.227
5.287GluAsp: 5.287 ± 0.477
5.287GluGlu: 5.287 ± 0.879
2.706GluPhe: 2.706 ± 0.257
4.912GluGly: 4.912 ± 0.424
1.54GluHis: 1.54 ± 0.279
3.705GluIle: 3.705 ± 0.395
3.455GluLys: 3.455 ± 0.454
7.119GluLeu: 7.119 ± 0.575
2.498GluMet: 2.498 ± 0.282
2.581GluAsn: 2.581 ± 0.379
2.165GluPro: 2.165 ± 0.332
2.498GluGln: 2.498 ± 0.373
3.539GluArg: 3.539 ± 0.483
4.121GluSer: 4.121 ± 0.451
3.372GluThr: 3.372 ± 0.331
5.079GluVal: 5.079 ± 0.541
0.916GluTrp: 0.916 ± 0.164
2.706GluTyr: 2.706 ± 0.402
0.0GluXaa: 0.0 ± 0.0
Phe
3.081PheAla: 3.081 ± 0.383
0.5PheCys: 0.5 ± 0.136
2.914PheAsp: 2.914 ± 0.322
2.831PheGlu: 2.831 ± 0.299
1.624PhePhe: 1.624 ± 0.298
2.872PheGly: 2.872 ± 0.306
0.874PheHis: 0.874 ± 0.207
1.748PheIle: 1.748 ± 0.278
2.29PheLys: 2.29 ± 0.32
2.664PheLeu: 2.664 ± 0.332
0.666PheMet: 0.666 ± 0.17
1.707PheAsn: 1.707 ± 0.217
1.624PhePro: 1.624 ± 0.269
1.249PheGln: 1.249 ± 0.207
2.415PheArg: 2.415 ± 0.312
2.956PheSer: 2.956 ± 0.377
3.122PheThr: 3.122 ± 0.334
2.706PheVal: 2.706 ± 0.29
0.749PheTrp: 0.749 ± 0.216
1.374PheTyr: 1.374 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
5.121GlyAla: 5.121 ± 0.651
0.874GlyCys: 0.874 ± 0.197
5.329GlyAsp: 5.329 ± 0.563
5.287GlyGlu: 5.287 ± 0.492
3.164GlyPhe: 3.164 ± 0.41
5.537GlyGly: 5.537 ± 0.456
1.249GlyHis: 1.249 ± 0.237
3.747GlyIle: 3.747 ± 0.298
5.828GlyLys: 5.828 ± 0.417
4.996GlyLeu: 4.996 ± 0.538
2.623GlyMet: 2.623 ± 0.417
2.706GlyAsn: 2.706 ± 0.286
1.457GlyPro: 1.457 ± 0.251
2.539GlyGln: 2.539 ± 0.343
3.372GlyArg: 3.372 ± 0.452
4.496GlySer: 4.496 ± 0.486
4.371GlyThr: 4.371 ± 0.447
4.912GlyVal: 4.912 ± 0.433
0.833GlyTrp: 0.833 ± 0.212
3.039GlyTyr: 3.039 ± 0.337
0.0GlyXaa: 0.0 ± 0.0
His
1.291HisAla: 1.291 ± 0.232
0.416HisCys: 0.416 ± 0.162
1.249HisAsp: 1.249 ± 0.286
1.082HisGlu: 1.082 ± 0.232
1.124HisPhe: 1.124 ± 0.242
1.332HisGly: 1.332 ± 0.266
0.666HisHis: 0.666 ± 0.168
1.166HisIle: 1.166 ± 0.225
0.999HisLys: 0.999 ± 0.196
1.957HisLeu: 1.957 ± 0.255
0.458HisMet: 0.458 ± 0.147
0.666HisAsn: 0.666 ± 0.13
1.041HisPro: 1.041 ± 0.212
0.749HisGln: 0.749 ± 0.181
1.54HisArg: 1.54 ± 0.244
1.124HisSer: 1.124 ± 0.205
1.207HisThr: 1.207 ± 0.22
1.207HisVal: 1.207 ± 0.228
0.583HisTrp: 0.583 ± 0.139
0.749HisTyr: 0.749 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
3.788IleAla: 3.788 ± 0.471
0.624IleCys: 0.624 ± 0.16
3.622IleAsp: 3.622 ± 0.432
4.704IleGlu: 4.704 ± 0.487
1.499IlePhe: 1.499 ± 0.204
3.164IleGly: 3.164 ± 0.415
1.207IleHis: 1.207 ± 0.229
2.706IleIle: 2.706 ± 0.31
2.748IleLys: 2.748 ± 0.345
4.704IleLeu: 4.704 ± 0.442
1.166IleMet: 1.166 ± 0.246
2.082IleAsn: 2.082 ± 0.327
2.539IlePro: 2.539 ± 0.365
2.082IleGln: 2.082 ± 0.282
2.872IleArg: 2.872 ± 0.334
3.206IleSer: 3.206 ± 0.439
3.955IleThr: 3.955 ± 0.433
3.289IleVal: 3.289 ± 0.415
0.583IleTrp: 0.583 ± 0.146
1.832IleTyr: 1.832 ± 0.25
0.0IleXaa: 0.0 ± 0.0
Lys
5.454LysAla: 5.454 ± 0.534
0.749LysCys: 0.749 ± 0.178
3.58LysAsp: 3.58 ± 0.363
4.288LysGlu: 4.288 ± 0.381
1.957LysPhe: 1.957 ± 0.267
4.787LysGly: 4.787 ± 0.442
1.707LysHis: 1.707 ± 0.233
2.997LysIle: 2.997 ± 0.375
4.871LysLys: 4.871 ± 0.64
5.162LysLeu: 5.162 ± 0.471
2.082LysMet: 2.082 ± 0.244
2.706LysAsn: 2.706 ± 0.265
3.872LysPro: 3.872 ± 0.489
2.914LysGln: 2.914 ± 0.38
3.747LysArg: 3.747 ± 0.415
3.247LysSer: 3.247 ± 0.279
3.705LysThr: 3.705 ± 0.413
4.371LysVal: 4.371 ± 0.386
1.291LysTrp: 1.291 ± 0.221
2.248LysTyr: 2.248 ± 0.274
0.0LysXaa: 0.0 ± 0.0
Leu
6.661LeuAla: 6.661 ± 0.627
0.708LeuCys: 0.708 ± 0.203
5.87LeuAsp: 5.87 ± 0.449
5.204LeuGlu: 5.204 ± 0.43
3.164LeuPhe: 3.164 ± 0.315
5.204LeuGly: 5.204 ± 0.456
1.415LeuHis: 1.415 ± 0.246
4.538LeuIle: 4.538 ± 0.416
5.578LeuLys: 5.578 ± 0.478
7.119LeuLeu: 7.119 ± 0.729
1.582LeuMet: 1.582 ± 0.209
4.371LeuAsn: 4.371 ± 0.462
2.872LeuPro: 2.872 ± 0.315
2.623LeuGln: 2.623 ± 0.354
4.829LeuArg: 4.829 ± 0.476
4.954LeuSer: 4.954 ± 0.528
6.369LeuThr: 6.369 ± 0.643
6.911LeuVal: 6.911 ± 0.555
1.166LeuTrp: 1.166 ± 0.215
2.623LeuTyr: 2.623 ± 0.324
0.0LeuXaa: 0.0 ± 0.0
Met
2.831MetAla: 2.831 ± 0.406
0.541MetCys: 0.541 ± 0.151
1.54MetAsp: 1.54 ± 0.212
1.79MetGlu: 1.79 ± 0.245
1.332MetPhe: 1.332 ± 0.221
1.748MetGly: 1.748 ± 0.342
0.458MetHis: 0.458 ± 0.138
0.957MetIle: 0.957 ± 0.158
1.457MetLys: 1.457 ± 0.268
1.873MetLeu: 1.873 ± 0.3
0.583MetMet: 0.583 ± 0.223
0.791MetAsn: 0.791 ± 0.179
1.291MetPro: 1.291 ± 0.215
1.041MetGln: 1.041 ± 0.223
1.79MetArg: 1.79 ± 0.244
2.29MetSer: 2.29 ± 0.291
1.166MetThr: 1.166 ± 0.219
1.665MetVal: 1.665 ± 0.237
0.083MetTrp: 0.083 ± 0.059
0.833MetTyr: 0.833 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
2.997AsnAla: 2.997 ± 0.357
0.5AsnCys: 0.5 ± 0.154
2.581AsnAsp: 2.581 ± 0.298
1.957AsnGlu: 1.957 ± 0.319
2.04AsnPhe: 2.04 ± 0.249
3.622AsnGly: 3.622 ± 0.401
1.082AsnHis: 1.082 ± 0.239
1.915AsnIle: 1.915 ± 0.262
2.748AsnLys: 2.748 ± 0.318
3.414AsnLeu: 3.414 ± 0.382
1.332AsnMet: 1.332 ± 0.217
2.498AsnAsn: 2.498 ± 0.465
2.248AsnPro: 2.248 ± 0.241
1.582AsnGln: 1.582 ± 0.231
2.373AsnArg: 2.373 ± 0.308
2.456AsnSer: 2.456 ± 0.292
2.748AsnThr: 2.748 ± 0.305
2.748AsnVal: 2.748 ± 0.339
0.624AsnTrp: 0.624 ± 0.154
1.249AsnTyr: 1.249 ± 0.235
0.0AsnXaa: 0.0 ± 0.0
Pro
3.122ProAla: 3.122 ± 0.414
0.5ProCys: 0.5 ± 0.133
2.498ProAsp: 2.498 ± 0.297
3.164ProGlu: 3.164 ± 0.336
1.624ProPhe: 1.624 ± 0.235
2.331ProGly: 2.331 ± 0.291
0.833ProHis: 0.833 ± 0.198
2.123ProIle: 2.123 ± 0.298
2.914ProLys: 2.914 ± 0.344
2.706ProLeu: 2.706 ± 0.361
0.833ProMet: 0.833 ± 0.215
1.79ProAsn: 1.79 ± 0.291
1.582ProPro: 1.582 ± 0.267
1.249ProGln: 1.249 ± 0.22
2.248ProArg: 2.248 ± 0.328
2.331ProSer: 2.331 ± 0.322
3.122ProThr: 3.122 ± 0.359
3.33ProVal: 3.33 ± 0.423
0.5ProTrp: 0.5 ± 0.15
1.457ProTyr: 1.457 ± 0.232
0.0ProXaa: 0.0 ± 0.0
Gln
2.664GlnAla: 2.664 ± 0.342
0.375GlnCys: 0.375 ± 0.147
1.915GlnAsp: 1.915 ± 0.276
2.29GlnGlu: 2.29 ± 0.312
1.54GlnPhe: 1.54 ± 0.25
2.082GlnGly: 2.082 ± 0.306
0.708GlnHis: 0.708 ± 0.176
1.873GlnIle: 1.873 ± 0.263
1.957GlnLys: 1.957 ± 0.36
3.414GlnLeu: 3.414 ± 0.41
0.791GlnMet: 0.791 ± 0.159
1.291GlnAsn: 1.291 ± 0.241
1.166GlnPro: 1.166 ± 0.239
1.582GlnGln: 1.582 ± 0.352
2.581GlnArg: 2.581 ± 0.419
1.873GlnSer: 1.873 ± 0.292
1.79GlnThr: 1.79 ± 0.272
2.664GlnVal: 2.664 ± 0.276
0.749GlnTrp: 0.749 ± 0.174
1.249GlnTyr: 1.249 ± 0.251
0.0GlnXaa: 0.0 ± 0.0
Arg
3.997ArgAla: 3.997 ± 0.405
0.999ArgCys: 0.999 ± 0.233
3.372ArgAsp: 3.372 ± 0.412
4.538ArgGlu: 4.538 ± 0.489
1.998ArgPhe: 1.998 ± 0.295
4.371ArgGly: 4.371 ± 0.668
0.999ArgHis: 0.999 ± 0.168
3.081ArgIle: 3.081 ± 0.356
3.913ArgLys: 3.913 ± 0.45
4.288ArgLeu: 4.288 ± 0.416
1.415ArgMet: 1.415 ± 0.252
2.206ArgAsn: 2.206 ± 0.309
2.165ArgPro: 2.165 ± 0.265
1.624ArgGln: 1.624 ± 0.313
4.08ArgArg: 4.08 ± 0.777
3.747ArgSer: 3.747 ± 0.464
3.164ArgThr: 3.164 ± 0.363
3.913ArgVal: 3.913 ± 0.521
0.999ArgTrp: 0.999 ± 0.199
1.873ArgTyr: 1.873 ± 0.303
0.0ArgXaa: 0.0 ± 0.0
Ser
3.622SerAla: 3.622 ± 0.336
0.5SerCys: 0.5 ± 0.138
3.289SerAsp: 3.289 ± 0.421
3.58SerGlu: 3.58 ± 0.341
2.664SerPhe: 2.664 ± 0.357
5.079SerGly: 5.079 ± 0.535
1.249SerHis: 1.249 ± 0.242
3.747SerIle: 3.747 ± 0.409
4.496SerLys: 4.496 ± 0.417
4.33SerLeu: 4.33 ± 0.507
1.082SerMet: 1.082 ± 0.193
2.165SerAsn: 2.165 ± 0.259
2.623SerPro: 2.623 ± 0.252
2.206SerGln: 2.206 ± 0.286
3.622SerArg: 3.622 ± 0.527
3.58SerSer: 3.58 ± 0.363
3.788SerThr: 3.788 ± 0.383
4.205SerVal: 4.205 ± 0.501
0.833SerTrp: 0.833 ± 0.185
1.915SerTyr: 1.915 ± 0.247
0.0SerXaa: 0.0 ± 0.0
Thr
5.578ThrAla: 5.578 ± 0.537
0.666ThrCys: 0.666 ± 0.188
3.206ThrAsp: 3.206 ± 0.417
3.164ThrGlu: 3.164 ± 0.348
2.664ThrPhe: 2.664 ± 0.335
4.954ThrGly: 4.954 ± 0.475
1.166ThrHis: 1.166 ± 0.216
3.122ThrIle: 3.122 ± 0.374
5.787ThrLys: 5.787 ± 0.459
5.745ThrLeu: 5.745 ± 0.411
1.249ThrMet: 1.249 ± 0.22
3.122ThrAsn: 3.122 ± 0.36
3.164ThrPro: 3.164 ± 0.366
1.707ThrGln: 1.707 ± 0.248
2.539ThrArg: 2.539 ± 0.279
2.872ThrSer: 2.872 ± 0.38
3.122ThrThr: 3.122 ± 0.405
4.413ThrVal: 4.413 ± 0.486
0.916ThrTrp: 0.916 ± 0.177
2.165ThrTyr: 2.165 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
5.287ValAla: 5.287 ± 0.493
1.082ValCys: 1.082 ± 0.23
3.955ValAsp: 3.955 ± 0.356
5.37ValGlu: 5.37 ± 0.481
2.331ValPhe: 2.331 ± 0.269
4.787ValGly: 4.787 ± 0.47
1.041ValHis: 1.041 ± 0.166
3.58ValIle: 3.58 ± 0.33
4.787ValLys: 4.787 ± 0.463
5.495ValLeu: 5.495 ± 0.534
1.998ValMet: 1.998 ± 0.265
3.414ValAsn: 3.414 ± 0.437
2.623ValPro: 2.623 ± 0.339
2.331ValGln: 2.331 ± 0.343
3.913ValArg: 3.913 ± 0.346
4.371ValSer: 4.371 ± 0.438
4.663ValThr: 4.663 ± 0.494
4.996ValVal: 4.996 ± 0.489
1.166ValTrp: 1.166 ± 0.216
2.498ValTyr: 2.498 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
1.249TrpAla: 1.249 ± 0.232
0.25TrpCys: 0.25 ± 0.096
1.207TrpAsp: 1.207 ± 0.196
1.624TrpGlu: 1.624 ± 0.241
0.541TrpPhe: 0.541 ± 0.181
0.749TrpGly: 0.749 ± 0.171
0.25TrpHis: 0.25 ± 0.103
0.957TrpIle: 0.957 ± 0.206
0.916TrpLys: 0.916 ± 0.202
1.54TrpLeu: 1.54 ± 0.281
0.208TrpMet: 0.208 ± 0.091
0.624TrpAsn: 0.624 ± 0.144
0.0TrpPro: 0.0 ± 0.0
0.5TrpGln: 0.5 ± 0.14
0.999TrpArg: 0.999 ± 0.204
0.957TrpSer: 0.957 ± 0.193
0.749TrpThr: 0.749 ± 0.173
1.374TrpVal: 1.374 ± 0.189
0.167TrpTrp: 0.167 ± 0.082
0.375TrpTyr: 0.375 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.623TyrAla: 2.623 ± 0.387
0.583TyrCys: 0.583 ± 0.141
2.623TyrAsp: 2.623 ± 0.3
2.831TyrGlu: 2.831 ± 0.375
1.041TyrPhe: 1.041 ± 0.177
3.206TyrGly: 3.206 ± 0.389
0.916TyrHis: 0.916 ± 0.179
1.624TyrIle: 1.624 ± 0.263
2.123TyrLys: 2.123 ± 0.32
2.331TyrLeu: 2.331 ± 0.29
0.916TyrMet: 0.916 ± 0.184
1.665TyrAsn: 1.665 ± 0.291
1.957TyrPro: 1.957 ± 0.312
1.082TyrGln: 1.082 ± 0.226
2.165TyrArg: 2.165 ± 0.345
1.748TyrSer: 1.748 ± 0.281
1.624TyrThr: 1.624 ± 0.251
1.748TyrVal: 1.748 ± 0.291
0.458TyrTrp: 0.458 ± 0.133
1.748TyrTyr: 1.748 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 107 proteins (24022 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski