Amino acid dipeptide frequency for Vibrio phage Vc1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
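As a minimal sketch of how such dipeptide frequencies could be computed, the Python snippet below counts overlapping pairs of adjacent residues in a list of protein sequences. The function names, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_frequencies(proteins):
    """Return all 441 dipeptide frequencies in per mille.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


toy_proteins = ["MAAK", "MAG"]  # hypothetical sequences, not from Vc1
freqs = dipeptide_frequencies(toy_proteins)
print(freqs["MetAla"], freqs["AlaAla"])
```

Any residue outside the 20 standard one-letter codes is folded into Xaa here, which mirrors the 21st row and column of the table.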

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
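The expectation and the colour marking can be illustrated with a short Python sketch. It assumes that "expected" means the uniform value of 1/400 and that a plain greater-than or less-than comparison decides the colour; the page does not state the exact rule, and the function names are hypothetical. The three observed values are taken from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


expected = expected_per_mille()  # 2.5 per mille, i.e. 0.25 %
observed = {"AlaAla": 7.857, "CysCys": 0.076, "GlyPro": 0.0}  # table values
for name, value in observed.items():
    # value * 1e-3 converts per mille to a plain fraction
    print(name, value * 1e-3, classify(value, expected))
```

With Xaa counted as a 21st residue type, `expected_per_mille(21)` gives roughly 2.27‰ instead of 2.5‰.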

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first amino acid; second amino acid in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.857 ± 1.303
AlaCys: 0.915 ± 0.236
AlaAsp: 6.178 ± 0.551
AlaGlu: 6.484 ± 0.994
AlaPhe: 2.822 ± 0.43
AlaGly: 7.018 ± 0.584
AlaHis: 1.526 ± 0.396
AlaIle: 5.339 ± 0.533
AlaLys: 5.187 ± 0.838
AlaLeu: 7.399 ± 0.772
AlaMet: 2.67 ± 0.412
AlaAsn: 4.348 ± 0.61
AlaPro: 2.517 ± 0.331
AlaGln: 3.432 ± 0.518
AlaArg: 4.805 ± 0.75
AlaSer: 5.034 ± 0.66
AlaThr: 5.416 ± 0.826
AlaVal: 5.034 ± 0.729
AlaTrp: 0.992 ± 0.32
AlaTyr: 2.899 ± 0.454
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.763 ± 0.225
CysCys: 0.076 ± 0.078
CysAsp: 0.686 ± 0.281
CysGlu: 0.381 ± 0.18
CysPhe: 0.839 ± 0.218
CysGly: 0.686 ± 0.275
CysHis: 0.153 ± 0.107
CysIle: 0.686 ± 0.233
CysLys: 1.068 ± 0.285
CysLeu: 0.686 ± 0.256
CysMet: 0.458 ± 0.201
CysAsn: 0.305 ± 0.179
CysPro: 0.61 ± 0.217
CysGln: 0.229 ± 0.118
CysArg: 0.839 ± 0.318
CysSer: 0.381 ± 0.164
CysThr: 0.229 ± 0.127
CysVal: 0.992 ± 0.254
CysTrp: 0.305 ± 0.16
CysTyr: 0.458 ± 0.193
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.636 ± 0.648
AspCys: 0.534 ± 0.202
AspAsp: 3.051 ± 0.41
AspGlu: 4.653 ± 0.697
AspPhe: 3.127 ± 0.479
AspGly: 5.263 ± 0.732
AspHis: 0.61 ± 0.193
AspIle: 3.585 ± 0.681
AspLys: 5.339 ± 0.803
AspLeu: 4.805 ± 0.63
AspMet: 2.517 ± 0.407
AspAsn: 2.441 ± 0.487
AspPro: 1.831 ± 0.3
AspGln: 1.678 ± 0.268
AspArg: 2.975 ± 0.494
AspSer: 3.509 ± 0.583
AspThr: 4.195 ± 0.677
AspVal: 4.424 ± 0.828
AspTrp: 1.297 ± 0.29
AspTyr: 2.822 ± 0.291
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.018 ± 0.908
GluCys: 0.534 ± 0.216
GluAsp: 3.814 ± 0.53
GluGlu: 6.331 ± 0.922
GluPhe: 2.822 ± 0.483
GluGly: 4.348 ± 0.53
GluHis: 1.831 ± 0.364
GluIle: 3.051 ± 0.481
GluLys: 4.195 ± 0.739
GluLeu: 6.178 ± 0.693
GluMet: 2.365 ± 0.414
GluAsn: 2.899 ± 0.38
GluPro: 1.297 ± 0.237
GluGln: 3.356 ± 0.6
GluArg: 3.966 ± 0.424
GluSer: 4.272 ± 0.444
GluThr: 3.509 ± 0.573
GluVal: 5.111 ± 0.72
GluTrp: 1.22 ± 0.264
GluTyr: 2.67 ± 0.412
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.127 ± 0.47
PheCys: 0.381 ± 0.198
PheAsp: 3.966 ± 0.839
PheGlu: 2.288 ± 0.451
PhePhe: 1.144 ± 0.303
PheGly: 2.441 ± 0.641
PheHis: 0.839 ± 0.268
PheIle: 1.907 ± 0.385
PheLys: 2.441 ± 0.412
PheLeu: 2.288 ± 0.452
PheMet: 1.373 ± 0.361
PheAsn: 2.441 ± 0.407
PhePro: 1.22 ± 0.422
PheGln: 1.068 ± 0.239
PheArg: 1.754 ± 0.395
PheSer: 2.593 ± 0.376
PheThr: 2.212 ± 0.476
PheVal: 2.822 ± 0.645
PheTrp: 0.534 ± 0.242
PheTyr: 1.144 ± 0.293
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.026 ± 0.706
GlyCys: 1.068 ± 0.288
GlyAsp: 5.187 ± 0.511
GlyGlu: 6.331 ± 0.83
GlyPhe: 2.136 ± 0.359
GlyGly: 6.178 ± 0.881
GlyHis: 1.373 ± 0.336
GlyIle: 3.89 ± 0.67
GlyLys: 5.645 ± 0.798
GlyLeu: 5.034 ± 0.641
GlyMet: 2.441 ± 0.403
GlyAsn: 3.966 ± 0.733
GlyPro: 0.0 ± 0.0
GlyGln: 2.441 ± 0.469
GlyArg: 3.89 ± 0.671
GlySer: 4.805 ± 0.619
GlyThr: 4.805 ± 0.66
GlyVal: 4.043 ± 0.435
GlyTrp: 1.22 ± 0.276
GlyTyr: 4.043 ± 0.688
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.449 ± 0.268
HisCys: 0.229 ± 0.149
HisAsp: 1.602 ± 0.438
HisGlu: 0.839 ± 0.215
HisPhe: 0.686 ± 0.179
HisGly: 1.144 ± 0.442
HisHis: 0.381 ± 0.19
HisIle: 0.763 ± 0.27
HisLys: 1.678 ± 0.426
HisLeu: 2.059 ± 0.425
HisMet: 0.534 ± 0.222
HisAsn: 1.373 ± 0.323
HisPro: 0.534 ± 0.222
HisGln: 0.686 ± 0.214
HisArg: 1.068 ± 0.285
HisSer: 1.144 ± 0.285
HisThr: 1.22 ± 0.325
HisVal: 1.144 ± 0.278
HisTrp: 0.229 ± 0.123
HisTyr: 0.686 ± 0.249
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.966 ± 0.419
IleCys: 0.229 ± 0.103
IleAsp: 4.043 ± 0.65
IleGlu: 3.661 ± 0.606
IlePhe: 1.22 ± 0.298
IleGly: 4.653 ± 0.423
IleHis: 1.449 ± 0.42
IleIle: 3.661 ± 0.882
IleLys: 4.577 ± 0.503
IleLeu: 2.365 ± 0.394
IleMet: 0.915 ± 0.232
IleAsn: 3.661 ± 0.53
IlePro: 2.212 ± 0.342
IleGln: 1.449 ± 0.304
IleArg: 2.746 ± 0.388
IleSer: 3.509 ± 0.63
IleThr: 2.593 ± 0.28
IleVal: 3.356 ± 0.704
IleTrp: 0.534 ± 0.21
IleTyr: 1.373 ± 0.351
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.026 ± 0.923
LysCys: 1.068 ± 0.33
LysAsp: 4.424 ± 0.673
LysGlu: 4.805 ± 0.806
LysPhe: 2.288 ± 0.336
LysGly: 5.492 ± 0.825
LysHis: 1.602 ± 0.305
LysIle: 2.975 ± 0.583
LysLys: 3.814 ± 0.542
LysLeu: 5.339 ± 0.82
LysMet: 1.831 ± 0.373
LysAsn: 2.899 ± 0.452
LysPro: 3.127 ± 0.502
LysGln: 2.975 ± 0.454
LysArg: 3.509 ± 0.538
LysSer: 3.204 ± 0.415
LysThr: 2.975 ± 0.373
LysVal: 3.585 ± 0.589
LysTrp: 0.992 ± 0.293
LysTyr: 2.365 ± 0.439
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.178 ± 0.692
LeuCys: 0.915 ± 0.292
LeuAsp: 5.568 ± 0.625
LeuGlu: 5.416 ± 0.711
LeuPhe: 2.212 ± 0.302
LeuGly: 6.102 ± 0.433
LeuHis: 1.831 ± 0.422
LeuIle: 3.585 ± 0.429
LeuLys: 5.034 ± 0.791
LeuLeu: 6.484 ± 0.794
LeuMet: 2.67 ± 0.477
LeuAsn: 3.127 ± 0.351
LeuPro: 4.119 ± 0.468
LeuGln: 3.89 ± 0.427
LeuArg: 4.5 ± 0.722
LeuSer: 5.263 ± 0.872
LeuThr: 4.424 ± 0.564
LeuVal: 5.187 ± 0.754
LeuTrp: 0.686 ± 0.282
LeuTyr: 2.67 ± 0.47
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.814 ± 0.51
MetCys: 0.153 ± 0.101
MetAsp: 1.526 ± 0.352
MetGlu: 1.983 ± 0.585
MetPhe: 1.678 ± 0.31
MetGly: 1.678 ± 0.323
MetHis: 0.61 ± 0.192
MetIle: 1.22 ± 0.287
MetLys: 2.212 ± 0.383
MetLeu: 3.127 ± 0.414
MetMet: 1.068 ± 0.321
MetAsn: 0.915 ± 0.229
MetPro: 1.449 ± 0.371
MetGln: 1.22 ± 0.388
MetArg: 1.373 ± 0.276
MetSer: 1.678 ± 0.404
MetThr: 2.593 ± 0.394
MetVal: 1.678 ± 0.383
MetTrp: 0.305 ± 0.122
MetTyr: 0.992 ± 0.227
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.89 ± 0.742
AsnCys: 0.534 ± 0.155
AsnAsp: 2.517 ± 0.345
AsnGlu: 2.822 ± 0.518
AsnPhe: 2.136 ± 0.493
AsnGly: 4.119 ± 0.566
AsnHis: 0.763 ± 0.271
AsnIle: 2.441 ± 0.518
AsnLys: 3.356 ± 0.598
AsnLeu: 4.195 ± 0.608
AsnMet: 1.449 ± 0.273
AsnAsn: 2.593 ± 0.434
AsnPro: 2.593 ± 0.408
AsnGln: 1.754 ± 0.335
AsnArg: 2.593 ± 0.47
AsnSer: 2.517 ± 0.419
AsnThr: 3.356 ± 0.681
AsnVal: 3.127 ± 0.638
AsnTrp: 0.381 ± 0.129
AsnTyr: 1.831 ± 0.298
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.441 ± 0.335
ProCys: 0.305 ± 0.137
ProAsp: 2.822 ± 0.373
ProGlu: 3.585 ± 0.458
ProPhe: 1.297 ± 0.289
ProGly: 0.229 ± 0.115
ProHis: 0.839 ± 0.233
ProIle: 1.907 ± 0.421
ProLys: 1.449 ± 0.246
ProLeu: 3.127 ± 0.503
ProMet: 0.915 ± 0.245
ProAsn: 1.983 ± 0.41
ProPro: 0.686 ± 0.179
ProGln: 1.449 ± 0.33
ProArg: 1.373 ± 0.299
ProSer: 2.441 ± 0.439
ProThr: 2.059 ± 0.365
ProVal: 2.67 ± 0.375
ProTrp: 0.763 ± 0.253
ProTyr: 1.22 ± 0.26
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.424 ± 0.608
GlnCys: 0.305 ± 0.16
GlnAsp: 1.907 ± 0.315
GlnGlu: 3.738 ± 0.496
GlnPhe: 1.907 ± 0.337
GlnGly: 3.127 ± 0.367
GlnHis: 0.992 ± 0.277
GlnIle: 1.831 ± 0.323
GlnLys: 1.678 ± 0.391
GlnLeu: 3.509 ± 0.482
GlnMet: 1.373 ± 0.409
GlnAsn: 1.831 ± 0.263
GlnPro: 0.763 ± 0.224
GlnGln: 2.136 ± 0.62
GlnArg: 2.059 ± 0.331
GlnSer: 2.136 ± 0.453
GlnThr: 1.754 ± 0.365
GlnVal: 2.288 ± 0.428
GlnTrp: 0.229 ± 0.111
GlnTyr: 0.915 ± 0.27
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.348 ± 0.575
ArgCys: 0.61 ± 0.195
ArgAsp: 2.899 ± 0.392
ArgGlu: 2.975 ± 0.434
ArgPhe: 2.059 ± 0.351
ArgGly: 3.89 ± 0.604
ArgHis: 0.915 ± 0.236
ArgIle: 2.136 ± 0.371
ArgLys: 2.975 ± 0.523
ArgLeu: 4.195 ± 0.524
ArgMet: 1.983 ± 0.438
ArgAsn: 2.517 ± 0.595
ArgPro: 1.526 ± 0.355
ArgGln: 2.136 ± 0.368
ArgArg: 2.593 ± 0.406
ArgSer: 2.365 ± 0.468
ArgThr: 2.899 ± 0.6
ArgVal: 4.577 ± 0.588
ArgTrp: 0.915 ± 0.221
ArgTyr: 1.983 ± 0.303
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.492 ± 0.873
SerCys: 0.305 ± 0.176
SerAsp: 4.272 ± 0.575
SerGlu: 2.517 ± 0.445
SerPhe: 2.441 ± 0.465
SerGly: 4.577 ± 0.694
SerHis: 0.686 ± 0.245
SerIle: 3.356 ± 0.458
SerLys: 2.975 ± 0.423
SerLeu: 5.492 ± 0.795
SerMet: 1.373 ± 0.389
SerAsn: 2.593 ± 0.519
SerPro: 1.831 ± 0.265
SerGln: 2.059 ± 0.478
SerArg: 3.204 ± 0.498
SerSer: 3.356 ± 0.6
SerThr: 3.89 ± 0.627
SerVal: 4.043 ± 0.527
SerTrp: 0.839 ± 0.291
SerTyr: 2.136 ± 0.397
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.043 ± 0.589
ThrCys: 1.22 ± 0.347
ThrAsp: 3.738 ± 0.64
ThrGlu: 3.814 ± 0.614
ThrPhe: 3.127 ± 0.736
ThrGly: 5.416 ± 0.727
ThrHis: 0.915 ± 0.255
ThrIle: 3.966 ± 0.527
ThrLys: 3.89 ± 0.548
ThrLeu: 4.577 ± 0.656
ThrMet: 1.526 ± 0.294
ThrAsn: 2.975 ± 0.494
ThrPro: 3.127 ± 0.504
ThrGln: 2.059 ± 0.367
ThrArg: 2.288 ± 0.418
ThrSer: 2.593 ± 0.625
ThrThr: 4.348 ± 0.697
ThrVal: 4.805 ± 0.677
ThrTrp: 0.763 ± 0.212
ThrTyr: 1.144 ± 0.367
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.255 ± 0.633
ValCys: 0.458 ± 0.167
ValAsp: 4.348 ± 0.539
ValGlu: 4.577 ± 0.555
ValPhe: 2.365 ± 0.405
ValGly: 4.882 ± 0.59
ValHis: 1.373 ± 0.381
ValIle: 3.432 ± 0.565
ValLys: 3.966 ± 0.677
ValLeu: 4.272 ± 0.677
ValMet: 2.441 ± 0.392
ValAsn: 3.509 ± 0.526
ValPro: 2.67 ± 0.415
ValGln: 3.051 ± 0.422
ValArg: 3.051 ± 0.481
ValSer: 3.966 ± 0.676
ValThr: 4.272 ± 0.768
ValVal: 4.958 ± 0.669
ValTrp: 0.686 ± 0.19
ValTyr: 1.754 ± 0.432
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.839 ± 0.229
TrpCys: 0.534 ± 0.196
TrpAsp: 0.763 ± 0.174
TrpGlu: 0.992 ± 0.345
TrpPhe: 0.534 ± 0.203
TrpGly: 1.068 ± 0.227
TrpHis: 0.229 ± 0.111
TrpIle: 0.61 ± 0.192
TrpLys: 1.297 ± 0.365
TrpLeu: 1.602 ± 0.408
TrpMet: 0.534 ± 0.179
TrpAsn: 0.458 ± 0.16
TrpPro: 0.0 ± 0.0
TrpGln: 0.763 ± 0.195
TrpArg: 0.305 ± 0.171
TrpSer: 0.839 ± 0.33
TrpThr: 0.839 ± 0.292
TrpVal: 0.763 ± 0.214
TrpTrp: 0.076 ± 0.1
TrpTyr: 0.686 ± 0.165
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.28 ± 0.428
TyrCys: 0.458 ± 0.164
TyrAsp: 1.983 ± 0.4
TyrGlu: 2.593 ± 0.344
TyrPhe: 1.144 ± 0.213
TyrGly: 2.212 ± 0.366
TyrHis: 0.534 ± 0.159
TyrIle: 1.831 ± 0.308
TyrLys: 2.517 ± 0.537
TyrLeu: 3.127 ± 0.428
TyrMet: 0.763 ± 0.216
TyrAsn: 2.136 ± 0.392
TyrPro: 1.449 ± 0.357
TyrGln: 1.068 ± 0.444
TyrArg: 1.602 ± 0.26
TyrSer: 1.831 ± 0.496
TyrThr: 2.746 ± 0.404
TyrVal: 1.754 ± 0.538
TyrTrp: 0.763 ± 0.322
TyrTyr: 1.144 ± 0.34
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 44 proteins (13111 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
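A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration only: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for every replicate, and that the reported error is the standard deviation over replicates. None of these details is confirmed by the page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of a dipeptide frequency (per mille) over protein resamples.

    `dipeptide` is given in one-letter code, e.g. "AA" for AlaAla.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation over replicates.
    return statistics.pstdev(estimates)


toy_proteins = ["MAAK", "MAG", "MKKA"]  # hypothetical sequences
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling proteins rather than individual residues keeps the pairs within each protein together, so the estimate reflects variation between proteins.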

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski