Amino acid dipeptide frequency for Lentibacter virus vB_LenP_ICBM2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
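As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping adjacent residue pairs in a set of protein sequences. It assumes one-letter amino acid codes (the table below uses three-letter codes) and normalizes by the total number of dipeptides; the function name and the normalization are assumptions made for this example, not the database's documented procedure.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return the relative frequency of each overlapping dipeptide.

    `sequences` is an iterable of protein strings in one-letter code.
    Pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 over the sequence, one residue at a time.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not taken from the proteome described on this page.
    for pair, freq in sorted(dipeptide_frequencies(["MKKLA", "AAC"]).items()):
        print(pair, round(freq * 1000, 3), "per mille")
```

Counting overlapping windows means a protein of length n contributes n - 1 dipeptides.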

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
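The uniform baseline described above can be sketched as follows. The example assumes that "expected" means the flat 1/400 value (2.5‰) for the 20 standard amino acids; whether the page's colouring uses exactly this threshold is not stated, and the function names are illustrative only.

```python
def uniform_expected_permille(alphabet_size: int = 20) -> float:
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille: float, expected_permille: float) -> str:
    """Label a dipeptide relative to the expected value."""
    if observed_permille > expected_permille:
        return "over"    # would be marked in red
    if observed_permille < expected_permille:
        return "under"   # would be marked in blue
    return "as expected"


if __name__ == "__main__":
    expected = uniform_expected_permille(20)  # 1000 / 400 = 2.5 per mille
    # Observed values copied from the table on this page.
    for name, value in [("AlaAla", 9.941), ("LeuSer", 7.611), ("CysCys", 0.0)]:
        print(name, classify(value, expected))
```

With Xaa included as a 21st symbol, the same function gives 1000 / 441, roughly 2.27‰.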

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
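A short sketch of the unit conversion, using the AlaAla value from the table as an example (the helper names are hypothetical):

```python
def permille_to_fraction(value_permille: float) -> float:
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille: float) -> float:
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 9.941  # AlaAla frequency in per mille, from the table
    print(permille_to_fraction(ala_ala), permille_to_percent(ala_ala))
```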

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.941 ± 1.949
AlaCys: 0.466 ± 0.177
AlaAsp: 5.825 ± 0.786
AlaGlu: 6.058 ± 0.807
AlaPhe: 2.718 ± 0.393
AlaGly: 6.99 ± 0.931
AlaHis: 0.699 ± 0.264
AlaIle: 3.728 ± 0.652
AlaLys: 5.359 ± 0.77
AlaLeu: 7.223 ± 0.938
AlaMet: 1.709 ± 0.353
AlaAsn: 3.262 ± 0.496
AlaPro: 3.107 ± 0.724
AlaGln: 4.582 ± 0.735
AlaArg: 4.349 ± 0.571
AlaSer: 6.834 ± 1.317
AlaThr: 7.145 ± 1.28
AlaVal: 4.582 ± 0.924
AlaTrp: 1.476 ± 0.407
AlaTyr: 3.573 ± 0.458
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.233 ± 0.123
CysCys: 0.0 ± 0.0
CysAsp: 0.544 ± 0.226
CysGlu: 0.544 ± 0.206
CysPhe: 0.466 ± 0.203
CysGly: 0.233 ± 0.126
CysHis: 0.311 ± 0.129
CysIle: 0.155 ± 0.121
CysLys: 0.932 ± 0.281
CysLeu: 0.388 ± 0.206
CysMet: 0.311 ± 0.134
CysAsn: 0.311 ± 0.126
CysPro: 0.155 ± 0.106
CysGln: 0.155 ± 0.104
CysArg: 0.311 ± 0.184
CysSer: 0.311 ± 0.223
CysThr: 0.155 ± 0.112
CysVal: 0.544 ± 0.221
CysTrp: 0.0 ± 0.0
CysTyr: 0.311 ± 0.148
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.126 ± 0.493
AspCys: 0.311 ± 0.142
AspAsp: 4.427 ± 0.605
AspGlu: 4.272 ± 0.657
AspPhe: 3.184 ± 0.548
AspGly: 4.893 ± 0.797
AspHis: 1.087 ± 0.313
AspIle: 4.582 ± 0.588
AspLys: 3.34 ± 0.471
AspLeu: 5.669 ± 0.587
AspMet: 1.398 ± 0.303
AspAsn: 3.495 ± 0.564
AspPro: 3.728 ± 0.438
AspGln: 2.408 ± 0.441
AspArg: 3.107 ± 0.614
AspSer: 3.806 ± 0.715
AspThr: 4.66 ± 0.444
AspVal: 4.349 ± 0.431
AspTrp: 1.553 ± 0.44
AspTyr: 2.641 ± 0.514
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.291 ± 0.755
GluCys: 0.466 ± 0.186
GluAsp: 4.272 ± 0.55
GluGlu: 4.97 ± 0.855
GluPhe: 2.641 ± 0.448
GluGly: 4.737 ± 0.58
GluHis: 1.243 ± 0.333
GluIle: 2.641 ± 0.348
GluLys: 3.107 ± 0.545
GluLeu: 5.436 ± 0.735
GluMet: 2.019 ± 0.462
GluAsn: 3.34 ± 0.558
GluPro: 1.709 ± 0.392
GluGln: 3.65 ± 0.502
GluArg: 3.65 ± 0.58
GluSer: 4.116 ± 0.543
GluThr: 3.573 ± 0.618
GluVal: 4.893 ± 0.551
GluTrp: 1.553 ± 0.435
GluTyr: 2.874 ± 0.511
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.097 ± 0.514
PheCys: 0.311 ± 0.154
PheAsp: 2.951 ± 0.332
PheGlu: 2.408 ± 0.581
PhePhe: 1.631 ± 0.506
PheGly: 3.806 ± 0.746
PheHis: 0.388 ± 0.181
PheIle: 1.864 ± 0.375
PheLys: 2.641 ± 0.528
PheLeu: 2.796 ± 0.466
PheMet: 0.854 ± 0.238
PheAsn: 2.485 ± 0.341
PhePro: 1.398 ± 0.271
PheGln: 0.699 ± 0.213
PheArg: 1.476 ± 0.488
PheSer: 3.34 ± 0.641
PheThr: 3.029 ± 0.383
PheVal: 2.408 ± 0.453
PheTrp: 0.233 ± 0.139
PheTyr: 1.631 ± 0.316
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.514 ± 0.686
GlyCys: 0.388 ± 0.194
GlyAsp: 3.883 ± 0.656
GlyGlu: 4.505 ± 0.613
GlyPhe: 2.874 ± 0.465
GlyGly: 3.728 ± 0.586
GlyHis: 1.087 ± 0.338
GlyIle: 3.883 ± 0.547
GlyLys: 4.815 ± 0.622
GlyLeu: 5.126 ± 0.882
GlyMet: 1.32 ± 0.298
GlyAsn: 2.408 ± 0.373
GlyPro: 1.864 ± 0.39
GlyGln: 2.097 ± 0.388
GlyArg: 3.262 ± 0.393
GlySer: 6.679 ± 1.025
GlyThr: 5.592 ± 1.006
GlyVal: 5.514 ± 0.645
GlyTrp: 1.243 ± 0.268
GlyTyr: 2.718 ± 0.594
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.243 ± 0.29
HisCys: 0.311 ± 0.123
HisAsp: 1.01 ± 0.286
HisGlu: 0.932 ± 0.27
HisPhe: 0.777 ± 0.207
HisGly: 0.621 ± 0.235
HisHis: 0.233 ± 0.12
HisIle: 1.087 ± 0.29
HisLys: 0.777 ± 0.245
HisLeu: 1.709 ± 0.346
HisMet: 0.466 ± 0.184
HisAsn: 0.777 ± 0.219
HisPro: 0.544 ± 0.189
HisGln: 0.311 ± 0.158
HisArg: 0.544 ± 0.187
HisSer: 0.777 ± 0.275
HisThr: 0.621 ± 0.234
HisVal: 1.087 ± 0.325
HisTrp: 0.233 ± 0.124
HisTyr: 0.699 ± 0.263
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.737 ± 0.766
IleCys: 0.078 ± 0.077
IleAsp: 4.116 ± 0.68
IleGlu: 3.34 ± 0.559
IlePhe: 1.398 ± 0.42
IleGly: 2.252 ± 0.357
IleHis: 1.01 ± 0.269
IleIle: 2.019 ± 0.42
IleLys: 2.874 ± 0.498
IleLeu: 4.116 ± 0.631
IleMet: 0.777 ± 0.212
IleAsn: 2.485 ± 0.463
IlePro: 1.709 ± 0.347
IleGln: 2.097 ± 0.384
IleArg: 2.485 ± 0.3
IleSer: 3.961 ± 0.545
IleThr: 2.485 ± 0.473
IleVal: 3.495 ± 0.524
IleTrp: 1.087 ± 0.257
IleTyr: 1.01 ± 0.263
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.524 ± 0.963
LysCys: 0.311 ± 0.234
LysAsp: 2.951 ± 0.516
LysGlu: 4.505 ± 0.707
LysPhe: 1.786 ± 0.426
LysGly: 4.349 ± 0.772
LysHis: 1.32 ± 0.418
LysIle: 2.718 ± 0.478
LysLys: 3.573 ± 0.902
LysLeu: 5.281 ± 0.644
LysMet: 1.553 ± 0.46
LysAsn: 2.641 ± 0.453
LysPro: 2.485 ± 0.481
LysGln: 2.563 ± 0.563
LysArg: 2.252 ± 0.498
LysSer: 2.874 ± 0.478
LysThr: 4.737 ± 0.76
LysVal: 3.65 ± 0.551
LysTrp: 0.544 ± 0.244
LysTyr: 1.631 ± 0.463
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.368 ± 0.896
LeuCys: 0.621 ± 0.294
LeuAsp: 6.601 ± 0.751
LeuGlu: 5.98 ± 0.719
LeuPhe: 2.563 ± 0.383
LeuGly: 6.213 ± 0.896
LeuHis: 1.165 ± 0.26
LeuIle: 3.107 ± 0.499
LeuLys: 5.126 ± 0.836
LeuLeu: 5.436 ± 0.641
LeuMet: 1.864 ± 0.402
LeuAsn: 4.039 ± 0.487
LeuPro: 2.33 ± 0.359
LeuGln: 2.718 ± 0.524
LeuArg: 3.262 ± 0.416
LeuSer: 7.611 ± 0.901
LeuThr: 6.213 ± 0.647
LeuVal: 5.592 ± 0.608
LeuTrp: 0.854 ± 0.202
LeuTyr: 2.408 ± 0.325
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.33 ± 0.392
MetCys: 0.466 ± 0.231
MetAsp: 1.786 ± 0.278
MetGlu: 1.32 ± 0.311
MetPhe: 0.699 ± 0.237
MetGly: 1.553 ± 0.465
MetHis: 0.311 ± 0.157
MetIle: 1.165 ± 0.305
MetLys: 1.398 ± 0.384
MetLeu: 1.942 ± 0.399
MetMet: 0.311 ± 0.176
MetAsn: 0.621 ± 0.223
MetPro: 0.777 ± 0.287
MetGln: 1.243 ± 0.281
MetArg: 1.864 ± 0.326
MetSer: 2.097 ± 0.351
MetThr: 2.019 ± 0.346
MetVal: 1.631 ± 0.417
MetTrp: 0.388 ± 0.159
MetTyr: 0.699 ± 0.416
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.272 ± 0.754
AsnCys: 0.544 ± 0.184
AsnAsp: 2.408 ± 0.411
AsnGlu: 1.942 ± 0.343
AsnPhe: 1.942 ± 0.433
AsnGly: 3.65 ± 0.561
AsnHis: 0.388 ± 0.161
AsnIle: 2.485 ± 0.512
AsnLys: 2.874 ± 0.48
AsnLeu: 4.582 ± 0.619
AsnMet: 1.398 ± 0.287
AsnAsn: 2.019 ± 0.511
AsnPro: 2.874 ± 0.422
AsnGln: 1.864 ± 0.345
AsnArg: 2.408 ± 0.594
AsnSer: 2.796 ± 0.467
AsnThr: 2.796 ± 0.508
AsnVal: 3.961 ± 0.537
AsnTrp: 0.233 ± 0.166
AsnTyr: 1.786 ± 0.356
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.951 ± 0.629
ProCys: 0.311 ± 0.164
ProAsp: 3.029 ± 0.432
ProGlu: 4.505 ± 0.542
ProPhe: 0.777 ± 0.268
ProGly: 0.0 ± 0.0
ProHis: 0.699 ± 0.2
ProIle: 1.476 ± 0.331
ProLys: 2.641 ± 0.454
ProLeu: 2.097 ± 0.465
ProMet: 1.32 ± 0.353
ProAsn: 1.631 ± 0.294
ProPro: 1.243 ± 0.339
ProGln: 1.398 ± 0.451
ProArg: 1.087 ± 0.287
ProSer: 3.883 ± 0.568
ProThr: 2.408 ± 0.514
ProVal: 2.641 ± 0.61
ProTrp: 0.544 ± 0.197
ProTyr: 2.019 ± 0.363
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.747 ± 0.845
GlnCys: 0.155 ± 0.1
GlnAsp: 2.252 ± 0.43
GlnGlu: 2.252 ± 0.441
GlnPhe: 2.019 ± 0.368
GlnGly: 2.019 ± 0.436
GlnHis: 0.544 ± 0.184
GlnIle: 1.786 ± 0.474
GlnLys: 1.864 ± 0.347
GlnLeu: 3.262 ± 0.529
GlnMet: 0.854 ± 0.217
GlnAsn: 1.942 ± 0.432
GlnPro: 0.388 ± 0.164
GlnGln: 1.32 ± 0.37
GlnArg: 2.563 ± 0.428
GlnSer: 1.942 ± 0.408
GlnThr: 2.796 ± 0.436
GlnVal: 2.563 ± 0.337
GlnTrp: 0.466 ± 0.157
GlnTyr: 1.398 ± 0.414
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.718 ± 0.413
ArgCys: 0.155 ± 0.09
ArgAsp: 3.495 ± 0.446
ArgGlu: 2.796 ± 0.58
ArgPhe: 1.786 ± 0.45
ArgGly: 2.485 ± 0.446
ArgHis: 0.777 ± 0.257
ArgIle: 2.951 ± 0.505
ArgLys: 3.184 ± 0.584
ArgLeu: 3.883 ± 0.522
ArgMet: 2.33 ± 0.468
ArgAsn: 2.563 ± 0.457
ArgPro: 1.864 ± 0.48
ArgGln: 2.019 ± 0.359
ArgArg: 2.097 ± 0.508
ArgSer: 3.184 ± 0.569
ArgThr: 2.485 ± 0.383
ArgVal: 2.796 ± 0.605
ArgTrp: 0.621 ± 0.175
ArgTyr: 2.252 ± 0.457
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.446 ± 1.116
SerCys: 0.544 ± 0.245
SerAsp: 5.126 ± 0.812
SerGlu: 4.194 ± 0.572
SerPhe: 4.116 ± 0.619
SerGly: 7.456 ± 1.244
SerHis: 0.854 ± 0.29
SerIle: 3.806 ± 0.621
SerLys: 3.184 ± 0.562
SerLeu: 5.281 ± 0.545
SerMet: 1.786 ± 0.453
SerAsn: 4.116 ± 0.55
SerPro: 2.252 ± 0.369
SerGln: 1.942 ± 0.509
SerArg: 2.563 ± 0.417
SerSer: 5.126 ± 0.965
SerThr: 5.281 ± 0.952
SerVal: 5.359 ± 0.951
SerTrp: 1.32 ± 0.319
SerTyr: 2.563 ± 0.561
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.3 ± 0.955
ThrCys: 0.233 ± 0.127
ThrAsp: 4.427 ± 0.586
ThrGlu: 4.116 ± 0.648
ThrPhe: 2.33 ± 0.366
ThrGly: 5.203 ± 0.81
ThrHis: 1.087 ± 0.385
ThrIle: 3.573 ± 0.603
ThrLys: 4.116 ± 0.58
ThrLeu: 6.368 ± 0.76
ThrMet: 1.32 ± 0.305
ThrAsn: 3.184 ± 0.741
ThrPro: 3.34 ± 0.457
ThrGln: 3.029 ± 0.487
ThrArg: 3.029 ± 0.546
ThrSer: 3.806 ± 0.902
ThrThr: 5.592 ± 0.81
ThrVal: 5.592 ± 0.821
ThrTrp: 1.32 ± 0.342
ThrTyr: 2.874 ± 0.546
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.902 ± 0.749
ValCys: 0.311 ± 0.139
ValAsp: 5.281 ± 0.556
ValGlu: 4.97 ± 0.847
ValPhe: 2.951 ± 0.556
ValGly: 5.436 ± 0.579
ValHis: 0.621 ± 0.26
ValIle: 2.796 ± 0.432
ValLys: 4.737 ± 0.627
ValLeu: 4.893 ± 0.662
ValMet: 1.32 ± 0.348
ValAsn: 3.107 ± 0.418
ValPro: 3.107 ± 0.45
ValGln: 1.942 ± 0.41
ValArg: 3.262 ± 0.487
ValSer: 6.446 ± 1.088
ValThr: 5.747 ± 0.695
ValVal: 5.592 ± 0.925
ValTrp: 0.311 ± 0.118
ValTyr: 2.175 ± 0.447
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.398 ± 0.35
TrpCys: 0.155 ± 0.098
TrpAsp: 0.621 ± 0.192
TrpGlu: 1.243 ± 0.257
TrpPhe: 0.777 ± 0.259
TrpGly: 0.699 ± 0.315
TrpHis: 0.544 ± 0.175
TrpIle: 0.311 ± 0.149
TrpLys: 0.466 ± 0.196
TrpLeu: 1.553 ± 0.427
TrpMet: 0.233 ± 0.127
TrpAsn: 0.699 ± 0.245
TrpPro: 0.621 ± 0.215
TrpGln: 0.621 ± 0.181
TrpArg: 0.699 ± 0.229
TrpSer: 1.165 ± 0.228
TrpThr: 1.087 ± 0.331
TrpVal: 1.553 ± 0.388
TrpTrp: 0.311 ± 0.142
TrpTyr: 0.311 ± 0.156
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.485 ± 0.441
TyrCys: 0.233 ± 0.146
TyrAsp: 3.029 ± 0.518
TyrGlu: 2.33 ± 0.356
TyrPhe: 1.01 ± 0.202
TyrGly: 2.175 ± 0.461
TyrHis: 0.388 ± 0.169
TyrIle: 1.398 ± 0.319
TyrLys: 1.32 ± 0.287
TyrLeu: 3.029 ± 0.48
TyrMet: 1.398 ± 0.372
TyrAsn: 2.097 ± 0.452
TyrPro: 1.165 ± 0.379
TyrGln: 1.476 ± 0.447
TyrArg: 2.175 ± 0.329
TyrSer: 2.563 ± 0.699
TyrThr: 3.417 ± 0.61
TyrVal: 3.107 ± 0.493
TyrTrp: 0.777 ± 0.23
TyrTyr: 1.476 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12877 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
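A protein-level bootstrap of this kind can be sketched roughly as below. This is only an illustration under stated assumptions: proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the spread is reported as the standard deviation across resamples. The function names, the one-letter input, the normalization by dipeptide count, and the choice of standard deviation as the error measure are all assumptions, not details given on this page.

```python
import random
from statistics import mean, stdev


def count_pair(seq: str, pair: str) -> int:
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_dipeptide_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are the resampling unit, so each resample draws
    len(proteins) sequences with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(p) - 1, 0) for p in sample)
        hits = sum(count_pair(p, pair) for p in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the 55 proteins of this proteome.
    toy = ["MKKLAAS", "MAAGTK", "MSLLKKAA", "MTTAAV"]
    print(bootstrap_dipeptide_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so variation between proteins contributes to the estimated error.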

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski