Amino acid dipepetide frequency for Lysinibacillus phage vB_LspM-01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.304AlaAla: 4.304 ± 0.964
0.146AlaCys: 0.146 ± 0.106
3.72AlaAsp: 3.72 ± 0.474
4.377AlaGlu: 4.377 ± 0.555
2.626AlaPhe: 2.626 ± 0.422
3.72AlaGly: 3.72 ± 0.918
0.73AlaHis: 0.73 ± 0.291
5.107AlaIle: 5.107 ± 0.754
4.815AlaLys: 4.815 ± 0.591
5.325AlaLeu: 5.325 ± 1.016
1.459AlaMet: 1.459 ± 0.421
3.502AlaAsn: 3.502 ± 0.684
2.189AlaPro: 2.189 ± 0.38
1.97AlaGln: 1.97 ± 0.357
3.502AlaArg: 3.502 ± 0.62
3.72AlaSer: 3.72 ± 0.649
3.939AlaThr: 3.939 ± 0.61
2.991AlaVal: 2.991 ± 0.678
0.73AlaTrp: 0.73 ± 0.215
2.407AlaTyr: 2.407 ± 0.423
0.0AlaXaa: 0.0 ± 0.0
Cys
0.146CysAla: 0.146 ± 0.092
0.219CysCys: 0.219 ± 0.128
0.438CysAsp: 0.438 ± 0.163
0.584CysGlu: 0.584 ± 0.228
0.292CysPhe: 0.292 ± 0.152
0.73CysGly: 0.73 ± 0.251
0.365CysHis: 0.365 ± 0.149
0.292CysIle: 0.292 ± 0.15
0.365CysLys: 0.365 ± 0.139
0.73CysLeu: 0.73 ± 0.224
0.219CysMet: 0.219 ± 0.124
0.219CysAsn: 0.219 ± 0.123
0.73CysPro: 0.73 ± 0.251
0.219CysGln: 0.219 ± 0.137
0.365CysArg: 0.365 ± 0.153
0.438CysSer: 0.438 ± 0.175
0.365CysThr: 0.365 ± 0.16
0.584CysVal: 0.584 ± 0.227
0.146CysTrp: 0.146 ± 0.1
0.292CysTyr: 0.292 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
4.523AspAla: 4.523 ± 0.629
0.438AspCys: 0.438 ± 0.186
4.304AspAsp: 4.304 ± 0.747
3.575AspGlu: 3.575 ± 0.54
2.116AspPhe: 2.116 ± 0.363
5.179AspGly: 5.179 ± 0.861
0.657AspHis: 0.657 ± 0.232
4.085AspIle: 4.085 ± 0.575
4.815AspLys: 4.815 ± 0.737
5.034AspLeu: 5.034 ± 0.716
1.605AspMet: 1.605 ± 0.357
2.845AspAsn: 2.845 ± 0.491
1.459AspPro: 1.459 ± 0.325
1.313AspGln: 1.313 ± 0.288
1.97AspArg: 1.97 ± 0.395
2.991AspSer: 2.991 ± 0.529
3.575AspThr: 3.575 ± 0.42
5.252AspVal: 5.252 ± 0.647
0.657AspTrp: 0.657 ± 0.21
2.553AspTyr: 2.553 ± 0.43
0.0AspXaa: 0.0 ± 0.0
Glu
4.231GluAla: 4.231 ± 0.624
0.875GluCys: 0.875 ± 0.258
2.699GluAsp: 2.699 ± 0.467
5.763GluGlu: 5.763 ± 0.812
3.137GluPhe: 3.137 ± 0.514
5.544GluGly: 5.544 ± 0.582
1.605GluHis: 1.605 ± 0.393
5.398GluIle: 5.398 ± 0.562
6.857GluLys: 6.857 ± 0.906
6.93GluLeu: 6.93 ± 0.897
2.918GluMet: 2.918 ± 0.422
3.866GluAsn: 3.866 ± 0.603
1.459GluPro: 1.459 ± 0.34
2.699GluGln: 2.699 ± 0.42
4.523GluArg: 4.523 ± 0.597
2.991GluSer: 2.991 ± 0.539
3.939GluThr: 3.939 ± 0.527
5.836GluVal: 5.836 ± 0.655
1.678GluTrp: 1.678 ± 0.325
3.064GluTyr: 3.064 ± 0.473
0.0GluXaa: 0.0 ± 0.0
Phe
2.334PheAla: 2.334 ± 0.359
0.219PheCys: 0.219 ± 0.135
3.21PheAsp: 3.21 ± 0.601
3.283PheGlu: 3.283 ± 0.488
1.459PhePhe: 1.459 ± 0.373
1.97PheGly: 1.97 ± 0.413
0.511PheHis: 0.511 ± 0.234
3.429PheIle: 3.429 ± 0.506
4.596PheLys: 4.596 ± 0.555
2.772PheLeu: 2.772 ± 0.446
1.24PheMet: 1.24 ± 0.36
2.334PheAsn: 2.334 ± 0.467
1.094PhePro: 1.094 ± 0.34
0.875PheGln: 0.875 ± 0.286
1.751PheArg: 1.751 ± 0.29
2.043PheSer: 2.043 ± 0.408
1.97PheThr: 1.97 ± 0.458
2.918PheVal: 2.918 ± 0.49
0.219PheTrp: 0.219 ± 0.123
1.605PheTyr: 1.605 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
4.377GlyAla: 4.377 ± 0.771
0.438GlyCys: 0.438 ± 0.173
3.793GlyAsp: 3.793 ± 0.522
3.575GlyGlu: 3.575 ± 0.538
2.918GlyPhe: 2.918 ± 0.5
6.493GlyGly: 6.493 ± 2.149
1.094GlyHis: 1.094 ± 0.267
5.325GlyIle: 5.325 ± 0.833
4.45GlyLys: 4.45 ± 0.529
4.961GlyLeu: 4.961 ± 0.511
2.626GlyMet: 2.626 ± 0.458
3.648GlyAsn: 3.648 ± 0.634
0.438GlyPro: 0.438 ± 0.174
2.043GlyGln: 2.043 ± 0.426
2.845GlyArg: 2.845 ± 0.407
4.304GlySer: 4.304 ± 0.897
5.107GlyThr: 5.107 ± 0.671
5.763GlyVal: 5.763 ± 0.528
1.094GlyTrp: 1.094 ± 0.3
3.793GlyTyr: 3.793 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
0.875HisAla: 0.875 ± 0.261
0.146HisCys: 0.146 ± 0.106
1.459HisAsp: 1.459 ± 0.404
1.459HisGlu: 1.459 ± 0.354
0.438HisPhe: 0.438 ± 0.204
1.313HisGly: 1.313 ± 0.289
0.219HisHis: 0.219 ± 0.111
1.678HisIle: 1.678 ± 0.38
0.802HisLys: 0.802 ± 0.244
1.459HisLeu: 1.459 ± 0.363
0.365HisMet: 0.365 ± 0.146
0.875HisAsn: 0.875 ± 0.286
0.292HisPro: 0.292 ± 0.135
0.292HisGln: 0.292 ± 0.143
0.802HisArg: 0.802 ± 0.244
0.73HisSer: 0.73 ± 0.321
0.948HisThr: 0.948 ± 0.265
1.021HisVal: 1.021 ± 0.245
0.146HisTrp: 0.146 ± 0.099
0.875HisTyr: 0.875 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
5.107IleAla: 5.107 ± 0.899
0.511IleCys: 0.511 ± 0.199
5.107IleAsp: 5.107 ± 0.585
5.398IleGlu: 5.398 ± 0.667
2.189IlePhe: 2.189 ± 0.461
3.283IleGly: 3.283 ± 0.823
0.657IleHis: 0.657 ± 0.206
4.961IleIle: 4.961 ± 0.968
5.398IleLys: 5.398 ± 0.51
5.471IleLeu: 5.471 ± 0.621
1.605IleMet: 1.605 ± 0.265
4.596IleAsn: 4.596 ± 0.542
2.699IlePro: 2.699 ± 0.38
2.845IleGln: 2.845 ± 0.581
3.575IleArg: 3.575 ± 0.409
4.523IleSer: 4.523 ± 0.589
4.961IleThr: 4.961 ± 0.703
4.304IleVal: 4.304 ± 0.478
0.511IleTrp: 0.511 ± 0.194
2.918IleTyr: 2.918 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
4.523LysAla: 4.523 ± 0.629
0.802LysCys: 0.802 ± 0.251
4.523LysAsp: 4.523 ± 0.739
7.003LysGlu: 7.003 ± 0.857
2.991LysPhe: 2.991 ± 0.433
4.523LysGly: 4.523 ± 0.634
1.751LysHis: 1.751 ± 0.375
3.939LysIle: 3.939 ± 0.653
5.69LysLys: 5.69 ± 0.842
5.763LysLeu: 5.763 ± 0.699
2.772LysMet: 2.772 ± 0.484
3.575LysAsn: 3.575 ± 0.465
2.189LysPro: 2.189 ± 0.428
3.72LysGln: 3.72 ± 0.416
3.21LysArg: 3.21 ± 0.588
4.012LysSer: 4.012 ± 0.494
4.012LysThr: 4.012 ± 0.507
5.763LysVal: 5.763 ± 0.658
0.875LysTrp: 0.875 ± 0.259
2.991LysTyr: 2.991 ± 0.549
0.0LysXaa: 0.0 ± 0.0
Leu
5.179LeuAla: 5.179 ± 0.64
0.438LeuCys: 0.438 ± 0.201
4.45LeuAsp: 4.45 ± 0.661
7.222LeuGlu: 7.222 ± 0.783
3.648LeuPhe: 3.648 ± 0.494
5.252LeuGly: 5.252 ± 0.54
1.313LeuHis: 1.313 ± 0.323
5.398LeuIle: 5.398 ± 0.766
5.544LeuLys: 5.544 ± 0.628
5.909LeuLeu: 5.909 ± 0.924
2.334LeuMet: 2.334 ± 0.451
5.252LeuAsn: 5.252 ± 0.723
2.626LeuPro: 2.626 ± 0.453
4.012LeuGln: 4.012 ± 0.495
3.429LeuArg: 3.429 ± 0.468
5.034LeuSer: 5.034 ± 0.544
4.815LeuThr: 4.815 ± 0.577
4.888LeuVal: 4.888 ± 0.631
1.167LeuTrp: 1.167 ± 0.278
2.699LeuTyr: 2.699 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
1.459MetAla: 1.459 ± 0.293
0.146MetCys: 0.146 ± 0.102
1.532MetAsp: 1.532 ± 0.306
2.553MetGlu: 2.553 ± 0.435
0.657MetPhe: 0.657 ± 0.212
1.386MetGly: 1.386 ± 0.299
0.438MetHis: 0.438 ± 0.171
2.261MetIle: 2.261 ± 0.59
2.261MetLys: 2.261 ± 0.447
2.553MetLeu: 2.553 ± 0.441
0.875MetMet: 0.875 ± 0.307
2.334MetAsn: 2.334 ± 0.366
1.605MetPro: 1.605 ± 0.38
2.116MetGln: 2.116 ± 0.429
1.167MetArg: 1.167 ± 0.32
1.605MetSer: 1.605 ± 0.374
2.407MetThr: 2.407 ± 0.48
1.605MetVal: 1.605 ± 0.304
0.365MetTrp: 0.365 ± 0.159
1.021MetTyr: 1.021 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
4.961AsnAla: 4.961 ± 0.829
0.365AsnCys: 0.365 ± 0.161
2.699AsnAsp: 2.699 ± 0.442
5.107AsnGlu: 5.107 ± 0.628
1.459AsnPhe: 1.459 ± 0.233
5.982AsnGly: 5.982 ± 0.63
0.802AsnHis: 0.802 ± 0.213
4.158AsnIle: 4.158 ± 0.474
4.377AsnLys: 4.377 ± 0.516
3.793AsnLeu: 3.793 ± 0.617
1.897AsnMet: 1.897 ± 0.363
3.356AsnAsn: 3.356 ± 0.633
2.699AsnPro: 2.699 ± 0.434
1.605AsnGln: 1.605 ± 0.331
2.189AsnArg: 2.189 ± 0.515
2.845AsnSer: 2.845 ± 0.481
3.648AsnThr: 3.648 ± 0.606
3.866AsnVal: 3.866 ± 0.53
0.875AsnTrp: 0.875 ± 0.254
1.605AsnTyr: 1.605 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
1.678ProAla: 1.678 ± 0.324
0.365ProCys: 0.365 ± 0.172
1.897ProAsp: 1.897 ± 0.385
2.189ProGlu: 2.189 ± 0.574
1.751ProPhe: 1.751 ± 0.337
1.313ProGly: 1.313 ± 0.307
0.657ProHis: 0.657 ± 0.2
2.626ProIle: 2.626 ± 0.458
2.261ProLys: 2.261 ± 0.36
2.407ProLeu: 2.407 ± 0.455
1.021ProMet: 1.021 ± 0.319
1.605ProAsn: 1.605 ± 0.354
1.167ProPro: 1.167 ± 0.295
1.386ProGln: 1.386 ± 0.306
0.802ProArg: 0.802 ± 0.207
1.751ProSer: 1.751 ± 0.309
1.897ProThr: 1.897 ± 0.371
2.189ProVal: 2.189 ± 0.429
0.219ProTrp: 0.219 ± 0.13
1.167ProTyr: 1.167 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
2.48GlnAla: 2.48 ± 0.515
0.146GlnCys: 0.146 ± 0.107
1.824GlnAsp: 1.824 ± 0.3
3.866GlnGlu: 3.866 ± 0.633
1.313GlnPhe: 1.313 ± 0.286
2.043GlnGly: 2.043 ± 0.428
0.875GlnHis: 0.875 ± 0.216
2.334GlnIle: 2.334 ± 0.494
1.897GlnLys: 1.897 ± 0.498
3.72GlnLeu: 3.72 ± 0.502
1.386GlnMet: 1.386 ± 0.364
1.97GlnAsn: 1.97 ± 0.414
0.875GlnPro: 0.875 ± 0.268
2.772GlnGln: 2.772 ± 0.778
2.043GlnArg: 2.043 ± 0.307
1.897GlnSer: 1.897 ± 0.267
1.751GlnThr: 1.751 ± 0.284
1.97GlnVal: 1.97 ± 0.417
0.802GlnTrp: 0.802 ± 0.253
2.261GlnTyr: 2.261 ± 0.353
0.0GlnXaa: 0.0 ± 0.0
Arg
1.386ArgAla: 1.386 ± 0.403
0.511ArgCys: 0.511 ± 0.211
3.283ArgAsp: 3.283 ± 0.437
2.991ArgGlu: 2.991 ± 0.587
2.189ArgPhe: 2.189 ± 0.325
2.845ArgGly: 2.845 ± 0.309
1.167ArgHis: 1.167 ± 0.301
2.626ArgIle: 2.626 ± 0.519
3.866ArgLys: 3.866 ± 0.61
3.72ArgLeu: 3.72 ± 0.498
1.824ArgMet: 1.824 ± 0.339
2.626ArgAsn: 2.626 ± 0.394
1.167ArgPro: 1.167 ± 0.376
1.605ArgGln: 1.605 ± 0.288
2.48ArgArg: 2.48 ± 0.464
1.897ArgSer: 1.897 ± 0.418
2.918ArgThr: 2.918 ± 0.575
3.575ArgVal: 3.575 ± 0.476
0.657ArgTrp: 0.657 ± 0.186
2.772ArgTyr: 2.772 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
2.699SerAla: 2.699 ± 0.531
0.511SerCys: 0.511 ± 0.213
3.064SerAsp: 3.064 ± 0.494
3.137SerGlu: 3.137 ± 0.441
2.553SerPhe: 2.553 ± 0.408
4.304SerGly: 4.304 ± 0.733
0.438SerHis: 0.438 ± 0.174
4.815SerIle: 4.815 ± 0.488
3.502SerLys: 3.502 ± 0.516
4.158SerLeu: 4.158 ± 0.465
1.24SerMet: 1.24 ± 0.247
4.158SerAsn: 4.158 ± 0.52
1.751SerPro: 1.751 ± 0.265
2.918SerGln: 2.918 ± 0.437
2.918SerArg: 2.918 ± 0.426
3.429SerSer: 3.429 ± 0.482
2.699SerThr: 2.699 ± 0.423
3.866SerVal: 3.866 ± 0.582
0.584SerTrp: 0.584 ± 0.189
2.407SerTyr: 2.407 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
4.085ThrAla: 4.085 ± 0.82
0.219ThrCys: 0.219 ± 0.126
2.626ThrAsp: 2.626 ± 0.578
3.064ThrGlu: 3.064 ± 0.425
2.918ThrPhe: 2.918 ± 0.396
4.815ThrGly: 4.815 ± 0.683
0.73ThrHis: 0.73 ± 0.224
4.085ThrIle: 4.085 ± 0.585
4.596ThrLys: 4.596 ± 0.599
5.617ThrLeu: 5.617 ± 0.627
1.532ThrMet: 1.532 ± 0.323
3.648ThrAsn: 3.648 ± 0.391
2.918ThrPro: 2.918 ± 0.48
1.897ThrGln: 1.897 ± 0.352
2.48ThrArg: 2.48 ± 0.39
3.21ThrSer: 3.21 ± 0.456
4.085ThrThr: 4.085 ± 0.588
5.034ThrVal: 5.034 ± 0.682
0.657ThrTrp: 0.657 ± 0.188
2.845ThrTyr: 2.845 ± 0.507
0.0ThrXaa: 0.0 ± 0.0
Val
4.085ValAla: 4.085 ± 0.62
0.875ValCys: 0.875 ± 0.269
4.596ValAsp: 4.596 ± 0.634
5.325ValGlu: 5.325 ± 0.583
2.553ValPhe: 2.553 ± 0.387
3.939ValGly: 3.939 ± 0.534
1.386ValHis: 1.386 ± 0.323
4.815ValIle: 4.815 ± 0.604
4.523ValLys: 4.523 ± 0.786
5.69ValLeu: 5.69 ± 0.659
1.678ValMet: 1.678 ± 0.379
4.377ValAsn: 4.377 ± 0.572
2.189ValPro: 2.189 ± 0.383
2.116ValGln: 2.116 ± 0.469
3.575ValArg: 3.575 ± 0.455
3.866ValSer: 3.866 ± 0.575
5.325ValThr: 5.325 ± 0.689
5.471ValVal: 5.471 ± 0.642
0.584ValTrp: 0.584 ± 0.176
3.21ValTyr: 3.21 ± 0.478
0.0ValXaa: 0.0 ± 0.0
Trp
0.292TrpAla: 0.292 ± 0.114
0.0TrpCys: 0.0 ± 0.0
1.094TrpAsp: 1.094 ± 0.303
1.094TrpGlu: 1.094 ± 0.288
0.73TrpPhe: 0.73 ± 0.19
0.511TrpGly: 0.511 ± 0.197
0.219TrpHis: 0.219 ± 0.114
0.948TrpIle: 0.948 ± 0.284
1.021TrpLys: 1.021 ± 0.271
1.386TrpLeu: 1.386 ± 0.365
0.511TrpMet: 0.511 ± 0.19
0.802TrpAsn: 0.802 ± 0.237
0.073TrpPro: 0.073 ± 0.069
0.511TrpGln: 0.511 ± 0.235
0.73TrpArg: 0.73 ± 0.208
0.875TrpSer: 0.875 ± 0.234
0.219TrpThr: 0.219 ± 0.129
0.802TrpVal: 0.802 ± 0.246
0.219TrpTrp: 0.219 ± 0.113
0.657TrpTyr: 0.657 ± 0.257
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.553TyrAla: 2.553 ± 0.497
0.365TyrCys: 0.365 ± 0.159
2.699TyrAsp: 2.699 ± 0.441
4.45TyrGlu: 4.45 ± 0.621
1.897TyrPhe: 1.897 ± 0.376
3.939TyrGly: 3.939 ± 0.578
0.584TyrHis: 0.584 ± 0.202
2.334TyrIle: 2.334 ± 0.408
2.991TyrLys: 2.991 ± 0.497
3.21TyrLeu: 3.21 ± 0.378
1.167TyrMet: 1.167 ± 0.263
2.699TyrAsn: 2.699 ± 0.517
0.73TyrPro: 0.73 ± 0.234
1.313TyrGln: 1.313 ± 0.337
1.751TyrArg: 1.751 ± 0.402
2.991TyrSer: 2.991 ± 0.434
2.48TyrThr: 2.48 ± 0.518
2.48TyrVal: 2.48 ± 0.409
0.511TyrTrp: 0.511 ± 0.175
1.532TyrTyr: 1.532 ± 0.396
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (13709 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski