Amino acid dipeptide frequency for Bacteriophage Reminis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
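As a minimal sketch of how such frequencies could be computed, the Python function below counts overlapping residue pairs within each protein and reports them in per mille. The function name, the mapping of non-standard letters to X (Xaa), and the normalization by the total number of pairs are assumptions made for illustration; the database may count or normalize differently.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    `proteins` is an iterable of one-letter sequences (hypothetical input
    format). Pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}
```

For the toy input `["AAC"]` there are two pairs, AA and AC, so each gets 500.0 per mille.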

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
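The uniform expectation and the over/under comparison can be sketched in Python as follows. The function names are hypothetical, and the exact rule the page uses for coloring cells is not stated, so the simple comparison against the uniform value is an assumption.

```python
def expected_permille(n_residues=20):
    """Uniform expectation per dipeptide, in per mille.

    With 20 residues there are 20 * 20 = 400 ordered pairs, so each pair is
    expected at 1000 / 400 = 2.5 per mille (0.25 %).
    """
    return 1000.0 / n_residues ** 2


def representation(freq_permille, expected=None):
    """Classify a frequency relative to the uniform expectation.

    Assumption: a plain greater/less comparison, with no error margin.
    """
    if expected is None:
        expected = expected_permille()
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With the values listed in the table, `representation(9.107)` (AlaAla) gives `"over"` and `representation(0.072)` (CysTrp) gives `"under"`. Passing `n_residues=21` to `expected_permille` gives the expectation when Xaa is counted as a 21st residue.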

All values are given in per mille (‰); multiply by 10^-3 to convert them to fractions.
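As a worked example of this conversion, using the AlaAla value listed in the table:

```latex
9.107\ \text{per mille} \;=\; 9.107 \times 10^{-3} \;=\; 0.009107 \;=\; 0.9107\,\%
```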

For more information, see the "Sequence space" article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.107 ± 1.238
AlaCys: 0.578 ± 0.192
AlaAsp: 5.421 ± 0.533
AlaGlu: 6.143 ± 0.904
AlaPhe: 3.18 ± 0.532
AlaGly: 5.71 ± 0.654
AlaHis: 1.373 ± 0.307
AlaIle: 5.565 ± 0.666
AlaLys: 5.854 ± 0.712
AlaLeu: 7.517 ± 0.666
AlaMet: 2.819 ± 0.513
AlaAsn: 3.397 ± 0.505
AlaPro: 2.241 ± 0.327
AlaGln: 3.686 ± 0.768
AlaArg: 3.469 ± 0.625
AlaSer: 6.071 ± 0.772
AlaThr: 4.12 ± 0.637
AlaVal: 4.626 ± 0.758
AlaTrp: 1.662 ± 0.343
AlaTyr: 2.891 ± 0.386
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.578 ± 0.207
CysCys: 0.289 ± 0.145
CysAsp: 0.506 ± 0.187
CysGlu: 0.506 ± 0.22
CysPhe: 0.434 ± 0.209
CysGly: 0.217 ± 0.119
CysHis: 0.217 ± 0.132
CysIle: 0.723 ± 0.231
CysLys: 0.65 ± 0.188
CysLeu: 0.867 ± 0.304
CysMet: 0.289 ± 0.13
CysAsn: 0.506 ± 0.213
CysPro: 0.578 ± 0.185
CysGln: 0.361 ± 0.183
CysArg: 0.506 ± 0.2
CysSer: 0.795 ± 0.253
CysThr: 0.723 ± 0.261
CysVal: 0.65 ± 0.245
CysTrp: 0.072 ± 0.08
CysTyr: 0.506 ± 0.179
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.565 ± 0.653
AspCys: 0.506 ± 0.245
AspAsp: 3.252 ± 0.579
AspGlu: 4.987 ± 0.707
AspPhe: 1.951 ± 0.39
AspGly: 5.204 ± 0.619
AspHis: 1.084 ± 0.319
AspIle: 3.108 ± 0.472
AspLys: 4.192 ± 0.651
AspLeu: 5.276 ± 0.595
AspMet: 1.59 ± 0.356
AspAsn: 2.963 ± 0.656
AspPro: 2.602 ± 0.407
AspGln: 1.373 ± 0.373
AspArg: 2.385 ± 0.425
AspSer: 3.397 ± 0.667
AspThr: 3.469 ± 0.526
AspVal: 3.614 ± 0.493
AspTrp: 1.518 ± 0.373
AspTyr: 2.891 ± 0.502
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.927 ± 1.046
GluCys: 0.65 ± 0.253
GluAsp: 4.047 ± 0.595
GluGlu: 4.409 ± 0.742
GluPhe: 2.602 ± 0.453
GluGly: 4.409 ± 0.484
GluHis: 1.301 ± 0.355
GluIle: 3.252 ± 0.332
GluLys: 3.758 ± 0.564
GluLeu: 6.288 ± 0.761
GluMet: 2.457 ± 0.387
GluAsn: 1.518 ± 0.345
GluPro: 2.313 ± 0.41
GluGln: 4.047 ± 0.622
GluArg: 2.891 ± 0.521
GluSer: 3.614 ± 0.494
GluThr: 2.819 ± 0.42
GluVal: 5.348 ± 0.613
GluTrp: 1.084 ± 0.471
GluTyr: 1.879 ± 0.36
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.168 ± 0.411
PheCys: 0.145 ± 0.127
PheAsp: 2.602 ± 0.484
PheGlu: 1.879 ± 0.335
PhePhe: 1.156 ± 0.313
PheGly: 2.674 ± 0.443
PheHis: 0.578 ± 0.25
PheIle: 2.602 ± 0.564
PheLys: 2.674 ± 0.54
PheLeu: 2.457 ± 0.456
PheMet: 1.951 ± 0.358
PheAsn: 2.53 ± 0.327
PhePro: 1.156 ± 0.303
PheGln: 0.94 ± 0.292
PheArg: 1.807 ± 0.387
PheSer: 2.168 ± 0.492
PheThr: 2.096 ± 0.36
PheVal: 2.385 ± 0.377
PheTrp: 0.361 ± 0.153
PheTyr: 1.518 ± 0.302
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.999 ± 0.724
GlyCys: 0.94 ± 0.238
GlyAsp: 4.047 ± 0.605
GlyGlu: 4.553 ± 0.498
GlyPhe: 2.674 ± 0.457
GlyGly: 4.481 ± 0.509
GlyHis: 1.59 ± 0.299
GlyIle: 3.541 ± 0.509
GlyLys: 6.722 ± 0.756
GlyLeu: 5.71 ± 0.692
GlyMet: 1.951 ± 0.338
GlyAsn: 3.252 ± 0.408
GlyPro: 0.506 ± 0.199
GlyGln: 2.313 ± 0.595
GlyArg: 4.481 ± 0.558
GlySer: 4.12 ± 0.544
GlyThr: 4.987 ± 0.858
GlyVal: 4.698 ± 0.664
GlyTrp: 1.084 ± 0.244
GlyTyr: 2.891 ± 0.422
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.084 ± 0.254
HisCys: 0.217 ± 0.152
HisAsp: 1.373 ± 0.369
HisGlu: 1.012 ± 0.328
HisPhe: 0.65 ± 0.183
HisGly: 1.446 ± 0.337
HisHis: 0.434 ± 0.187
HisIle: 1.446 ± 0.305
HisLys: 1.807 ± 0.47
HisLeu: 2.241 ± 0.341
HisMet: 0.434 ± 0.202
HisAsn: 0.578 ± 0.193
HisPro: 0.506 ± 0.214
HisGln: 0.578 ± 0.135
HisArg: 1.301 ± 0.347
HisSer: 0.65 ± 0.174
HisThr: 1.446 ± 0.403
HisVal: 1.373 ± 0.325
HisTrp: 0.434 ± 0.17
HisTyr: 0.94 ± 0.266
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.698 ± 0.551
IleCys: 0.65 ± 0.252
IleAsp: 3.975 ± 0.509
IleGlu: 3.541 ± 0.526
IlePhe: 1.518 ± 0.346
IleGly: 3.252 ± 0.45
IleHis: 2.096 ± 0.376
IleIle: 2.602 ± 0.41
IleLys: 4.481 ± 0.55
IleLeu: 4.192 ± 0.647
IleMet: 1.446 ± 0.288
IleAsn: 3.614 ± 0.498
IlePro: 2.024 ± 0.365
IleGln: 2.241 ± 0.386
IleArg: 2.602 ± 0.357
IleSer: 3.397 ± 0.499
IleThr: 3.975 ± 0.704
IleVal: 2.674 ± 0.38
IleTrp: 0.578 ± 0.208
IleTyr: 2.168 ± 0.301
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.432 ± 0.912
LysCys: 0.506 ± 0.197
LysAsp: 3.614 ± 0.648
LysGlu: 5.059 ± 0.688
LysPhe: 2.602 ± 0.34
LysGly: 5.132 ± 0.672
LysHis: 1.59 ± 0.484
LysIle: 3.758 ± 0.508
LysLys: 3.541 ± 0.674
LysLeu: 6.649 ± 0.762
LysMet: 2.602 ± 0.481
LysAsn: 1.735 ± 0.368
LysPro: 2.385 ± 0.41
LysGln: 3.614 ± 0.739
LysArg: 3.686 ± 0.586
LysSer: 3.831 ± 0.53
LysThr: 3.036 ± 0.447
LysVal: 5.348 ± 0.597
LysTrp: 0.65 ± 0.235
LysTyr: 2.602 ± 0.425
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.733 ± 0.761
LeuCys: 0.506 ± 0.212
LeuAsp: 5.637 ± 0.585
LeuGlu: 5.493 ± 0.688
LeuPhe: 3.252 ± 0.554
LeuGly: 5.854 ± 0.577
LeuHis: 1.59 ± 0.335
LeuIle: 3.831 ± 0.49
LeuLys: 4.987 ± 0.606
LeuLeu: 6.143 ± 0.748
LeuMet: 2.313 ± 0.395
LeuAsn: 3.975 ± 0.432
LeuPro: 3.614 ± 0.53
LeuGln: 3.397 ± 0.564
LeuArg: 3.758 ± 0.573
LeuSer: 6.649 ± 0.754
LeuThr: 5.782 ± 0.75
LeuVal: 5.493 ± 0.534
LeuTrp: 0.65 ± 0.253
LeuTyr: 2.674 ± 0.59
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.746 ± 0.427
MetCys: 0.145 ± 0.093
MetAsp: 1.735 ± 0.306
MetGlu: 2.241 ± 0.537
MetPhe: 0.578 ± 0.184
MetGly: 2.313 ± 0.417
MetHis: 0.65 ± 0.254
MetIle: 1.012 ± 0.292
MetLys: 2.891 ± 0.514
MetLeu: 2.674 ± 0.337
MetMet: 1.084 ± 0.343
MetAsn: 1.012 ± 0.261
MetPro: 1.373 ± 0.328
MetGln: 1.301 ± 0.352
MetArg: 1.735 ± 0.294
MetSer: 3.325 ± 0.488
MetThr: 1.446 ± 0.33
MetVal: 1.735 ± 0.253
MetTrp: 0.217 ± 0.117
MetTyr: 1.229 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.457 ± 0.394
AsnCys: 0.506 ± 0.248
AsnAsp: 2.024 ± 0.394
AsnGlu: 2.168 ± 0.412
AsnPhe: 2.096 ± 0.445
AsnGly: 3.325 ± 0.47
AsnHis: 0.94 ± 0.252
AsnIle: 2.602 ± 0.455
AsnLys: 3.686 ± 0.457
AsnLeu: 3.758 ± 0.581
AsnMet: 1.301 ± 0.278
AsnAsn: 2.096 ± 0.462
AsnPro: 2.313 ± 0.407
AsnGln: 2.168 ± 0.344
AsnArg: 2.53 ± 0.492
AsnSer: 2.891 ± 0.511
AsnThr: 2.313 ± 0.317
AsnVal: 2.891 ± 0.492
AsnTrp: 0.94 ± 0.259
AsnTyr: 1.59 ± 0.397
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.036 ± 0.433
ProCys: 0.578 ± 0.229
ProAsp: 3.252 ± 0.545
ProGlu: 3.252 ± 0.558
ProPhe: 1.156 ± 0.367
ProGly: 1.373 ± 0.307
ProHis: 0.723 ± 0.227
ProIle: 1.446 ± 0.484
ProLys: 1.807 ± 0.417
ProLeu: 2.457 ± 0.499
ProMet: 0.65 ± 0.244
ProAsn: 1.156 ± 0.33
ProPro: 1.012 ± 0.238
ProGln: 1.662 ± 0.287
ProArg: 0.94 ± 0.259
ProSer: 2.096 ± 0.368
ProThr: 2.241 ± 0.46
ProVal: 3.469 ± 0.451
ProTrp: 0.578 ± 0.194
ProTyr: 1.446 ± 0.224
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.481 ± 0.834
GlnCys: 0.289 ± 0.127
GlnAsp: 2.096 ± 0.366
GlnGlu: 2.746 ± 0.488
GlnPhe: 1.879 ± 0.321
GlnGly: 3.036 ± 0.661
GlnHis: 1.012 ± 0.237
GlnIle: 2.457 ± 0.385
GlnLys: 1.951 ± 0.421
GlnLeu: 4.409 ± 0.643
GlnMet: 2.024 ± 0.42
GlnAsn: 1.735 ± 0.455
GlnPro: 1.012 ± 0.269
GlnGln: 3.325 ± 0.669
GlnArg: 1.879 ± 0.405
GlnSer: 2.819 ± 0.614
GlnThr: 1.879 ± 0.317
GlnVal: 3.397 ± 0.595
GlnTrp: 0.506 ± 0.158
GlnTyr: 1.301 ± 0.256
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.192 ± 0.672
ArgCys: 0.506 ± 0.239
ArgAsp: 3.108 ± 0.409
ArgGlu: 2.819 ± 0.419
ArgPhe: 1.59 ± 0.42
ArgGly: 3.036 ± 0.477
ArgHis: 0.94 ± 0.274
ArgIle: 2.457 ± 0.365
ArgLys: 3.903 ± 0.564
ArgLeu: 3.758 ± 0.533
ArgMet: 1.879 ± 0.369
ArgAsn: 2.096 ± 0.406
ArgPro: 1.229 ± 0.325
ArgGln: 2.168 ± 0.343
ArgArg: 1.518 ± 0.33
ArgSer: 2.963 ± 0.565
ArgThr: 2.024 ± 0.38
ArgVal: 3.831 ± 0.513
ArgTrp: 0.795 ± 0.222
ArgTyr: 1.951 ± 0.39
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.204 ± 0.934
SerCys: 0.361 ± 0.204
SerAsp: 3.758 ± 0.506
SerGlu: 3.614 ± 0.542
SerPhe: 1.59 ± 0.258
SerGly: 5.276 ± 0.671
SerHis: 0.723 ± 0.226
SerIle: 4.481 ± 0.643
SerLys: 4.337 ± 0.524
SerLeu: 5.854 ± 1.052
SerMet: 1.518 ± 0.356
SerAsn: 3.108 ± 0.644
SerPro: 2.385 ± 0.437
SerGln: 3.758 ± 0.695
SerArg: 3.036 ± 0.416
SerSer: 4.12 ± 0.609
SerThr: 3.831 ± 0.589
SerVal: 4.337 ± 0.531
SerTrp: 0.867 ± 0.302
SerTyr: 2.891 ± 0.47
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.264 ± 0.679
ThrCys: 0.867 ± 0.247
ThrAsp: 3.325 ± 0.325
ThrGlu: 3.541 ± 0.606
ThrPhe: 2.024 ± 0.409
ThrGly: 5.421 ± 0.807
ThrHis: 1.084 ± 0.264
ThrIle: 3.541 ± 0.536
ThrLys: 3.903 ± 0.554
ThrLeu: 4.481 ± 0.425
ThrMet: 1.662 ± 0.337
ThrAsn: 2.53 ± 0.478
ThrPro: 2.313 ± 0.388
ThrGln: 2.024 ± 0.361
ThrArg: 2.385 ± 0.437
ThrSer: 3.831 ± 0.71
ThrThr: 4.192 ± 0.694
ThrVal: 3.252 ± 0.566
ThrTrp: 0.795 ± 0.224
ThrTyr: 2.963 ± 0.517
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.782 ± 0.568
ValCys: 0.867 ± 0.223
ValAsp: 4.047 ± 0.62
ValGlu: 3.686 ± 0.413
ValPhe: 2.746 ± 0.405
ValGly: 4.77 ± 0.585
ValHis: 0.94 ± 0.294
ValIle: 4.12 ± 0.556
ValLys: 3.975 ± 0.567
ValLeu: 3.903 ± 0.521
ValMet: 2.024 ± 0.377
ValAsn: 3.831 ± 0.529
ValPro: 2.53 ± 0.417
ValGln: 2.891 ± 0.503
ValArg: 3.397 ± 0.57
ValSer: 4.626 ± 0.661
ValThr: 5.204 ± 0.793
ValVal: 4.409 ± 0.554
ValTrp: 1.012 ± 0.262
ValTyr: 2.819 ± 0.486
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.373 ± 0.381
TrpCys: 0.361 ± 0.17
TrpAsp: 0.795 ± 0.208
TrpGlu: 0.723 ± 0.293
TrpPhe: 0.65 ± 0.219
TrpGly: 1.084 ± 0.282
TrpHis: 0.289 ± 0.141
TrpIle: 0.723 ± 0.253
TrpLys: 1.301 ± 0.236
TrpLeu: 1.301 ± 0.287
TrpMet: 0.289 ± 0.153
TrpAsn: 0.65 ± 0.219
TrpPro: 0.506 ± 0.313
TrpGln: 0.795 ± 0.204
TrpArg: 0.361 ± 0.16
TrpSer: 1.156 ± 0.385
TrpThr: 0.361 ± 0.176
TrpVal: 1.156 ± 0.217
TrpTrp: 0.217 ± 0.128
TrpTyr: 0.217 ± 0.109
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.602 ± 0.459
TyrCys: 0.506 ± 0.2
TyrAsp: 2.313 ± 0.402
TyrGlu: 2.241 ± 0.37
TyrPhe: 1.518 ± 0.316
TyrGly: 2.602 ± 0.423
TyrHis: 0.795 ± 0.205
TyrIle: 2.457 ± 0.429
TyrLys: 1.951 ± 0.444
TyrLeu: 3.397 ± 0.471
TyrMet: 1.084 ± 0.291
TyrAsn: 2.313 ± 0.369
TyrPro: 1.735 ± 0.399
TyrGln: 1.59 ± 0.4
TyrArg: 2.096 ± 0.42
TyrSer: 2.53 ± 0.494
TyrThr: 2.457 ± 0.421
TyrVal: 2.891 ± 0.589
TyrTrp: 0.289 ± 0.127
TyrTyr: 1.229 ± 0.355
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (13837 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
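A protein-level bootstrap of this kind can be sketched in Python as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the use of the sample standard deviation, and the normalization by the number of overlapping pairs are assumptions for illustration, not the database's confirmed procedure.

```python
import random
from statistics import stdev


def pair_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    protein (not the residue) is the unit of resampling.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resampled, dipeptide))
    # Assumption: the reported error is the standard deviation of replicates.
    return stdev(estimates)
```

If every protein in the input is identical, all replicates agree and the estimated error is 0.0; the more the proteins differ in composition, the larger the error becomes.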

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski