Amino acid dipepetide frequency for Enterococcus phage vB_EfaS_Ef6.4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.557AlaAla: 0.557 ± 0.281
0.159AlaCys: 0.159 ± 0.113
3.424AlaAsp: 3.424 ± 0.488
4.539AlaGlu: 4.539 ± 0.688
2.548AlaPhe: 2.548 ± 0.393
3.504AlaGly: 3.504 ± 0.552
0.956AlaHis: 0.956 ± 0.241
5.017AlaIle: 5.017 ± 0.773
6.37AlaLys: 6.37 ± 1.0
4.857AlaLeu: 4.857 ± 0.727
2.389AlaMet: 2.389 ± 0.571
3.265AlaAsn: 3.265 ± 0.585
1.991AlaPro: 1.991 ± 0.381
1.593AlaGln: 1.593 ± 0.303
1.513AlaArg: 1.513 ± 0.257
2.867AlaSer: 2.867 ± 0.527
4.698AlaThr: 4.698 ± 0.656
4.141AlaVal: 4.141 ± 0.477
0.478AlaTrp: 0.478 ± 0.224
2.787AlaTyr: 2.787 ± 0.35
0.0AlaXaa: 0.0 ± 0.0
Cys
0.319CysAla: 0.319 ± 0.135
0.0CysCys: 0.0 ± 0.0
0.398CysAsp: 0.398 ± 0.183
0.557CysGlu: 0.557 ± 0.24
0.239CysPhe: 0.239 ± 0.157
0.239CysGly: 0.239 ± 0.132
0.159CysHis: 0.159 ± 0.11
0.398CysIle: 0.398 ± 0.207
0.637CysLys: 0.637 ± 0.232
0.319CysLeu: 0.319 ± 0.173
0.239CysMet: 0.239 ± 0.147
0.398CysAsn: 0.398 ± 0.164
0.0CysPro: 0.0 ± 0.0
0.159CysGln: 0.159 ± 0.106
0.159CysArg: 0.159 ± 0.1
0.478CysSer: 0.478 ± 0.162
0.557CysThr: 0.557 ± 0.272
0.398CysVal: 0.398 ± 0.189
0.159CysTrp: 0.159 ± 0.098
0.239CysTyr: 0.239 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
2.946AspAla: 2.946 ± 0.556
0.159AspCys: 0.159 ± 0.103
2.628AspAsp: 2.628 ± 0.56
4.778AspGlu: 4.778 ± 0.676
2.707AspPhe: 2.707 ± 0.443
5.176AspGly: 5.176 ± 0.65
0.637AspHis: 0.637 ± 0.252
4.141AspIle: 4.141 ± 0.612
6.609AspLys: 6.609 ± 0.759
5.415AspLeu: 5.415 ± 0.525
1.593AspMet: 1.593 ± 0.359
3.822AspAsn: 3.822 ± 0.566
2.15AspPro: 2.15 ± 0.407
1.593AspGln: 1.593 ± 0.28
1.752AspArg: 1.752 ± 0.349
3.424AspSer: 3.424 ± 0.547
3.743AspThr: 3.743 ± 0.498
5.017AspVal: 5.017 ± 0.627
0.796AspTrp: 0.796 ± 0.249
3.106AspTyr: 3.106 ± 0.568
0.0AspXaa: 0.0 ± 0.0
Glu
4.3GluAla: 4.3 ± 0.547
0.319GluCys: 0.319 ± 0.135
5.096GluAsp: 5.096 ± 0.774
6.45GluGlu: 6.45 ± 0.933
3.265GluPhe: 3.265 ± 0.596
4.061GluGly: 4.061 ± 0.593
1.194GluHis: 1.194 ± 0.301
4.141GluIle: 4.141 ± 0.657
7.007GluLys: 7.007 ± 0.662
9.078GluLeu: 9.078 ± 0.903
2.946GluMet: 2.946 ± 0.545
4.061GluAsn: 4.061 ± 0.524
3.106GluPro: 3.106 ± 0.579
3.424GluGln: 3.424 ± 0.742
3.424GluArg: 3.424 ± 0.524
3.504GluSer: 3.504 ± 0.61
5.256GluThr: 5.256 ± 0.612
6.37GluVal: 6.37 ± 0.807
1.593GluTrp: 1.593 ± 0.292
3.185GluTyr: 3.185 ± 0.617
0.0GluXaa: 0.0 ± 0.0
Phe
1.832PheAla: 1.832 ± 0.331
0.239PheCys: 0.239 ± 0.112
2.548PheAsp: 2.548 ± 0.644
2.867PheGlu: 2.867 ± 0.643
0.956PhePhe: 0.956 ± 0.253
2.787PheGly: 2.787 ± 0.46
0.239PheHis: 0.239 ± 0.141
4.3PheIle: 4.3 ± 0.591
4.459PheLys: 4.459 ± 0.621
2.15PheLeu: 2.15 ± 0.35
1.035PheMet: 1.035 ± 0.222
2.867PheAsn: 2.867 ± 0.386
0.796PhePro: 0.796 ± 0.213
1.672PheGln: 1.672 ± 0.454
1.513PheArg: 1.513 ± 0.284
2.548PheSer: 2.548 ± 0.442
4.22PheThr: 4.22 ± 0.639
2.469PheVal: 2.469 ± 0.529
0.478PheTrp: 0.478 ± 0.182
1.035PheTyr: 1.035 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
3.982GlyAla: 3.982 ± 1.17
0.478GlyCys: 0.478 ± 0.195
3.424GlyAsp: 3.424 ± 0.532
4.539GlyGlu: 4.539 ± 0.495
3.185GlyPhe: 3.185 ± 0.415
4.38GlyGly: 4.38 ± 0.962
0.796GlyHis: 0.796 ± 0.198
5.256GlyIle: 5.256 ± 0.907
6.769GlyLys: 6.769 ± 0.755
4.937GlyLeu: 4.937 ± 0.673
1.593GlyMet: 1.593 ± 0.345
4.061GlyAsn: 4.061 ± 0.448
0.956GlyPro: 0.956 ± 0.342
1.911GlyGln: 1.911 ± 0.344
2.07GlyArg: 2.07 ± 0.344
3.265GlySer: 3.265 ± 0.47
4.3GlyThr: 4.3 ± 0.797
4.3GlyVal: 4.3 ± 0.688
1.115GlyTrp: 1.115 ± 0.311
3.344GlyTyr: 3.344 ± 0.628
0.0GlyXaa: 0.0 ± 0.0
His
0.956HisAla: 0.956 ± 0.24
0.159HisCys: 0.159 ± 0.128
0.717HisAsp: 0.717 ± 0.235
1.433HisGlu: 1.433 ± 0.311
1.035HisPhe: 1.035 ± 0.346
1.115HisGly: 1.115 ± 0.313
0.239HisHis: 0.239 ± 0.139
0.557HisIle: 0.557 ± 0.168
1.274HisLys: 1.274 ± 0.359
0.876HisLeu: 0.876 ± 0.263
0.398HisMet: 0.398 ± 0.176
1.194HisAsn: 1.194 ± 0.348
0.239HisPro: 0.239 ± 0.14
0.319HisGln: 0.319 ± 0.161
0.478HisArg: 0.478 ± 0.154
0.398HisSer: 0.398 ± 0.179
0.876HisThr: 0.876 ± 0.348
1.035HisVal: 1.035 ± 0.293
0.159HisTrp: 0.159 ± 0.124
0.876HisTyr: 0.876 ± 0.246
0.0HisXaa: 0.0 ± 0.0
Ile
3.982IleAla: 3.982 ± 0.515
0.557IleCys: 0.557 ± 0.197
5.176IleAsp: 5.176 ± 0.495
7.007IleGlu: 7.007 ± 0.873
1.991IlePhe: 1.991 ± 0.395
4.857IleGly: 4.857 ± 0.747
0.796IleHis: 0.796 ± 0.264
3.663IleIle: 3.663 ± 0.415
6.052IleLys: 6.052 ± 0.839
5.654IleLeu: 5.654 ± 0.715
1.115IleMet: 1.115 ± 0.299
4.3IleAsn: 4.3 ± 0.789
2.548IlePro: 2.548 ± 0.507
2.469IleGln: 2.469 ± 0.371
1.672IleArg: 1.672 ± 0.285
3.504IleSer: 3.504 ± 0.597
4.619IleThr: 4.619 ± 0.396
4.22IleVal: 4.22 ± 0.613
0.717IleTrp: 0.717 ± 0.276
2.15IleTyr: 2.15 ± 0.455
0.0IleXaa: 0.0 ± 0.0
Lys
6.609LysAla: 6.609 ± 0.822
0.398LysCys: 0.398 ± 0.196
5.893LysAsp: 5.893 ± 0.634
8.52LysGlu: 8.52 ± 0.931
3.265LysPhe: 3.265 ± 0.494
5.733LysGly: 5.733 ± 0.829
1.433LysHis: 1.433 ± 0.346
5.415LysIle: 5.415 ± 0.56
6.848LysLys: 6.848 ± 0.914
6.609LysLeu: 6.609 ± 0.786
3.265LysMet: 3.265 ± 0.494
5.256LysAsn: 5.256 ± 0.669
3.026LysPro: 3.026 ± 0.559
4.061LysGln: 4.061 ± 0.56
3.743LysArg: 3.743 ± 0.605
4.539LysSer: 4.539 ± 0.736
5.415LysThr: 5.415 ± 0.629
6.37LysVal: 6.37 ± 0.783
1.035LysTrp: 1.035 ± 0.321
4.061LysTyr: 4.061 ± 0.663
0.0LysXaa: 0.0 ± 0.0
Leu
4.857LeuAla: 4.857 ± 0.787
0.478LeuCys: 0.478 ± 0.204
6.45LeuAsp: 6.45 ± 0.709
8.6LeuGlu: 8.6 ± 0.84
3.902LeuPhe: 3.902 ± 0.433
4.778LeuGly: 4.778 ± 0.737
0.717LeuHis: 0.717 ± 0.245
4.698LeuIle: 4.698 ± 0.781
6.211LeuLys: 6.211 ± 0.733
6.37LeuLeu: 6.37 ± 0.948
2.389LeuMet: 2.389 ± 0.432
6.689LeuAsn: 6.689 ± 0.774
2.469LeuPro: 2.469 ± 0.533
3.822LeuGln: 3.822 ± 0.451
2.628LeuArg: 2.628 ± 0.422
3.663LeuSer: 3.663 ± 0.468
4.3LeuThr: 4.3 ± 0.568
5.574LeuVal: 5.574 ± 0.783
1.035LeuTrp: 1.035 ± 0.257
2.707LeuTyr: 2.707 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
1.433MetAla: 1.433 ± 0.334
0.239MetCys: 0.239 ± 0.157
1.991MetAsp: 1.991 ± 0.363
2.23MetGlu: 2.23 ± 0.414
1.274MetPhe: 1.274 ± 0.426
1.274MetGly: 1.274 ± 0.313
0.08MetHis: 0.08 ± 0.065
1.274MetIle: 1.274 ± 0.257
2.946MetLys: 2.946 ± 0.524
2.548MetLeu: 2.548 ± 0.534
0.319MetMet: 0.319 ± 0.209
1.752MetAsn: 1.752 ± 0.392
0.796MetPro: 0.796 ± 0.271
1.035MetGln: 1.035 ± 0.272
1.593MetArg: 1.593 ± 0.332
1.274MetSer: 1.274 ± 0.325
2.23MetThr: 2.23 ± 0.482
1.672MetVal: 1.672 ± 0.506
0.398MetTrp: 0.398 ± 0.215
1.513MetTyr: 1.513 ± 0.362
0.0MetXaa: 0.0 ± 0.0
Asn
4.539AsnAla: 4.539 ± 0.72
0.159AsnCys: 0.159 ± 0.111
3.424AsnAsp: 3.424 ± 0.494
5.972AsnGlu: 5.972 ± 0.678
1.672AsnPhe: 1.672 ± 0.33
6.609AsnGly: 6.609 ± 0.708
0.717AsnHis: 0.717 ± 0.227
4.141AsnIle: 4.141 ± 0.711
5.893AsnLys: 5.893 ± 0.635
4.38AsnLeu: 4.38 ± 0.506
2.07AsnMet: 2.07 ± 0.391
4.061AsnAsn: 4.061 ± 0.651
1.513AsnPro: 1.513 ± 0.311
1.911AsnGln: 1.911 ± 0.344
1.433AsnArg: 1.433 ± 0.403
3.106AsnSer: 3.106 ± 0.467
5.495AsnThr: 5.495 ± 0.632
3.344AsnVal: 3.344 ± 0.434
0.717AsnTrp: 0.717 ± 0.236
2.548AsnTyr: 2.548 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
1.911ProAla: 1.911 ± 0.443
0.239ProCys: 0.239 ± 0.17
2.15ProAsp: 2.15 ± 0.429
2.548ProGlu: 2.548 ± 0.385
1.194ProPhe: 1.194 ± 0.254
0.0ProGly: 0.0 ± 0.0
0.239ProHis: 0.239 ± 0.126
1.991ProIle: 1.991 ± 0.313
2.469ProLys: 2.469 ± 0.413
3.106ProLeu: 3.106 ± 0.468
0.717ProMet: 0.717 ± 0.211
1.911ProAsn: 1.911 ± 0.471
0.398ProPro: 0.398 ± 0.199
1.593ProGln: 1.593 ± 0.35
0.557ProArg: 0.557 ± 0.179
1.513ProSer: 1.513 ± 0.391
2.07ProThr: 2.07 ± 0.381
2.309ProVal: 2.309 ± 0.367
0.239ProTrp: 0.239 ± 0.112
1.672ProTyr: 1.672 ± 0.376
0.0ProXaa: 0.0 ± 0.0
Gln
2.389GlnAla: 2.389 ± 0.532
0.319GlnCys: 0.319 ± 0.176
1.911GlnAsp: 1.911 ± 0.35
1.911GlnGlu: 1.911 ± 0.386
1.672GlnPhe: 1.672 ± 0.411
1.752GlnGly: 1.752 ± 0.348
0.796GlnHis: 0.796 ± 0.244
2.309GlnIle: 2.309 ± 0.306
1.752GlnLys: 1.752 ± 0.269
3.743GlnLeu: 3.743 ± 0.546
1.035GlnMet: 1.035 ± 0.273
1.752GlnAsn: 1.752 ± 0.288
1.274GlnPro: 1.274 ± 0.256
1.911GlnGln: 1.911 ± 0.387
1.911GlnArg: 1.911 ± 0.473
2.15GlnSer: 2.15 ± 0.376
1.991GlnThr: 1.991 ± 0.328
2.469GlnVal: 2.469 ± 0.499
0.478GlnTrp: 0.478 ± 0.341
2.628GlnTyr: 2.628 ± 0.422
0.0GlnXaa: 0.0 ± 0.0
Arg
1.513ArgAla: 1.513 ± 0.391
0.557ArgCys: 0.557 ± 0.199
2.389ArgAsp: 2.389 ± 0.421
2.15ArgGlu: 2.15 ± 0.398
1.593ArgPhe: 1.593 ± 0.34
1.752ArgGly: 1.752 ± 0.315
0.637ArgHis: 0.637 ± 0.241
2.389ArgIle: 2.389 ± 0.436
3.106ArgLys: 3.106 ± 0.607
3.504ArgLeu: 3.504 ± 0.661
0.796ArgMet: 0.796 ± 0.191
2.23ArgAsn: 2.23 ± 0.382
0.956ArgPro: 0.956 ± 0.237
0.876ArgGln: 0.876 ± 0.24
0.796ArgArg: 0.796 ± 0.198
1.433ArgSer: 1.433 ± 0.348
1.832ArgThr: 1.832 ± 0.363
2.07ArgVal: 2.07 ± 0.396
0.398ArgTrp: 0.398 ± 0.204
1.752ArgTyr: 1.752 ± 0.299
0.0ArgXaa: 0.0 ± 0.0
Ser
2.946SerAla: 2.946 ± 0.542
0.159SerCys: 0.159 ± 0.109
2.787SerAsp: 2.787 ± 0.424
3.424SerGlu: 3.424 ± 0.586
2.469SerPhe: 2.469 ± 0.343
4.698SerGly: 4.698 ± 0.842
1.433SerHis: 1.433 ± 0.33
3.504SerIle: 3.504 ± 0.5
4.619SerLys: 4.619 ± 0.708
3.185SerLeu: 3.185 ± 0.642
1.274SerMet: 1.274 ± 0.397
3.583SerAsn: 3.583 ± 0.394
0.796SerPro: 0.796 ± 0.202
2.07SerGln: 2.07 ± 0.408
1.354SerArg: 1.354 ± 0.322
2.469SerSer: 2.469 ± 0.338
3.902SerThr: 3.902 ± 0.503
3.583SerVal: 3.583 ± 0.496
0.717SerTrp: 0.717 ± 0.271
2.23SerTyr: 2.23 ± 0.526
0.0SerXaa: 0.0 ± 0.0
Thr
3.743ThrAla: 3.743 ± 0.648
0.239ThrCys: 0.239 ± 0.163
3.504ThrAsp: 3.504 ± 0.555
4.3ThrGlu: 4.3 ± 0.601
2.548ThrPhe: 2.548 ± 0.441
4.857ThrGly: 4.857 ± 0.713
1.433ThrHis: 1.433 ± 0.306
4.619ThrIle: 4.619 ± 0.508
7.087ThrLys: 7.087 ± 0.715
5.813ThrLeu: 5.813 ± 0.912
1.593ThrMet: 1.593 ± 0.413
4.061ThrAsn: 4.061 ± 0.528
2.389ThrPro: 2.389 ± 0.407
2.469ThrGln: 2.469 ± 0.481
2.389ThrArg: 2.389 ± 0.504
2.867ThrSer: 2.867 ± 0.599
4.619ThrThr: 4.619 ± 0.99
4.778ThrVal: 4.778 ± 0.56
0.796ThrTrp: 0.796 ± 0.38
2.548ThrTyr: 2.548 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
5.574ValAla: 5.574 ± 0.597
0.637ValCys: 0.637 ± 0.26
4.22ValAsp: 4.22 ± 0.52
4.38ValGlu: 4.38 ± 0.509
2.946ValPhe: 2.946 ± 0.433
3.822ValGly: 3.822 ± 0.61
0.956ValHis: 0.956 ± 0.308
4.857ValIle: 4.857 ± 0.618
6.848ValLys: 6.848 ± 1.103
5.176ValLeu: 5.176 ± 0.674
2.15ValMet: 2.15 ± 0.345
3.982ValAsn: 3.982 ± 0.54
2.07ValPro: 2.07 ± 0.359
1.752ValGln: 1.752 ± 0.39
2.07ValArg: 2.07 ± 0.392
5.654ValSer: 5.654 ± 0.728
3.106ValThr: 3.106 ± 0.733
4.778ValVal: 4.778 ± 0.718
0.956ValTrp: 0.956 ± 0.335
2.548ValTyr: 2.548 ± 0.528
0.0ValXaa: 0.0 ± 0.0
Trp
0.557TrpAla: 0.557 ± 0.217
0.08TrpCys: 0.08 ± 0.087
0.717TrpAsp: 0.717 ± 0.286
1.433TrpGlu: 1.433 ± 0.335
0.956TrpPhe: 0.956 ± 0.208
0.876TrpGly: 0.876 ± 0.29
0.239TrpHis: 0.239 ± 0.145
0.876TrpIle: 0.876 ± 0.293
1.035TrpLys: 1.035 ± 0.274
0.876TrpLeu: 0.876 ± 0.269
0.08TrpMet: 0.08 ± 0.078
0.717TrpAsn: 0.717 ± 0.25
0.0TrpPro: 0.0 ± 0.0
0.398TrpGln: 0.398 ± 0.155
0.637TrpArg: 0.637 ± 0.253
0.557TrpSer: 0.557 ± 0.18
0.796TrpThr: 0.796 ± 0.232
1.354TrpVal: 1.354 ± 0.354
0.398TrpTrp: 0.398 ± 0.163
0.398TrpTyr: 0.398 ± 0.192
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.707TyrAla: 2.707 ± 0.47
0.478TyrCys: 0.478 ± 0.175
3.344TyrAsp: 3.344 ± 0.492
4.141TyrGlu: 4.141 ± 0.67
1.593TyrPhe: 1.593 ± 0.406
2.389TyrGly: 2.389 ± 0.479
0.637TyrHis: 0.637 ± 0.246
3.504TyrIle: 3.504 ± 0.614
3.504TyrLys: 3.504 ± 0.593
3.822TyrLeu: 3.822 ± 0.497
0.796TyrMet: 0.796 ± 0.211
3.743TyrAsn: 3.743 ± 0.61
1.354TyrPro: 1.354 ± 0.379
1.115TyrGln: 1.115 ± 0.256
1.035TyrArg: 1.035 ± 0.327
1.991TyrSer: 1.991 ± 0.437
2.628TyrThr: 2.628 ± 0.567
2.23TyrVal: 2.23 ± 0.356
0.319TyrTrp: 0.319 ± 0.191
1.513TyrTyr: 1.513 ± 0.355
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12559 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski