Amino acid dipepetide frequency for Streptococcus phage SW6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.181AlaAla: 3.181 ± 0.893
0.099AlaCys: 0.099 ± 0.094
4.075AlaAsp: 4.075 ± 0.722
3.777AlaGlu: 3.777 ± 0.482
2.584AlaPhe: 2.584 ± 0.67
3.379AlaGly: 3.379 ± 0.691
0.696AlaHis: 0.696 ± 0.276
5.168AlaIle: 5.168 ± 1.067
7.057AlaLys: 7.057 ± 1.344
5.964AlaLeu: 5.964 ± 0.663
1.292AlaMet: 1.292 ± 0.354
4.97AlaAsn: 4.97 ± 0.91
1.392AlaPro: 1.392 ± 0.347
2.485AlaGln: 2.485 ± 0.458
2.087AlaArg: 2.087 ± 0.51
4.771AlaSer: 4.771 ± 0.807
4.672AlaThr: 4.672 ± 0.863
4.672AlaVal: 4.672 ± 0.595
0.994AlaTrp: 0.994 ± 0.241
2.087AlaTyr: 2.087 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.398CysAla: 0.398 ± 0.198
0.0CysCys: 0.0 ± 0.0
0.795CysAsp: 0.795 ± 0.286
0.298CysGlu: 0.298 ± 0.146
0.298CysPhe: 0.298 ± 0.209
0.0CysGly: 0.0 ± 0.0
0.099CysHis: 0.099 ± 0.114
0.0CysIle: 0.0 ± 0.0
0.199CysLys: 0.199 ± 0.141
0.398CysLeu: 0.398 ± 0.244
0.099CysMet: 0.099 ± 0.094
0.298CysAsn: 0.298 ± 0.149
0.199CysPro: 0.199 ± 0.125
0.099CysGln: 0.099 ± 0.095
0.298CysArg: 0.298 ± 0.222
0.497CysSer: 0.497 ± 0.279
0.199CysThr: 0.199 ± 0.122
0.298CysVal: 0.298 ± 0.136
0.199CysTrp: 0.199 ± 0.134
0.099CysTyr: 0.099 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
3.678AspAla: 3.678 ± 0.671
0.398AspCys: 0.398 ± 0.2
4.672AspAsp: 4.672 ± 0.597
4.373AspGlu: 4.373 ± 0.709
3.081AspPhe: 3.081 ± 0.375
5.765AspGly: 5.765 ± 0.716
0.895AspHis: 0.895 ± 0.301
5.566AspIle: 5.566 ± 0.966
5.964AspLys: 5.964 ± 0.606
4.175AspLeu: 4.175 ± 0.709
2.286AspMet: 2.286 ± 0.471
4.175AspAsn: 4.175 ± 0.879
2.385AspPro: 2.385 ± 0.379
1.491AspGln: 1.491 ± 0.288
2.684AspArg: 2.684 ± 0.424
3.081AspSer: 3.081 ± 0.438
4.672AspThr: 4.672 ± 0.633
3.578AspVal: 3.578 ± 0.526
0.795AspTrp: 0.795 ± 0.312
2.684AspTyr: 2.684 ± 0.469
0.0AspXaa: 0.0 ± 0.0
Glu
4.075GluAla: 4.075 ± 0.601
0.199GluCys: 0.199 ± 0.129
3.678GluAsp: 3.678 ± 0.718
4.87GluGlu: 4.87 ± 0.96
2.187GluPhe: 2.187 ± 0.555
2.882GluGly: 2.882 ± 0.519
1.093GluHis: 1.093 ± 0.315
6.162GluIle: 6.162 ± 0.775
3.876GluLys: 3.876 ± 0.787
6.063GluLeu: 6.063 ± 0.712
2.286GluMet: 2.286 ± 0.417
4.274GluAsn: 4.274 ± 0.72
1.491GluPro: 1.491 ± 0.372
2.882GluGln: 2.882 ± 0.488
3.181GluArg: 3.181 ± 0.627
3.081GluSer: 3.081 ± 0.397
3.777GluThr: 3.777 ± 0.554
3.678GluVal: 3.678 ± 0.52
1.491GluTrp: 1.491 ± 0.321
3.181GluTyr: 3.181 ± 0.539
0.0GluXaa: 0.0 ± 0.0
Phe
3.081PheAla: 3.081 ± 0.561
0.199PheCys: 0.199 ± 0.145
3.578PheAsp: 3.578 ± 0.603
1.59PheGlu: 1.59 ± 0.403
1.988PhePhe: 1.988 ± 0.49
3.181PheGly: 3.181 ± 0.67
0.497PheHis: 0.497 ± 0.18
2.783PheIle: 2.783 ± 0.422
3.578PheLys: 3.578 ± 0.49
2.783PheLeu: 2.783 ± 0.427
0.696PheMet: 0.696 ± 0.249
2.982PheAsn: 2.982 ± 0.71
0.994PhePro: 0.994 ± 0.244
1.093PheGln: 1.093 ± 0.291
1.59PheArg: 1.59 ± 0.351
3.081PheSer: 3.081 ± 0.51
2.187PheThr: 2.187 ± 0.35
2.584PheVal: 2.584 ± 0.381
0.696PheTrp: 0.696 ± 0.241
2.087PheTyr: 2.087 ± 0.499
0.0PheXaa: 0.0 ± 0.0
Gly
2.982GlyAla: 2.982 ± 0.559
0.298GlyCys: 0.298 ± 0.167
3.976GlyAsp: 3.976 ± 0.498
3.777GlyGlu: 3.777 ± 0.61
4.075GlyPhe: 4.075 ± 0.71
4.572GlyGly: 4.572 ± 0.792
0.795GlyHis: 0.795 ± 0.308
4.175GlyIle: 4.175 ± 0.809
5.964GlyLys: 5.964 ± 0.749
6.56GlyLeu: 6.56 ± 0.858
1.392GlyMet: 1.392 ± 0.33
4.373GlyAsn: 4.373 ± 0.676
0.994GlyPro: 0.994 ± 0.367
3.181GlyGln: 3.181 ± 0.651
2.882GlyArg: 2.882 ± 0.409
4.274GlySer: 4.274 ± 0.862
4.274GlyThr: 4.274 ± 0.78
4.373GlyVal: 4.373 ± 0.708
1.093GlyTrp: 1.093 ± 0.363
2.982GlyTyr: 2.982 ± 0.596
0.0GlyXaa: 0.0 ± 0.0
His
0.199HisAla: 0.199 ± 0.15
0.099HisCys: 0.099 ± 0.094
0.994HisAsp: 0.994 ± 0.322
0.596HisGlu: 0.596 ± 0.273
0.596HisPhe: 0.596 ± 0.268
0.696HisGly: 0.696 ± 0.251
0.497HisHis: 0.497 ± 0.259
0.895HisIle: 0.895 ± 0.308
0.994HisLys: 0.994 ± 0.322
1.392HisLeu: 1.392 ± 0.292
0.596HisMet: 0.596 ± 0.253
0.696HisAsn: 0.696 ± 0.217
0.696HisPro: 0.696 ± 0.237
0.895HisGln: 0.895 ± 0.297
0.795HisArg: 0.795 ± 0.238
0.696HisSer: 0.696 ± 0.191
0.895HisThr: 0.895 ± 0.277
1.292HisVal: 1.292 ± 0.381
0.099HisTrp: 0.099 ± 0.094
0.696HisTyr: 0.696 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
4.672IleAla: 4.672 ± 0.882
0.497IleCys: 0.497 ± 0.234
5.864IleAsp: 5.864 ± 0.762
4.075IleGlu: 4.075 ± 0.77
1.193IlePhe: 1.193 ± 0.41
5.168IleGly: 5.168 ± 0.718
0.994IleHis: 0.994 ± 0.237
2.485IleIle: 2.485 ± 0.442
7.256IleLys: 7.256 ± 0.795
4.373IleLeu: 4.373 ± 0.699
1.888IleMet: 1.888 ± 0.457
3.28IleAsn: 3.28 ± 0.397
2.882IlePro: 2.882 ± 0.595
3.181IleGln: 3.181 ± 0.481
3.081IleArg: 3.081 ± 0.482
4.672IleSer: 4.672 ± 0.62
4.175IleThr: 4.175 ± 0.618
3.578IleVal: 3.578 ± 0.624
0.795IleTrp: 0.795 ± 0.236
2.087IleTyr: 2.087 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
5.665LysAla: 5.665 ± 0.706
0.298LysCys: 0.298 ± 0.192
5.467LysAsp: 5.467 ± 0.804
5.864LysGlu: 5.864 ± 0.75
2.783LysPhe: 2.783 ± 0.764
6.063LysGly: 6.063 ± 0.655
1.69LysHis: 1.69 ± 0.506
5.964LysIle: 5.964 ± 0.738
5.864LysLys: 5.864 ± 1.027
6.56LysLeu: 6.56 ± 0.857
2.385LysMet: 2.385 ± 0.532
4.572LysAsn: 4.572 ± 0.635
3.181LysPro: 3.181 ± 0.403
3.976LysGln: 3.976 ± 0.494
3.876LysArg: 3.876 ± 0.441
4.473LysSer: 4.473 ± 0.616
5.268LysThr: 5.268 ± 0.783
4.175LysVal: 4.175 ± 0.733
1.193LysTrp: 1.193 ± 0.289
3.578LysTyr: 3.578 ± 0.73
0.0LysXaa: 0.0 ± 0.0
Leu
7.057LeuAla: 7.057 ± 0.876
0.596LeuCys: 0.596 ± 0.214
5.069LeuAsp: 5.069 ± 0.75
6.063LeuGlu: 6.063 ± 0.826
3.081LeuPhe: 3.081 ± 0.433
5.964LeuGly: 5.964 ± 1.055
0.895LeuHis: 0.895 ± 0.35
3.777LeuIle: 3.777 ± 0.645
7.156LeuLys: 7.156 ± 0.801
5.069LeuLeu: 5.069 ± 0.7
2.187LeuMet: 2.187 ± 0.399
5.268LeuAsn: 5.268 ± 0.718
3.081LeuPro: 3.081 ± 0.454
2.982LeuGln: 2.982 ± 0.441
3.081LeuArg: 3.081 ± 0.693
5.069LeuSer: 5.069 ± 0.585
6.262LeuThr: 6.262 ± 0.954
3.678LeuVal: 3.678 ± 0.518
0.795LeuTrp: 0.795 ± 0.196
1.888LeuTyr: 1.888 ± 0.454
0.0LeuXaa: 0.0 ± 0.0
Met
1.988MetAla: 1.988 ± 0.412
0.0MetCys: 0.0 ± 0.0
0.596MetAsp: 0.596 ± 0.216
1.59MetGlu: 1.59 ± 0.492
1.193MetPhe: 1.193 ± 0.281
0.895MetGly: 0.895 ± 0.308
0.298MetHis: 0.298 ± 0.139
2.187MetIle: 2.187 ± 0.497
2.684MetLys: 2.684 ± 0.441
1.59MetLeu: 1.59 ± 0.375
0.398MetMet: 0.398 ± 0.172
1.093MetAsn: 1.093 ± 0.286
1.093MetPro: 1.093 ± 0.269
0.696MetGln: 0.696 ± 0.211
0.895MetArg: 0.895 ± 0.246
2.187MetSer: 2.187 ± 0.413
1.59MetThr: 1.59 ± 0.35
1.988MetVal: 1.988 ± 0.421
0.298MetTrp: 0.298 ± 0.15
0.795MetTyr: 0.795 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
5.467AsnAla: 5.467 ± 1.102
0.199AsnCys: 0.199 ± 0.145
3.479AsnAsp: 3.479 ± 0.53
3.777AsnGlu: 3.777 ± 0.781
2.684AsnPhe: 2.684 ± 0.434
7.355AsnGly: 7.355 ± 1.359
0.795AsnHis: 0.795 ± 0.243
3.578AsnIle: 3.578 ± 0.619
4.175AsnLys: 4.175 ± 0.459
4.274AsnLeu: 4.274 ± 0.652
1.292AsnMet: 1.292 ± 0.318
4.274AsnAsn: 4.274 ± 0.855
3.28AsnPro: 3.28 ± 0.505
2.783AsnGln: 2.783 ± 0.446
2.385AsnArg: 2.385 ± 0.471
4.373AsnSer: 4.373 ± 0.733
3.777AsnThr: 3.777 ± 0.44
3.578AsnVal: 3.578 ± 0.416
1.193AsnTrp: 1.193 ± 0.305
2.087AsnTyr: 2.087 ± 0.448
0.0AsnXaa: 0.0 ± 0.0
Pro
1.59ProAla: 1.59 ± 0.337
0.199ProCys: 0.199 ± 0.212
1.59ProAsp: 1.59 ± 0.463
2.087ProGlu: 2.087 ± 0.397
1.193ProPhe: 1.193 ± 0.302
0.795ProGly: 0.795 ± 0.273
0.497ProHis: 0.497 ± 0.164
2.087ProIle: 2.087 ± 0.309
3.479ProLys: 3.479 ± 0.55
2.584ProLeu: 2.584 ± 0.405
0.298ProMet: 0.298 ± 0.164
2.982ProAsn: 2.982 ± 0.442
0.497ProPro: 0.497 ± 0.245
1.093ProGln: 1.093 ± 0.31
1.292ProArg: 1.292 ± 0.459
2.882ProSer: 2.882 ± 0.528
1.988ProThr: 1.988 ± 0.347
1.69ProVal: 1.69 ± 0.388
0.398ProTrp: 0.398 ± 0.159
1.491ProTyr: 1.491 ± 0.452
0.0ProXaa: 0.0 ± 0.0
Gln
3.976GlnAla: 3.976 ± 0.539
0.099GlnCys: 0.099 ± 0.106
1.988GlnAsp: 1.988 ± 0.335
2.385GlnGlu: 2.385 ± 0.496
1.093GlnPhe: 1.093 ± 0.218
3.181GlnGly: 3.181 ± 0.628
0.795GlnHis: 0.795 ± 0.336
2.385GlnIle: 2.385 ± 0.58
3.081GlnLys: 3.081 ± 0.476
3.678GlnLeu: 3.678 ± 0.382
1.292GlnMet: 1.292 ± 0.283
2.286GlnAsn: 2.286 ± 0.449
0.298GlnPro: 0.298 ± 0.127
2.783GlnGln: 2.783 ± 0.54
2.286GlnArg: 2.286 ± 0.426
2.187GlnSer: 2.187 ± 0.382
3.479GlnThr: 3.479 ± 0.548
2.385GlnVal: 2.385 ± 0.522
0.398GlnTrp: 0.398 ± 0.205
1.789GlnTyr: 1.789 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
2.087ArgAla: 2.087 ± 0.368
0.099ArgCys: 0.099 ± 0.095
2.485ArgAsp: 2.485 ± 0.464
2.485ArgGlu: 2.485 ± 0.622
2.385ArgPhe: 2.385 ± 0.443
2.485ArgGly: 2.485 ± 0.6
0.696ArgHis: 0.696 ± 0.23
2.783ArgIle: 2.783 ± 0.589
3.578ArgLys: 3.578 ± 0.843
3.181ArgLeu: 3.181 ± 0.561
0.696ArgMet: 0.696 ± 0.256
3.181ArgAsn: 3.181 ± 0.397
1.193ArgPro: 1.193 ± 0.287
2.087ArgGln: 2.087 ± 0.489
1.292ArgArg: 1.292 ± 0.322
1.59ArgSer: 1.59 ± 0.338
3.081ArgThr: 3.081 ± 0.782
2.485ArgVal: 2.485 ± 0.439
1.193ArgTrp: 1.193 ± 0.286
2.385ArgTyr: 2.385 ± 0.552
0.0ArgXaa: 0.0 ± 0.0
Ser
3.876SerAla: 3.876 ± 0.598
0.398SerCys: 0.398 ± 0.193
4.373SerAsp: 4.373 ± 0.515
3.876SerGlu: 3.876 ± 0.536
3.181SerPhe: 3.181 ± 0.499
4.473SerGly: 4.473 ± 0.611
0.398SerHis: 0.398 ± 0.168
4.274SerIle: 4.274 ± 0.64
5.168SerLys: 5.168 ± 0.978
4.175SerLeu: 4.175 ± 0.6
1.69SerMet: 1.69 ± 0.34
4.87SerAsn: 4.87 ± 0.685
1.69SerPro: 1.69 ± 0.332
2.882SerGln: 2.882 ± 0.6
2.982SerArg: 2.982 ± 0.595
4.473SerSer: 4.473 ± 1.187
4.175SerThr: 4.175 ± 0.616
5.168SerVal: 5.168 ± 0.791
0.795SerTrp: 0.795 ± 0.297
1.789SerTyr: 1.789 ± 0.504
0.0SerXaa: 0.0 ± 0.0
Thr
4.373ThrAla: 4.373 ± 0.658
0.298ThrCys: 0.298 ± 0.176
4.672ThrAsp: 4.672 ± 0.701
4.373ThrGlu: 4.373 ± 0.556
2.783ThrPhe: 2.783 ± 0.615
3.578ThrGly: 3.578 ± 0.448
1.093ThrHis: 1.093 ± 0.371
4.175ThrIle: 4.175 ± 0.63
4.87ThrLys: 4.87 ± 0.737
7.355ThrLeu: 7.355 ± 0.803
1.093ThrMet: 1.093 ± 0.276
4.075ThrAsn: 4.075 ± 0.582
2.087ThrPro: 2.087 ± 0.468
2.286ThrGln: 2.286 ± 0.484
1.988ThrArg: 1.988 ± 0.476
4.672ThrSer: 4.672 ± 0.755
3.976ThrThr: 3.976 ± 0.639
3.578ThrVal: 3.578 ± 0.5
0.795ThrTrp: 0.795 ± 0.269
3.28ThrTyr: 3.28 ± 0.595
0.0ThrXaa: 0.0 ± 0.0
Val
3.678ValAla: 3.678 ± 0.589
0.298ValCys: 0.298 ± 0.167
4.87ValAsp: 4.87 ± 0.459
4.473ValGlu: 4.473 ± 0.704
2.187ValPhe: 2.187 ± 0.467
4.175ValGly: 4.175 ± 0.658
0.398ValHis: 0.398 ± 0.157
4.87ValIle: 4.87 ± 0.633
4.672ValLys: 4.672 ± 0.613
3.876ValLeu: 3.876 ± 0.704
1.193ValMet: 1.193 ± 0.289
4.175ValAsn: 4.175 ± 0.785
1.59ValPro: 1.59 ± 0.298
1.888ValGln: 1.888 ± 0.409
1.888ValArg: 1.888 ± 0.601
4.87ValSer: 4.87 ± 0.611
4.672ValThr: 4.672 ± 0.864
4.175ValVal: 4.175 ± 0.856
1.093ValTrp: 1.093 ± 0.258
1.888ValTyr: 1.888 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
0.696TrpAla: 0.696 ± 0.21
0.0TrpCys: 0.0 ± 0.0
1.093TrpAsp: 1.093 ± 0.311
1.292TrpGlu: 1.292 ± 0.216
0.596TrpPhe: 0.596 ± 0.221
0.795TrpGly: 0.795 ± 0.247
0.398TrpHis: 0.398 ± 0.204
0.596TrpIle: 0.596 ± 0.156
0.696TrpLys: 0.696 ± 0.341
1.093TrpLeu: 1.093 ± 0.356
0.099TrpMet: 0.099 ± 0.11
1.093TrpAsn: 1.093 ± 0.356
0.199TrpPro: 0.199 ± 0.131
0.696TrpGln: 0.696 ± 0.222
0.696TrpArg: 0.696 ± 0.212
1.69TrpSer: 1.69 ± 0.686
0.795TrpThr: 0.795 ± 0.224
1.491TrpVal: 1.491 ± 0.279
0.298TrpTrp: 0.298 ± 0.203
0.497TrpTyr: 0.497 ± 0.239
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.684TyrAla: 2.684 ± 0.472
0.497TyrCys: 0.497 ± 0.272
3.081TyrAsp: 3.081 ± 0.475
2.783TyrGlu: 2.783 ± 0.57
2.087TyrPhe: 2.087 ± 0.357
1.59TyrGly: 1.59 ± 0.389
0.795TyrHis: 0.795 ± 0.23
2.286TyrIle: 2.286 ± 0.394
2.584TyrLys: 2.584 ± 0.457
3.876TyrLeu: 3.876 ± 0.469
0.795TyrMet: 0.795 ± 0.264
1.888TyrAsn: 1.888 ± 0.387
1.491TyrPro: 1.491 ± 0.391
2.385TyrGln: 2.385 ± 0.415
2.286TyrArg: 2.286 ± 0.337
2.087TyrSer: 2.087 ± 0.567
1.491TyrThr: 1.491 ± 0.336
2.485TyrVal: 2.485 ± 0.387
0.199TyrTrp: 0.199 ± 0.138
2.286TyrTyr: 2.286 ± 0.616
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (10062 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski