Amino acid dipepetide frequency for Streptococcus phage Javan488

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.673AlaAla: 4.673 ± 1.487
0.396AlaCys: 0.396 ± 0.179
4.752AlaAsp: 4.752 ± 0.566
6.099AlaGlu: 6.099 ± 0.855
1.505AlaPhe: 1.505 ± 0.344
3.722AlaGly: 3.722 ± 0.693
1.426AlaHis: 1.426 ± 0.33
5.623AlaIle: 5.623 ± 0.745
6.891AlaLys: 6.891 ± 0.647
7.128AlaLeu: 7.128 ± 0.903
2.059AlaMet: 2.059 ± 0.435
4.435AlaAsn: 4.435 ± 0.63
2.455AlaPro: 2.455 ± 0.462
3.722AlaGln: 3.722 ± 0.691
3.881AlaArg: 3.881 ± 0.558
4.514AlaSer: 4.514 ± 1.091
4.99AlaThr: 4.99 ± 0.715
4.514AlaVal: 4.514 ± 0.657
1.03AlaTrp: 1.03 ± 0.272
2.376AlaTyr: 2.376 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.475CysAla: 0.475 ± 0.174
0.158CysCys: 0.158 ± 0.114
0.317CysAsp: 0.317 ± 0.2
0.95CysGlu: 0.95 ± 0.243
0.554CysPhe: 0.554 ± 0.178
0.634CysGly: 0.634 ± 0.295
0.158CysHis: 0.158 ± 0.11
0.238CysIle: 0.238 ± 0.123
0.475CysLys: 0.475 ± 0.204
0.713CysLeu: 0.713 ± 0.232
0.158CysMet: 0.158 ± 0.105
0.396CysAsn: 0.396 ± 0.17
0.317CysPro: 0.317 ± 0.161
0.0CysGln: 0.0 ± 0.0
0.396CysArg: 0.396 ± 0.166
0.238CysSer: 0.238 ± 0.138
0.079CysThr: 0.079 ± 0.087
0.396CysVal: 0.396 ± 0.167
0.079CysTrp: 0.079 ± 0.067
0.158CysTyr: 0.158 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
4.514AspAla: 4.514 ± 0.738
0.396AspCys: 0.396 ± 0.19
4.594AspAsp: 4.594 ± 0.751
4.752AspGlu: 4.752 ± 0.627
2.376AspPhe: 2.376 ± 0.314
6.178AspGly: 6.178 ± 0.739
0.871AspHis: 0.871 ± 0.23
4.356AspIle: 4.356 ± 0.702
5.94AspLys: 5.94 ± 0.684
6.732AspLeu: 6.732 ± 0.906
1.426AspMet: 1.426 ± 0.281
4.039AspAsn: 4.039 ± 0.525
1.505AspPro: 1.505 ± 0.341
1.584AspGln: 1.584 ± 0.375
3.01AspArg: 3.01 ± 0.516
4.198AspSer: 4.198 ± 0.6
3.406AspThr: 3.406 ± 0.584
4.514AspVal: 4.514 ± 0.518
0.871AspTrp: 0.871 ± 0.289
3.406AspTyr: 3.406 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
5.069GluAla: 5.069 ± 0.607
0.396GluCys: 0.396 ± 0.15
3.564GluAsp: 3.564 ± 0.626
5.465GluGlu: 5.465 ± 0.775
2.534GluPhe: 2.534 ± 0.439
3.564GluGly: 3.564 ± 0.506
0.95GluHis: 0.95 ± 0.276
6.019GluIle: 6.019 ± 0.789
5.227GluLys: 5.227 ± 0.616
8.633GluLeu: 8.633 ± 0.777
1.426GluMet: 1.426 ± 0.473
3.485GluAsn: 3.485 ± 0.508
2.218GluPro: 2.218 ± 0.484
4.118GluGln: 4.118 ± 0.634
3.247GluArg: 3.247 ± 0.637
4.356GluSer: 4.356 ± 0.511
3.802GluThr: 3.802 ± 0.548
4.911GluVal: 4.911 ± 0.654
1.109GluTrp: 1.109 ± 0.28
2.772GluTyr: 2.772 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
2.376PheAla: 2.376 ± 0.493
0.396PheCys: 0.396 ± 0.185
3.96PheAsp: 3.96 ± 0.472
2.534PheGlu: 2.534 ± 0.478
0.475PhePhe: 0.475 ± 0.226
2.138PheGly: 2.138 ± 0.353
0.238PheHis: 0.238 ± 0.146
1.901PheIle: 1.901 ± 0.398
2.138PheLys: 2.138 ± 0.341
2.297PheLeu: 2.297 ± 0.398
0.713PheMet: 0.713 ± 0.208
1.901PheAsn: 1.901 ± 0.372
0.713PhePro: 0.713 ± 0.298
0.554PheGln: 0.554 ± 0.232
1.901PheArg: 1.901 ± 0.446
2.138PheSer: 2.138 ± 0.401
2.376PheThr: 2.376 ± 0.37
1.822PheVal: 1.822 ± 0.329
0.238PheTrp: 0.238 ± 0.203
1.109PheTyr: 1.109 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
4.673GlyAla: 4.673 ± 0.8
0.475GlyCys: 0.475 ± 0.158
3.168GlyAsp: 3.168 ± 0.365
3.722GlyGlu: 3.722 ± 0.622
2.772GlyPhe: 2.772 ± 0.439
3.96GlyGly: 3.96 ± 0.561
1.267GlyHis: 1.267 ± 0.264
4.99GlyIle: 4.99 ± 0.772
6.178GlyLys: 6.178 ± 0.5
4.435GlyLeu: 4.435 ± 0.684
1.346GlyMet: 1.346 ± 0.307
3.406GlyAsn: 3.406 ± 0.566
0.792GlyPro: 0.792 ± 0.277
3.089GlyGln: 3.089 ± 0.45
1.901GlyArg: 1.901 ± 0.378
3.485GlySer: 3.485 ± 0.367
3.643GlyThr: 3.643 ± 0.555
5.623GlyVal: 5.623 ± 0.932
1.584GlyTrp: 1.584 ± 0.448
2.376GlyTyr: 2.376 ± 0.614
0.0GlyXaa: 0.0 ± 0.0
His
1.267HisAla: 1.267 ± 0.3
0.158HisCys: 0.158 ± 0.108
0.713HisAsp: 0.713 ± 0.233
1.188HisGlu: 1.188 ± 0.359
1.267HisPhe: 1.267 ± 0.279
0.871HisGly: 0.871 ± 0.271
0.317HisHis: 0.317 ± 0.172
1.03HisIle: 1.03 ± 0.26
0.792HisLys: 0.792 ± 0.223
1.584HisLeu: 1.584 ± 0.335
0.158HisMet: 0.158 ± 0.116
0.713HisAsn: 0.713 ± 0.196
0.554HisPro: 0.554 ± 0.183
0.634HisGln: 0.634 ± 0.183
0.634HisArg: 0.634 ± 0.185
0.871HisSer: 0.871 ± 0.327
0.95HisThr: 0.95 ± 0.299
0.554HisVal: 0.554 ± 0.19
0.158HisTrp: 0.158 ± 0.101
0.713HisTyr: 0.713 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
6.099IleAla: 6.099 ± 0.839
0.317IleCys: 0.317 ± 0.155
6.732IleAsp: 6.732 ± 0.823
5.623IleGlu: 5.623 ± 0.733
1.822IlePhe: 1.822 ± 0.401
3.643IleGly: 3.643 ± 0.47
0.475IleHis: 0.475 ± 0.209
3.722IleIle: 3.722 ± 0.694
7.524IleLys: 7.524 ± 0.896
3.802IleLeu: 3.802 ± 0.572
0.792IleMet: 0.792 ± 0.225
5.465IleAsn: 5.465 ± 0.596
0.95IlePro: 0.95 ± 0.232
1.663IleGln: 1.663 ± 0.268
2.218IleArg: 2.218 ± 0.408
4.118IleSer: 4.118 ± 0.753
4.752IleThr: 4.752 ± 0.473
4.514IleVal: 4.514 ± 0.667
0.634IleTrp: 0.634 ± 0.213
2.93IleTyr: 2.93 ± 0.451
0.0IleXaa: 0.0 ± 0.0
Lys
7.999LysAla: 7.999 ± 0.743
0.554LysCys: 0.554 ± 0.192
5.307LysAsp: 5.307 ± 0.801
6.178LysGlu: 6.178 ± 0.714
1.901LysPhe: 1.901 ± 0.424
5.227LysGly: 5.227 ± 0.704
1.188LysHis: 1.188 ± 0.248
5.386LysIle: 5.386 ± 0.835
6.811LysLys: 6.811 ± 0.891
7.049LysLeu: 7.049 ± 0.762
2.059LysMet: 2.059 ± 0.37
5.465LysAsn: 5.465 ± 0.669
2.376LysPro: 2.376 ± 0.505
4.039LysGln: 4.039 ± 0.685
3.881LysArg: 3.881 ± 0.569
3.881LysSer: 3.881 ± 0.549
5.069LysThr: 5.069 ± 0.555
5.703LysVal: 5.703 ± 0.774
0.792LysTrp: 0.792 ± 0.237
3.96LysTyr: 3.96 ± 0.49
0.0LysXaa: 0.0 ± 0.0
Leu
7.128LeuAla: 7.128 ± 0.86
0.634LeuCys: 0.634 ± 0.228
7.603LeuAsp: 7.603 ± 0.823
6.891LeuGlu: 6.891 ± 0.751
2.455LeuPhe: 2.455 ± 0.464
5.861LeuGly: 5.861 ± 0.526
0.792LeuHis: 0.792 ± 0.341
4.594LeuIle: 4.594 ± 0.709
7.841LeuLys: 7.841 ± 0.791
6.336LeuLeu: 6.336 ± 0.936
1.426LeuMet: 1.426 ± 0.4
6.019LeuAsn: 6.019 ± 0.769
2.693LeuPro: 2.693 ± 0.483
2.693LeuGln: 2.693 ± 0.449
3.485LeuArg: 3.485 ± 0.599
4.99LeuSer: 4.99 ± 0.572
4.831LeuThr: 4.831 ± 0.404
4.752LeuVal: 4.752 ± 0.609
0.792LeuTrp: 0.792 ± 0.213
2.693LeuTyr: 2.693 ± 0.529
0.0LeuXaa: 0.0 ± 0.0
Met
1.109MetAla: 1.109 ± 0.319
0.158MetCys: 0.158 ± 0.103
1.584MetAsp: 1.584 ± 0.418
1.267MetGlu: 1.267 ± 0.296
0.079MetPhe: 0.079 ± 0.087
1.426MetGly: 1.426 ± 0.361
0.158MetHis: 0.158 ± 0.107
1.267MetIle: 1.267 ± 0.262
1.584MetLys: 1.584 ± 0.337
1.742MetLeu: 1.742 ± 0.394
0.238MetMet: 0.238 ± 0.149
1.188MetAsn: 1.188 ± 0.313
0.95MetPro: 0.95 ± 0.247
1.98MetGln: 1.98 ± 0.444
1.901MetArg: 1.901 ± 0.392
2.376MetSer: 2.376 ± 0.511
2.455MetThr: 2.455 ± 0.472
0.95MetVal: 0.95 ± 0.308
0.317MetTrp: 0.317 ± 0.145
0.238MetTyr: 0.238 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
4.198AsnAla: 4.198 ± 0.709
0.396AsnCys: 0.396 ± 0.21
3.643AsnAsp: 3.643 ± 0.525
2.851AsnGlu: 2.851 ± 0.544
2.059AsnPhe: 2.059 ± 0.433
4.039AsnGly: 4.039 ± 0.508
0.95AsnHis: 0.95 ± 0.31
4.039AsnIle: 4.039 ± 0.68
4.673AsnLys: 4.673 ± 0.605
4.435AsnLeu: 4.435 ± 0.42
1.98AsnMet: 1.98 ± 0.398
3.326AsnAsn: 3.326 ± 0.548
2.218AsnPro: 2.218 ± 0.488
2.614AsnGln: 2.614 ± 0.397
3.01AsnArg: 3.01 ± 0.553
3.247AsnSer: 3.247 ± 0.602
2.693AsnThr: 2.693 ± 0.497
2.93AsnVal: 2.93 ± 0.424
0.713AsnTrp: 0.713 ± 0.25
1.901AsnTyr: 1.901 ± 0.522
0.0AsnXaa: 0.0 ± 0.0
Pro
1.505ProAla: 1.505 ± 0.316
0.158ProCys: 0.158 ± 0.107
1.663ProAsp: 1.663 ± 0.352
2.138ProGlu: 2.138 ± 0.471
1.109ProPhe: 1.109 ± 0.311
1.109ProGly: 1.109 ± 0.308
0.475ProHis: 0.475 ± 0.197
2.138ProIle: 2.138 ± 0.521
2.93ProLys: 2.93 ± 0.409
2.851ProLeu: 2.851 ± 0.482
0.634ProMet: 0.634 ± 0.226
1.346ProAsn: 1.346 ± 0.338
1.188ProPro: 1.188 ± 0.43
1.267ProGln: 1.267 ± 0.404
1.03ProArg: 1.03 ± 0.273
2.138ProSer: 2.138 ± 0.421
2.138ProThr: 2.138 ± 0.474
2.138ProVal: 2.138 ± 0.374
0.0ProTrp: 0.0 ± 0.0
1.822ProTyr: 1.822 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
3.247GlnAla: 3.247 ± 0.475
0.475GlnCys: 0.475 ± 0.192
1.426GlnAsp: 1.426 ± 0.394
3.564GlnGlu: 3.564 ± 0.6
1.663GlnPhe: 1.663 ± 0.333
2.614GlnGly: 2.614 ± 0.571
0.871GlnHis: 0.871 ± 0.277
3.01GlnIle: 3.01 ± 0.431
3.802GlnLys: 3.802 ± 0.537
4.277GlnLeu: 4.277 ± 0.61
1.267GlnMet: 1.267 ± 0.27
1.742GlnAsn: 1.742 ± 0.421
1.426GlnPro: 1.426 ± 0.561
2.772GlnGln: 2.772 ± 0.807
2.534GlnArg: 2.534 ± 0.483
2.93GlnSer: 2.93 ± 0.512
3.01GlnThr: 3.01 ± 0.567
1.584GlnVal: 1.584 ± 0.377
0.713GlnTrp: 0.713 ± 0.218
0.871GlnTyr: 0.871 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
3.089ArgAla: 3.089 ± 0.443
0.238ArgCys: 0.238 ± 0.138
2.059ArgAsp: 2.059 ± 0.378
3.485ArgGlu: 3.485 ± 0.558
1.426ArgPhe: 1.426 ± 0.33
2.534ArgGly: 2.534 ± 0.419
0.713ArgHis: 0.713 ± 0.234
2.772ArgIle: 2.772 ± 0.435
3.643ArgLys: 3.643 ± 0.49
3.881ArgLeu: 3.881 ± 0.565
1.267ArgMet: 1.267 ± 0.28
2.218ArgAsn: 2.218 ± 0.391
1.109ArgPro: 1.109 ± 0.377
2.138ArgGln: 2.138 ± 0.446
1.742ArgArg: 1.742 ± 0.372
1.901ArgSer: 1.901 ± 0.325
3.01ArgThr: 3.01 ± 0.461
3.01ArgVal: 3.01 ± 0.5
0.713ArgTrp: 0.713 ± 0.256
1.98ArgTyr: 1.98 ± 0.584
0.0ArgXaa: 0.0 ± 0.0
Ser
4.594SerAla: 4.594 ± 0.771
0.396SerCys: 0.396 ± 0.173
4.435SerAsp: 4.435 ± 0.649
4.752SerGlu: 4.752 ± 0.598
2.376SerPhe: 2.376 ± 0.384
3.96SerGly: 3.96 ± 0.599
0.871SerHis: 0.871 ± 0.298
4.118SerIle: 4.118 ± 0.583
4.277SerLys: 4.277 ± 0.528
5.148SerLeu: 5.148 ± 0.718
1.742SerMet: 1.742 ± 0.32
2.297SerAsn: 2.297 ± 0.478
1.346SerPro: 1.346 ± 0.358
2.93SerGln: 2.93 ± 0.468
1.822SerArg: 1.822 ± 0.467
2.93SerSer: 2.93 ± 0.547
3.881SerThr: 3.881 ± 0.763
3.168SerVal: 3.168 ± 0.578
0.95SerTrp: 0.95 ± 0.389
1.742SerTyr: 1.742 ± 0.421
0.0SerXaa: 0.0 ± 0.0
Thr
6.257ThrAla: 6.257 ± 0.837
0.158ThrCys: 0.158 ± 0.104
3.722ThrAsp: 3.722 ± 0.517
4.277ThrGlu: 4.277 ± 0.537
1.822ThrPhe: 1.822 ± 0.384
4.198ThrGly: 4.198 ± 0.586
1.109ThrHis: 1.109 ± 0.296
4.039ThrIle: 4.039 ± 0.5
4.277ThrLys: 4.277 ± 0.444
4.911ThrLeu: 4.911 ± 0.503
1.505ThrMet: 1.505 ± 0.386
2.93ThrAsn: 2.93 ± 0.426
3.01ThrPro: 3.01 ± 0.422
2.693ThrGln: 2.693 ± 0.48
2.455ThrArg: 2.455 ± 0.454
3.406ThrSer: 3.406 ± 0.465
4.198ThrThr: 4.198 ± 0.649
5.307ThrVal: 5.307 ± 0.808
0.634ThrTrp: 0.634 ± 0.18
2.138ThrTyr: 2.138 ± 0.522
0.0ThrXaa: 0.0 ± 0.0
Val
4.514ValAla: 4.514 ± 0.557
0.396ValCys: 0.396 ± 0.208
4.752ValAsp: 4.752 ± 0.535
4.673ValGlu: 4.673 ± 0.621
2.297ValPhe: 2.297 ± 0.558
4.673ValGly: 4.673 ± 0.621
1.109ValHis: 1.109 ± 0.27
5.465ValIle: 5.465 ± 0.588
5.227ValLys: 5.227 ± 0.679
4.831ValLeu: 4.831 ± 0.729
2.138ValMet: 2.138 ± 0.388
3.01ValAsn: 3.01 ± 0.404
2.218ValPro: 2.218 ± 0.489
1.742ValGln: 1.742 ± 0.311
2.138ValArg: 2.138 ± 0.354
3.326ValSer: 3.326 ± 0.47
4.435ValThr: 4.435 ± 0.594
3.96ValVal: 3.96 ± 0.528
0.238ValTrp: 0.238 ± 0.118
1.742ValTyr: 1.742 ± 0.456
0.0ValXaa: 0.0 ± 0.0
Trp
0.792TrpAla: 0.792 ± 0.213
0.079TrpCys: 0.079 ± 0.084
1.267TrpAsp: 1.267 ± 0.315
0.871TrpGlu: 0.871 ± 0.273
0.634TrpPhe: 0.634 ± 0.217
0.634TrpGly: 0.634 ± 0.301
0.475TrpHis: 0.475 ± 0.21
0.871TrpIle: 0.871 ± 0.307
1.03TrpLys: 1.03 ± 0.289
0.95TrpLeu: 0.95 ± 0.376
0.158TrpMet: 0.158 ± 0.093
0.713TrpAsn: 0.713 ± 0.208
0.079TrpPro: 0.079 ± 0.088
0.634TrpGln: 0.634 ± 0.182
0.475TrpArg: 0.475 ± 0.193
0.634TrpSer: 0.634 ± 0.235
0.871TrpThr: 0.871 ± 0.247
0.634TrpVal: 0.634 ± 0.183
0.0TrpTrp: 0.0 ± 0.0
0.396TrpTyr: 0.396 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.614TyrAla: 2.614 ± 0.414
0.554TyrCys: 0.554 ± 0.237
3.406TyrAsp: 3.406 ± 0.481
1.663TyrGlu: 1.663 ± 0.332
0.792TyrPhe: 0.792 ± 0.228
1.901TyrGly: 1.901 ± 0.337
0.713TyrHis: 0.713 ± 0.311
2.218TyrIle: 2.218 ± 0.464
3.564TyrLys: 3.564 ± 0.62
2.772TyrLeu: 2.772 ± 0.494
0.396TyrMet: 0.396 ± 0.199
1.822TyrAsn: 1.822 ± 0.438
1.663TyrPro: 1.663 ± 0.323
2.93TyrGln: 2.93 ± 0.608
1.267TyrArg: 1.267 ± 0.321
2.059TyrSer: 2.059 ± 0.447
2.455TyrThr: 2.455 ± 0.631
1.901TyrVal: 1.901 ± 0.398
0.634TyrTrp: 0.634 ± 0.193
1.742TyrTyr: 1.742 ± 0.388
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12627 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski