Amino acid dipepetide frequency for Streptococcus phage Javan129

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.75AlaAla: 3.75 ± 1.284
0.408AlaCys: 0.408 ± 0.223
4.321AlaAsp: 4.321 ± 0.568
5.136AlaGlu: 5.136 ± 0.811
1.793AlaPhe: 1.793 ± 0.365
3.994AlaGly: 3.994 ± 0.929
0.734AlaHis: 0.734 ± 0.221
5.38AlaIle: 5.38 ± 0.661
6.44AlaLys: 6.44 ± 0.606
6.114AlaLeu: 6.114 ± 1.04
2.12AlaMet: 2.12 ± 0.491
3.668AlaAsn: 3.668 ± 0.578
1.304AlaPro: 1.304 ± 0.339
3.424AlaGln: 3.424 ± 0.596
2.69AlaArg: 2.69 ± 0.358
4.647AlaSer: 4.647 ± 0.887
3.668AlaThr: 3.668 ± 0.698
5.38AlaVal: 5.38 ± 0.998
0.734AlaTrp: 0.734 ± 0.293
2.69AlaTyr: 2.69 ± 0.407
0.0AlaXaa: 0.0 ± 0.0
Cys
0.163CysAla: 0.163 ± 0.112
0.0CysCys: 0.0 ± 0.0
0.082CysAsp: 0.082 ± 0.076
0.652CysGlu: 0.652 ± 0.245
0.489CysPhe: 0.489 ± 0.203
0.326CysGly: 0.326 ± 0.198
0.082CysHis: 0.082 ± 0.081
0.326CysIle: 0.326 ± 0.157
0.245CysLys: 0.245 ± 0.151
0.734CysLeu: 0.734 ± 0.277
0.163CysMet: 0.163 ± 0.111
0.245CysAsn: 0.245 ± 0.152
0.0CysPro: 0.0 ± 0.0
0.245CysGln: 0.245 ± 0.194
0.326CysArg: 0.326 ± 0.162
0.326CysSer: 0.326 ± 0.162
0.326CysThr: 0.326 ± 0.17
0.245CysVal: 0.245 ± 0.138
0.163CysTrp: 0.163 ± 0.106
0.245CysTyr: 0.245 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.179AspAla: 3.179 ± 0.402
0.326AspCys: 0.326 ± 0.175
4.973AspAsp: 4.973 ± 0.537
4.565AspGlu: 4.565 ± 0.782
3.098AspPhe: 3.098 ± 0.468
4.973AspGly: 4.973 ± 0.97
0.978AspHis: 0.978 ± 0.242
5.136AspIle: 5.136 ± 0.788
7.092AspLys: 7.092 ± 0.612
7.337AspLeu: 7.337 ± 0.695
1.386AspMet: 1.386 ± 0.304
3.831AspAsn: 3.831 ± 0.555
1.875AspPro: 1.875 ± 0.428
1.63AspGln: 1.63 ± 0.373
1.793AspArg: 1.793 ± 0.357
4.321AspSer: 4.321 ± 0.537
2.772AspThr: 2.772 ± 0.499
4.402AspVal: 4.402 ± 0.558
0.571AspTrp: 0.571 ± 0.205
3.179AspTyr: 3.179 ± 0.593
0.0AspXaa: 0.0 ± 0.0
Glu
5.706GluAla: 5.706 ± 0.705
0.326GluCys: 0.326 ± 0.162
3.342GluAsp: 3.342 ± 0.55
5.706GluGlu: 5.706 ± 0.78
2.853GluPhe: 2.853 ± 0.421
2.201GluGly: 2.201 ± 0.547
1.304GluHis: 1.304 ± 0.394
6.766GluIle: 6.766 ± 0.635
5.788GluLys: 5.788 ± 0.603
7.5GluLeu: 7.5 ± 0.855
1.712GluMet: 1.712 ± 0.326
3.668GluAsn: 3.668 ± 0.528
1.793GluPro: 1.793 ± 0.385
3.016GluGln: 3.016 ± 0.653
3.424GluArg: 3.424 ± 0.513
3.913GluSer: 3.913 ± 0.534
5.625GluThr: 5.625 ± 0.752
4.973GluVal: 4.973 ± 0.86
0.897GluTrp: 0.897 ± 0.229
2.772GluTyr: 2.772 ± 0.583
0.0GluXaa: 0.0 ± 0.0
Phe
2.201PheAla: 2.201 ± 0.386
0.245PheCys: 0.245 ± 0.143
2.853PheAsp: 2.853 ± 0.484
3.016PheGlu: 3.016 ± 0.522
0.734PhePhe: 0.734 ± 0.28
3.179PheGly: 3.179 ± 0.471
0.245PheHis: 0.245 ± 0.125
2.283PheIle: 2.283 ± 0.423
3.342PheLys: 3.342 ± 0.48
2.283PheLeu: 2.283 ± 0.543
0.978PheMet: 0.978 ± 0.267
2.609PheAsn: 2.609 ± 0.451
1.06PhePro: 1.06 ± 0.297
0.978PheGln: 0.978 ± 0.242
1.63PheArg: 1.63 ± 0.412
2.364PheSer: 2.364 ± 0.454
2.446PheThr: 2.446 ± 0.321
2.446PheVal: 2.446 ± 0.421
0.326PheTrp: 0.326 ± 0.178
1.141PheTyr: 1.141 ± 0.344
0.0PheXaa: 0.0 ± 0.0
Gly
3.587GlyAla: 3.587 ± 0.679
0.326GlyCys: 0.326 ± 0.152
3.994GlyAsp: 3.994 ± 0.714
3.587GlyGlu: 3.587 ± 0.537
2.527GlyPhe: 2.527 ± 0.51
4.402GlyGly: 4.402 ± 0.691
1.141GlyHis: 1.141 ± 0.313
4.728GlyIle: 4.728 ± 0.509
6.44GlyLys: 6.44 ± 0.654
6.114GlyLeu: 6.114 ± 0.776
1.63GlyMet: 1.63 ± 0.369
3.505GlyAsn: 3.505 ± 0.457
1.06GlyPro: 1.06 ± 0.31
2.69GlyGln: 2.69 ± 0.425
2.527GlyArg: 2.527 ± 0.43
2.609GlySer: 2.609 ± 0.464
3.342GlyThr: 3.342 ± 0.419
4.484GlyVal: 4.484 ± 0.555
1.06GlyTrp: 1.06 ± 0.313
3.016GlyTyr: 3.016 ± 0.614
0.0GlyXaa: 0.0 ± 0.0
His
0.734HisAla: 0.734 ± 0.309
0.163HisCys: 0.163 ± 0.116
1.467HisAsp: 1.467 ± 0.414
1.141HisGlu: 1.141 ± 0.285
0.571HisPhe: 0.571 ± 0.25
0.815HisGly: 0.815 ± 0.228
0.245HisHis: 0.245 ± 0.157
0.734HisIle: 0.734 ± 0.273
1.386HisLys: 1.386 ± 0.304
0.652HisLeu: 0.652 ± 0.234
0.163HisMet: 0.163 ± 0.116
1.06HisAsn: 1.06 ± 0.346
0.408HisPro: 0.408 ± 0.148
0.408HisGln: 0.408 ± 0.166
0.978HisArg: 0.978 ± 0.253
0.897HisSer: 0.897 ± 0.35
1.141HisThr: 1.141 ± 0.303
0.897HisVal: 0.897 ± 0.232
0.163HisTrp: 0.163 ± 0.111
0.489HisTyr: 0.489 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
5.136IleAla: 5.136 ± 0.568
0.408IleCys: 0.408 ± 0.161
5.951IleAsp: 5.951 ± 0.794
5.543IleGlu: 5.543 ± 0.616
1.956IlePhe: 1.956 ± 0.458
4.321IleGly: 4.321 ± 0.553
1.223IleHis: 1.223 ± 0.378
3.342IleIle: 3.342 ± 0.502
7.581IleLys: 7.581 ± 0.741
6.195IleLeu: 6.195 ± 0.912
0.978IleMet: 0.978 ± 0.303
4.565IleAsn: 4.565 ± 0.817
2.69IlePro: 2.69 ± 0.567
1.712IleGln: 1.712 ± 0.344
2.283IleArg: 2.283 ± 0.422
3.913IleSer: 3.913 ± 0.509
4.81IleThr: 4.81 ± 0.675
3.75IleVal: 3.75 ± 0.48
0.489IleTrp: 0.489 ± 0.2
2.364IleTyr: 2.364 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
6.522LysAla: 6.522 ± 0.774
0.326LysCys: 0.326 ± 0.165
5.136LysAsp: 5.136 ± 0.712
7.337LysGlu: 7.337 ± 0.885
3.016LysPhe: 3.016 ± 0.39
5.299LysGly: 5.299 ± 0.826
1.386LysHis: 1.386 ± 0.343
6.603LysIle: 6.603 ± 1.031
7.663LysLys: 7.663 ± 0.719
7.663LysLeu: 7.663 ± 0.872
2.527LysMet: 2.527 ± 0.438
4.647LysAsn: 4.647 ± 0.549
2.772LysPro: 2.772 ± 0.438
4.81LysGln: 4.81 ± 0.582
3.342LysArg: 3.342 ± 0.631
5.299LysSer: 5.299 ± 0.56
6.603LysThr: 6.603 ± 0.739
6.114LysVal: 6.114 ± 0.734
0.734LysTrp: 0.734 ± 0.223
3.505LysTyr: 3.505 ± 0.624
0.0LysXaa: 0.0 ± 0.0
Leu
6.44LeuAla: 6.44 ± 1.061
0.326LeuCys: 0.326 ± 0.169
7.011LeuAsp: 7.011 ± 0.731
7.744LeuGlu: 7.744 ± 0.88
2.609LeuPhe: 2.609 ± 0.54
6.195LeuGly: 6.195 ± 0.786
0.652LeuHis: 0.652 ± 0.273
5.299LeuIle: 5.299 ± 0.767
9.538LeuLys: 9.538 ± 0.908
6.44LeuLeu: 6.44 ± 0.763
2.283LeuMet: 2.283 ± 0.445
5.38LeuAsn: 5.38 ± 0.551
3.179LeuPro: 3.179 ± 0.485
3.668LeuGln: 3.668 ± 0.652
3.831LeuArg: 3.831 ± 0.504
6.195LeuSer: 6.195 ± 0.955
5.951LeuThr: 5.951 ± 0.624
4.484LeuVal: 4.484 ± 0.601
0.571LeuTrp: 0.571 ± 0.228
2.527LeuTyr: 2.527 ± 0.462
0.0LeuXaa: 0.0 ± 0.0
Met
2.283MetAla: 2.283 ± 0.433
0.0MetCys: 0.0 ± 0.0
0.978MetAsp: 0.978 ± 0.371
1.304MetGlu: 1.304 ± 0.326
1.06MetPhe: 1.06 ± 0.252
1.467MetGly: 1.467 ± 0.288
0.245MetHis: 0.245 ± 0.148
2.283MetIle: 2.283 ± 0.46
1.549MetLys: 1.549 ± 0.324
2.038MetLeu: 2.038 ± 0.445
0.489MetMet: 0.489 ± 0.24
0.978MetAsn: 0.978 ± 0.309
0.652MetPro: 0.652 ± 0.253
0.978MetGln: 0.978 ± 0.242
1.223MetArg: 1.223 ± 0.247
1.793MetSer: 1.793 ± 0.412
1.956MetThr: 1.956 ± 0.458
1.304MetVal: 1.304 ± 0.288
0.163MetTrp: 0.163 ± 0.126
0.571MetTyr: 0.571 ± 0.247
0.0MetXaa: 0.0 ± 0.0
Asn
3.831AsnAla: 3.831 ± 0.639
0.163AsnCys: 0.163 ± 0.16
3.505AsnAsp: 3.505 ± 0.517
3.75AsnGlu: 3.75 ± 0.657
1.956AsnPhe: 1.956 ± 0.372
4.565AsnGly: 4.565 ± 0.506
0.571AsnHis: 0.571 ± 0.185
3.75AsnIle: 3.75 ± 0.63
4.565AsnLys: 4.565 ± 0.65
5.869AsnLeu: 5.869 ± 0.686
1.304AsnMet: 1.304 ± 0.38
3.342AsnAsn: 3.342 ± 0.465
2.609AsnPro: 2.609 ± 0.436
2.853AsnGln: 2.853 ± 0.621
2.446AsnArg: 2.446 ± 0.443
3.424AsnSer: 3.424 ± 0.717
2.527AsnThr: 2.527 ± 0.457
3.179AsnVal: 3.179 ± 0.428
0.978AsnTrp: 0.978 ± 0.252
2.038AsnTyr: 2.038 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
1.549ProAla: 1.549 ± 0.419
0.245ProCys: 0.245 ± 0.13
1.712ProAsp: 1.712 ± 0.435
2.038ProGlu: 2.038 ± 0.405
1.141ProPhe: 1.141 ± 0.341
0.978ProGly: 0.978 ± 0.273
0.489ProHis: 0.489 ± 0.155
1.63ProIle: 1.63 ± 0.344
3.505ProLys: 3.505 ± 0.67
2.772ProLeu: 2.772 ± 0.651
0.408ProMet: 0.408 ± 0.227
1.549ProAsn: 1.549 ± 0.352
0.652ProPro: 0.652 ± 0.266
1.793ProGln: 1.793 ± 0.382
0.652ProArg: 0.652 ± 0.222
1.467ProSer: 1.467 ± 0.302
1.956ProThr: 1.956 ± 0.356
1.956ProVal: 1.956 ± 0.489
0.082ProTrp: 0.082 ± 0.078
1.141ProTyr: 1.141 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
3.016GlnAla: 3.016 ± 0.585
0.163GlnCys: 0.163 ± 0.101
1.63GlnAsp: 1.63 ± 0.311
3.587GlnGlu: 3.587 ± 0.538
1.712GlnPhe: 1.712 ± 0.355
1.793GlnGly: 1.793 ± 0.346
0.489GlnHis: 0.489 ± 0.199
3.179GlnIle: 3.179 ± 0.538
2.853GlnLys: 2.853 ± 0.5
4.891GlnLeu: 4.891 ± 0.584
1.549GlnMet: 1.549 ± 0.395
2.935GlnAsn: 2.935 ± 0.541
0.408GlnPro: 0.408 ± 0.194
1.467GlnGln: 1.467 ± 0.438
2.038GlnArg: 2.038 ± 0.364
3.913GlnSer: 3.913 ± 0.655
2.038GlnThr: 2.038 ± 0.495
1.63GlnVal: 1.63 ± 0.357
0.897GlnTrp: 0.897 ± 0.322
1.223GlnTyr: 1.223 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
2.69ArgAla: 2.69 ± 0.41
0.245ArgCys: 0.245 ± 0.136
2.527ArgAsp: 2.527 ± 0.564
2.609ArgGlu: 2.609 ± 0.513
1.549ArgPhe: 1.549 ± 0.27
2.69ArgGly: 2.69 ± 0.38
0.978ArgHis: 0.978 ± 0.258
3.179ArgIle: 3.179 ± 0.677
3.668ArgLys: 3.668 ± 0.726
3.75ArgLeu: 3.75 ± 0.612
0.734ArgMet: 0.734 ± 0.266
2.609ArgAsn: 2.609 ± 0.451
0.408ArgPro: 0.408 ± 0.164
1.712ArgGln: 1.712 ± 0.404
2.12ArgArg: 2.12 ± 0.487
1.956ArgSer: 1.956 ± 0.345
2.364ArgThr: 2.364 ± 0.563
2.364ArgVal: 2.364 ± 0.494
0.326ArgTrp: 0.326 ± 0.169
2.038ArgTyr: 2.038 ± 0.459
0.0ArgXaa: 0.0 ± 0.0
Ser
4.81SerAla: 4.81 ± 0.978
0.245SerCys: 0.245 ± 0.131
5.788SerAsp: 5.788 ± 0.627
3.75SerGlu: 3.75 ± 0.425
2.283SerPhe: 2.283 ± 0.438
4.973SerGly: 4.973 ± 0.732
1.06SerHis: 1.06 ± 0.295
3.913SerIle: 3.913 ± 0.691
5.38SerLys: 5.38 ± 0.667
4.565SerLeu: 4.565 ± 0.574
1.304SerMet: 1.304 ± 0.258
3.179SerAsn: 3.179 ± 0.724
1.386SerPro: 1.386 ± 0.324
3.098SerGln: 3.098 ± 0.393
2.283SerArg: 2.283 ± 0.368
2.772SerSer: 2.772 ± 0.43
2.772SerThr: 2.772 ± 0.493
3.831SerVal: 3.831 ± 0.636
0.571SerTrp: 0.571 ± 0.255
2.69SerTyr: 2.69 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
5.054ThrAla: 5.054 ± 0.927
0.326ThrCys: 0.326 ± 0.163
3.587ThrAsp: 3.587 ± 0.562
3.831ThrGlu: 3.831 ± 0.63
2.283ThrPhe: 2.283 ± 0.423
4.647ThrGly: 4.647 ± 0.508
0.652ThrHis: 0.652 ± 0.248
4.321ThrIle: 4.321 ± 0.52
5.462ThrLys: 5.462 ± 0.678
5.136ThrLeu: 5.136 ± 0.64
1.304ThrMet: 1.304 ± 0.396
3.261ThrAsn: 3.261 ± 0.498
2.283ThrPro: 2.283 ± 0.424
2.201ThrGln: 2.201 ± 0.35
2.038ThrArg: 2.038 ± 0.437
4.565ThrSer: 4.565 ± 0.562
3.913ThrThr: 3.913 ± 0.559
3.505ThrVal: 3.505 ± 0.496
0.734ThrTrp: 0.734 ± 0.192
1.875ThrTyr: 1.875 ± 0.449
0.0ThrXaa: 0.0 ± 0.0
Val
4.321ValAla: 4.321 ± 0.926
0.245ValCys: 0.245 ± 0.14
5.054ValAsp: 5.054 ± 0.68
4.973ValGlu: 4.973 ± 0.602
3.098ValPhe: 3.098 ± 0.448
3.098ValGly: 3.098 ± 0.526
0.815ValHis: 0.815 ± 0.238
3.587ValIle: 3.587 ± 0.543
4.973ValLys: 4.973 ± 0.77
6.359ValLeu: 6.359 ± 0.676
1.304ValMet: 1.304 ± 0.333
3.913ValAsn: 3.913 ± 0.618
1.467ValPro: 1.467 ± 0.333
1.875ValGln: 1.875 ± 0.472
2.609ValArg: 2.609 ± 0.479
3.994ValSer: 3.994 ± 0.509
3.994ValThr: 3.994 ± 0.475
3.505ValVal: 3.505 ± 0.477
0.571ValTrp: 0.571 ± 0.185
1.467ValTyr: 1.467 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
0.489TrpAla: 0.489 ± 0.198
0.163TrpCys: 0.163 ± 0.136
0.815TrpAsp: 0.815 ± 0.212
0.734TrpGlu: 0.734 ± 0.202
0.408TrpPhe: 0.408 ± 0.17
0.897TrpGly: 0.897 ± 0.27
0.163TrpHis: 0.163 ± 0.095
0.978TrpIle: 0.978 ± 0.263
0.734TrpLys: 0.734 ± 0.258
0.978TrpLeu: 0.978 ± 0.314
0.163TrpMet: 0.163 ± 0.123
0.489TrpAsn: 0.489 ± 0.184
0.408TrpPro: 0.408 ± 0.164
0.571TrpGln: 0.571 ± 0.191
0.489TrpArg: 0.489 ± 0.205
0.652TrpSer: 0.652 ± 0.213
0.652TrpThr: 0.652 ± 0.249
0.489TrpVal: 0.489 ± 0.18
0.0TrpTrp: 0.0 ± 0.0
0.326TrpTyr: 0.326 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.935TyrAla: 2.935 ± 0.429
0.734TyrCys: 0.734 ± 0.246
2.69TyrAsp: 2.69 ± 0.407
2.364TyrGlu: 2.364 ± 0.407
1.223TyrPhe: 1.223 ± 0.377
1.956TyrGly: 1.956 ± 0.397
1.06TyrHis: 1.06 ± 0.281
1.793TyrIle: 1.793 ± 0.513
3.098TyrLys: 3.098 ± 0.594
2.935TyrLeu: 2.935 ± 0.545
0.734TyrMet: 0.734 ± 0.256
1.793TyrAsn: 1.793 ± 0.426
1.304TyrPro: 1.304 ± 0.362
2.283TyrGln: 2.283 ± 0.445
1.793TyrArg: 1.793 ± 0.305
1.63TyrSer: 1.63 ± 0.32
2.12TyrThr: 2.12 ± 0.475
2.283TyrVal: 2.283 ± 0.405
0.571TyrTrp: 0.571 ± 0.219
2.12TyrTyr: 2.12 ± 0.468
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12268 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski