Amino acid dipepetide frequency for Streptococcus phage Javan242

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.474AlaAla: 3.474 ± 0.789
0.511AlaCys: 0.511 ± 0.215
4.394AlaAsp: 4.394 ± 0.58
5.62AlaGlu: 5.62 ± 0.778
3.065AlaPhe: 3.065 ± 0.53
5.211AlaGly: 5.211 ± 1.248
1.022AlaHis: 1.022 ± 0.271
5.62AlaIle: 5.62 ± 1.6
4.904AlaLys: 4.904 ± 0.684
4.7AlaLeu: 4.7 ± 0.669
1.533AlaMet: 1.533 ± 0.479
3.985AlaAsn: 3.985 ± 0.648
0.92AlaPro: 0.92 ± 0.317
3.167AlaGln: 3.167 ± 0.472
3.27AlaArg: 3.27 ± 0.488
3.985AlaSer: 3.985 ± 0.684
3.474AlaThr: 3.474 ± 0.532
5.211AlaVal: 5.211 ± 1.158
1.635AlaTrp: 1.635 ± 0.858
2.657AlaTyr: 2.657 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
0.409CysAla: 0.409 ± 0.186
0.102CysCys: 0.102 ± 0.098
0.715CysAsp: 0.715 ± 0.258
0.307CysGlu: 0.307 ± 0.163
0.204CysPhe: 0.204 ± 0.119
0.511CysGly: 0.511 ± 0.24
0.204CysHis: 0.204 ± 0.117
0.307CysIle: 0.307 ± 0.179
0.307CysLys: 0.307 ± 0.185
1.022CysLeu: 1.022 ± 0.268
0.102CysMet: 0.102 ± 0.09
0.102CysAsn: 0.102 ± 0.101
0.102CysPro: 0.102 ± 0.083
0.409CysGln: 0.409 ± 0.193
0.511CysArg: 0.511 ± 0.248
0.817CysSer: 0.817 ± 0.204
0.307CysThr: 0.307 ± 0.16
0.307CysVal: 0.307 ± 0.168
0.102CysTrp: 0.102 ± 0.09
0.307CysTyr: 0.307 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
3.27AspAla: 3.27 ± 0.602
0.511AspCys: 0.511 ± 0.225
3.065AspAsp: 3.065 ± 0.678
5.007AspGlu: 5.007 ± 0.804
3.678AspPhe: 3.678 ± 0.567
6.233AspGly: 6.233 ± 1.073
0.409AspHis: 0.409 ± 0.182
4.496AspIle: 4.496 ± 0.596
5.109AspLys: 5.109 ± 0.842
5.415AspLeu: 5.415 ± 0.689
1.124AspMet: 1.124 ± 0.341
3.372AspAsn: 3.372 ± 0.502
1.635AspPro: 1.635 ± 0.452
1.635AspGln: 1.635 ± 0.503
1.737AspArg: 1.737 ± 0.285
3.372AspSer: 3.372 ± 0.679
2.146AspThr: 2.146 ± 0.427
2.963AspVal: 2.963 ± 0.604
1.226AspTrp: 1.226 ± 0.317
3.27AspTyr: 3.27 ± 0.75
0.0AspXaa: 0.0 ± 0.0
Glu
5.313GluAla: 5.313 ± 0.732
0.204GluCys: 0.204 ± 0.126
3.985GluAsp: 3.985 ± 0.753
6.948GluGlu: 6.948 ± 1.575
3.065GluPhe: 3.065 ± 0.586
4.087GluGly: 4.087 ± 0.583
0.511GluHis: 0.511 ± 0.213
6.539GluIle: 6.539 ± 0.969
7.459GluLys: 7.459 ± 1.21
8.174GluLeu: 8.174 ± 1.477
2.861GluMet: 2.861 ± 0.641
5.518GluAsn: 5.518 ± 0.682
1.533GluPro: 1.533 ± 0.477
3.883GluGln: 3.883 ± 0.589
3.474GluArg: 3.474 ± 0.527
4.087GluSer: 4.087 ± 0.562
4.087GluThr: 4.087 ± 0.527
6.028GluVal: 6.028 ± 1.026
1.022GluTrp: 1.022 ± 0.349
2.657GluTyr: 2.657 ± 0.559
0.0GluXaa: 0.0 ± 0.0
Phe
3.065PheAla: 3.065 ± 0.461
0.307PheCys: 0.307 ± 0.174
3.065PheAsp: 3.065 ± 0.554
4.7PheGlu: 4.7 ± 0.619
2.452PhePhe: 2.452 ± 0.67
3.678PheGly: 3.678 ± 0.542
0.409PheHis: 0.409 ± 0.305
2.657PheIle: 2.657 ± 0.44
3.781PheLys: 3.781 ± 0.646
1.839PheLeu: 1.839 ± 0.575
1.226PheMet: 1.226 ± 0.346
2.146PheAsn: 2.146 ± 0.367
1.022PhePro: 1.022 ± 0.304
0.715PheGln: 0.715 ± 0.274
1.635PheArg: 1.635 ± 0.409
2.963PheSer: 2.963 ± 0.62
2.452PheThr: 2.452 ± 0.507
2.35PheVal: 2.35 ± 0.381
0.511PheTrp: 0.511 ± 0.227
1.226PheTyr: 1.226 ± 0.348
0.0PheXaa: 0.0 ± 0.0
Gly
4.904GlyAla: 4.904 ± 1.161
0.204GlyCys: 0.204 ± 0.166
3.474GlyAsp: 3.474 ± 0.535
5.109GlyGlu: 5.109 ± 0.67
3.576GlyPhe: 3.576 ± 1.314
4.802GlyGly: 4.802 ± 0.708
0.92GlyHis: 0.92 ± 0.314
7.459GlyIle: 7.459 ± 2.48
4.7GlyLys: 4.7 ± 0.622
5.211GlyLeu: 5.211 ± 1.118
1.737GlyMet: 1.737 ± 0.342
3.474GlyAsn: 3.474 ± 0.548
0.92GlyPro: 0.92 ± 0.291
3.985GlyGln: 3.985 ± 0.672
3.576GlyArg: 3.576 ± 0.721
2.963GlySer: 2.963 ± 0.437
4.087GlyThr: 4.087 ± 0.566
2.963GlyVal: 2.963 ± 0.695
1.43GlyTrp: 1.43 ± 0.321
3.883GlyTyr: 3.883 ± 0.66
0.0GlyXaa: 0.0 ± 0.0
His
0.409HisAla: 0.409 ± 0.197
0.613HisCys: 0.613 ± 0.239
0.715HisAsp: 0.715 ± 0.241
0.817HisGlu: 0.817 ± 0.302
0.715HisPhe: 0.715 ± 0.272
1.226HisGly: 1.226 ± 0.333
0.307HisHis: 0.307 ± 0.192
0.817HisIle: 0.817 ± 0.252
0.613HisLys: 0.613 ± 0.209
1.635HisLeu: 1.635 ± 0.482
0.0HisMet: 0.0 ± 0.0
0.613HisAsn: 0.613 ± 0.235
0.409HisPro: 0.409 ± 0.213
0.307HisGln: 0.307 ± 0.15
0.511HisArg: 0.511 ± 0.203
0.92HisSer: 0.92 ± 0.259
0.511HisThr: 0.511 ± 0.196
0.92HisVal: 0.92 ± 0.304
0.102HisTrp: 0.102 ± 0.097
1.124HisTyr: 1.124 ± 0.328
0.0HisXaa: 0.0 ± 0.0
Ile
4.904IleAla: 4.904 ± 0.986
0.613IleCys: 0.613 ± 0.235
5.824IleAsp: 5.824 ± 0.816
6.335IleGlu: 6.335 ± 0.821
2.657IlePhe: 2.657 ± 0.479
4.291IleGly: 4.291 ± 0.819
0.715IleHis: 0.715 ± 0.212
4.291IleIle: 4.291 ± 0.712
7.97IleLys: 7.97 ± 1.11
4.189IleLeu: 4.189 ± 0.558
1.43IleMet: 1.43 ± 0.58
3.781IleAsn: 3.781 ± 0.594
2.861IlePro: 2.861 ± 0.441
2.248IleGln: 2.248 ± 0.402
2.759IleArg: 2.759 ± 0.425
6.335IleSer: 6.335 ± 0.946
5.211IleThr: 5.211 ± 0.902
3.372IleVal: 3.372 ± 0.535
0.92IleTrp: 0.92 ± 0.321
2.044IleTyr: 2.044 ± 0.515
0.0IleXaa: 0.0 ± 0.0
Lys
6.335LysAla: 6.335 ± 1.01
0.307LysCys: 0.307 ± 0.171
5.007LysAsp: 5.007 ± 0.901
7.663LysGlu: 7.663 ± 1.251
2.452LysPhe: 2.452 ± 0.418
5.824LysGly: 5.824 ± 0.747
0.817LysHis: 0.817 ± 0.288
5.415LysIle: 5.415 ± 0.804
8.174LysLys: 8.174 ± 1.001
5.824LysLeu: 5.824 ± 0.605
2.657LysMet: 2.657 ± 0.627
5.313LysAsn: 5.313 ± 0.838
1.635LysPro: 1.635 ± 0.428
3.576LysGln: 3.576 ± 0.943
3.167LysArg: 3.167 ± 0.513
5.926LysSer: 5.926 ± 0.848
5.518LysThr: 5.518 ± 0.919
5.926LysVal: 5.926 ± 1.057
1.328LysTrp: 1.328 ± 0.312
4.189LysTyr: 4.189 ± 0.603
0.0LysXaa: 0.0 ± 0.0
Leu
5.926LeuAla: 5.926 ± 0.924
0.307LeuCys: 0.307 ± 0.136
4.394LeuAsp: 4.394 ± 0.573
7.05LeuGlu: 7.05 ± 1.176
4.087LeuPhe: 4.087 ± 0.628
6.028LeuGly: 6.028 ± 0.986
1.022LeuHis: 1.022 ± 0.299
4.904LeuIle: 4.904 ± 0.611
9.911LeuLys: 9.911 ± 0.991
6.335LeuLeu: 6.335 ± 0.723
1.635LeuMet: 1.635 ± 0.427
4.291LeuAsn: 4.291 ± 0.853
2.554LeuPro: 2.554 ± 0.65
3.065LeuGln: 3.065 ± 0.692
2.963LeuArg: 2.963 ± 0.717
4.802LeuSer: 4.802 ± 0.477
3.985LeuThr: 3.985 ± 0.579
3.678LeuVal: 3.678 ± 0.581
1.533LeuTrp: 1.533 ± 0.679
2.657LeuTyr: 2.657 ± 0.558
0.0LeuXaa: 0.0 ± 0.0
Met
1.737MetAla: 1.737 ± 0.644
0.204MetCys: 0.204 ± 0.131
1.226MetAsp: 1.226 ± 0.311
2.35MetGlu: 2.35 ± 0.478
0.613MetPhe: 0.613 ± 0.348
1.124MetGly: 1.124 ± 0.314
0.204MetHis: 0.204 ± 0.141
2.146MetIle: 2.146 ± 0.413
2.044MetLys: 2.044 ± 0.434
1.533MetLeu: 1.533 ± 0.365
0.715MetMet: 0.715 ± 0.408
1.226MetAsn: 1.226 ± 0.31
0.817MetPro: 0.817 ± 0.281
0.817MetGln: 0.817 ± 0.291
0.817MetArg: 0.817 ± 0.388
1.226MetSer: 1.226 ± 0.437
1.226MetThr: 1.226 ± 0.498
1.737MetVal: 1.737 ± 0.395
0.307MetTrp: 0.307 ± 0.191
0.715MetTyr: 0.715 ± 0.262
0.0MetXaa: 0.0 ± 0.0
Asn
3.781AsnAla: 3.781 ± 0.476
0.511AsnCys: 0.511 ± 0.234
2.146AsnAsp: 2.146 ± 0.593
4.7AsnGlu: 4.7 ± 0.85
1.737AsnPhe: 1.737 ± 0.507
5.007AsnGly: 5.007 ± 0.843
0.817AsnHis: 0.817 ± 0.305
3.781AsnIle: 3.781 ± 0.495
3.985AsnLys: 3.985 ± 0.656
5.211AsnLeu: 5.211 ± 0.81
1.022AsnMet: 1.022 ± 0.292
2.963AsnAsn: 2.963 ± 0.554
2.248AsnPro: 2.248 ± 0.547
2.657AsnGln: 2.657 ± 0.578
2.248AsnArg: 2.248 ± 0.47
3.474AsnSer: 3.474 ± 0.484
3.372AsnThr: 3.372 ± 0.577
4.496AsnVal: 4.496 ± 0.633
0.817AsnTrp: 0.817 ± 0.322
1.737AsnTyr: 1.737 ± 0.473
0.0AsnXaa: 0.0 ± 0.0
Pro
0.92ProAla: 0.92 ± 0.25
0.0ProCys: 0.0 ± 0.0
1.941ProAsp: 1.941 ± 0.494
2.35ProGlu: 2.35 ± 0.471
1.328ProPhe: 1.328 ± 0.345
1.124ProGly: 1.124 ± 0.392
0.204ProHis: 0.204 ± 0.143
2.35ProIle: 2.35 ± 0.505
2.044ProLys: 2.044 ± 0.53
2.248ProLeu: 2.248 ± 0.487
0.613ProMet: 0.613 ± 0.224
1.941ProAsn: 1.941 ± 0.414
0.307ProPro: 0.307 ± 0.178
0.409ProGln: 0.409 ± 0.191
0.92ProArg: 0.92 ± 0.307
2.248ProSer: 2.248 ± 0.547
1.124ProThr: 1.124 ± 0.419
1.737ProVal: 1.737 ± 0.379
0.204ProTrp: 0.204 ± 0.166
0.817ProTyr: 0.817 ± 0.244
0.0ProXaa: 0.0 ± 0.0
Gln
3.576GlnAla: 3.576 ± 0.627
0.102GlnCys: 0.102 ± 0.09
1.941GlnAsp: 1.941 ± 0.447
4.087GlnGlu: 4.087 ± 0.654
0.92GlnPhe: 0.92 ± 0.335
2.044GlnGly: 2.044 ± 0.415
0.715GlnHis: 0.715 ± 0.247
2.759GlnIle: 2.759 ± 0.528
3.678GlnLys: 3.678 ± 0.891
2.248GlnLeu: 2.248 ± 0.681
0.92GlnMet: 0.92 ± 0.367
2.861GlnAsn: 2.861 ± 0.571
1.226GlnPro: 1.226 ± 0.307
0.817GlnGln: 0.817 ± 0.312
1.839GlnArg: 1.839 ± 0.364
2.554GlnSer: 2.554 ± 0.442
2.554GlnThr: 2.554 ± 0.609
2.861GlnVal: 2.861 ± 0.556
0.204GlnTrp: 0.204 ± 0.127
1.022GlnTyr: 1.022 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
2.044ArgAla: 2.044 ± 0.496
0.511ArgCys: 0.511 ± 0.181
2.657ArgAsp: 2.657 ± 0.615
2.35ArgGlu: 2.35 ± 0.394
1.737ArgPhe: 1.737 ± 0.427
2.044ArgGly: 2.044 ± 0.399
1.328ArgHis: 1.328 ± 0.373
3.372ArgIle: 3.372 ± 0.646
5.415ArgLys: 5.415 ± 1.054
4.087ArgLeu: 4.087 ± 0.74
1.226ArgMet: 1.226 ± 0.327
2.963ArgAsn: 2.963 ± 0.541
1.022ArgPro: 1.022 ± 0.437
1.328ArgGln: 1.328 ± 0.316
1.941ArgArg: 1.941 ± 0.485
2.248ArgSer: 2.248 ± 0.466
1.839ArgThr: 1.839 ± 0.5
2.248ArgVal: 2.248 ± 0.498
0.613ArgTrp: 0.613 ± 0.242
2.35ArgTyr: 2.35 ± 0.593
0.0ArgXaa: 0.0 ± 0.0
Ser
5.211SerAla: 5.211 ± 1.389
0.715SerCys: 0.715 ± 0.3
3.883SerAsp: 3.883 ± 0.563
4.291SerGlu: 4.291 ± 0.632
2.861SerPhe: 2.861 ± 0.557
5.62SerGly: 5.62 ± 0.958
1.124SerHis: 1.124 ± 0.342
3.576SerIle: 3.576 ± 0.637
4.189SerLys: 4.189 ± 0.58
5.313SerLeu: 5.313 ± 0.779
1.737SerMet: 1.737 ± 0.58
2.963SerAsn: 2.963 ± 0.485
1.839SerPro: 1.839 ± 0.428
2.759SerGln: 2.759 ± 0.519
2.452SerArg: 2.452 ± 0.656
2.759SerSer: 2.759 ± 0.626
3.474SerThr: 3.474 ± 0.543
3.372SerVal: 3.372 ± 0.681
0.613SerTrp: 0.613 ± 0.22
2.861SerTyr: 2.861 ± 0.45
0.0SerXaa: 0.0 ± 0.0
Thr
4.904ThrAla: 4.904 ± 1.048
0.307ThrCys: 0.307 ± 0.148
3.781ThrAsp: 3.781 ± 0.638
2.861ThrGlu: 2.861 ± 0.414
3.27ThrPhe: 3.27 ± 0.619
3.781ThrGly: 3.781 ± 0.793
0.715ThrHis: 0.715 ± 0.302
4.394ThrIle: 4.394 ± 0.748
3.576ThrLys: 3.576 ± 0.438
6.846ThrLeu: 6.846 ± 0.821
0.715ThrMet: 0.715 ± 0.306
3.065ThrAsn: 3.065 ± 0.527
0.92ThrPro: 0.92 ± 0.33
2.044ThrGln: 2.044 ± 0.377
2.759ThrArg: 2.759 ± 0.548
3.372ThrSer: 3.372 ± 0.671
3.678ThrThr: 3.678 ± 0.759
3.576ThrVal: 3.576 ± 0.576
0.409ThrTrp: 0.409 ± 0.177
1.124ThrTyr: 1.124 ± 0.292
0.0ThrXaa: 0.0 ± 0.0
Val
4.802ValAla: 4.802 ± 0.843
0.307ValCys: 0.307 ± 0.185
3.372ValAsp: 3.372 ± 0.682
4.7ValGlu: 4.7 ± 0.864
1.533ValPhe: 1.533 ± 0.324
2.759ValGly: 2.759 ± 0.481
0.92ValHis: 0.92 ± 0.253
3.985ValIle: 3.985 ± 0.715
5.62ValLys: 5.62 ± 0.564
5.415ValLeu: 5.415 ± 0.666
0.613ValMet: 0.613 ± 0.224
2.963ValAsn: 2.963 ± 0.655
1.737ValPro: 1.737 ± 0.414
2.452ValGln: 2.452 ± 0.501
3.576ValArg: 3.576 ± 0.708
4.598ValSer: 4.598 ± 0.602
4.496ValThr: 4.496 ± 0.639
3.883ValVal: 3.883 ± 0.637
1.124ValTrp: 1.124 ± 0.597
1.941ValTyr: 1.941 ± 0.331
0.0ValXaa: 0.0 ± 0.0
Trp
1.022TrpAla: 1.022 ± 0.38
0.204TrpCys: 0.204 ± 0.148
1.226TrpAsp: 1.226 ± 0.562
1.328TrpGlu: 1.328 ± 0.321
0.613TrpPhe: 0.613 ± 0.246
1.022TrpGly: 1.022 ± 0.328
0.204TrpHis: 0.204 ± 0.143
1.226TrpIle: 1.226 ± 0.447
0.92TrpLys: 0.92 ± 0.306
1.022TrpLeu: 1.022 ± 0.309
0.204TrpMet: 0.204 ± 0.144
1.43TrpAsn: 1.43 ± 0.846
0.307TrpPro: 0.307 ± 0.172
0.715TrpGln: 0.715 ± 0.19
0.613TrpArg: 0.613 ± 0.216
0.817TrpSer: 0.817 ± 0.261
0.613TrpThr: 0.613 ± 0.2
0.92TrpVal: 0.92 ± 0.265
0.102TrpTrp: 0.102 ± 0.109
0.102TrpTyr: 0.102 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.35TyrAla: 2.35 ± 0.509
0.613TyrCys: 0.613 ± 0.28
2.861TyrAsp: 2.861 ± 0.488
2.861TyrGlu: 2.861 ± 0.463
1.635TyrPhe: 1.635 ± 0.396
2.963TyrGly: 2.963 ± 0.499
0.715TyrHis: 0.715 ± 0.238
2.657TyrIle: 2.657 ± 0.48
2.759TyrLys: 2.759 ± 0.587
2.861TyrLeu: 2.861 ± 0.489
0.613TyrMet: 0.613 ± 0.238
1.737TyrAsn: 1.737 ± 0.392
0.817TyrPro: 0.817 ± 0.312
1.941TyrGln: 1.941 ± 0.43
2.554TyrArg: 2.554 ± 0.51
2.146TyrSer: 2.146 ± 0.645
1.941TyrThr: 1.941 ± 0.455
2.248TyrVal: 2.248 ± 0.39
0.409TyrTrp: 0.409 ± 0.209
1.226TyrTyr: 1.226 ± 0.419
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9788 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski