Amino acid dipepetide frequency for Streptococcus satellite phage Javan730

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.569AlaAla: 0.569 ± 0.499
0.284AlaCys: 0.284 ± 0.266
1.422AlaAsp: 1.422 ± 0.574
5.121AlaGlu: 5.121 ± 1.756
2.845AlaPhe: 2.845 ± 0.76
2.276AlaGly: 2.276 ± 0.567
0.284AlaHis: 0.284 ± 0.281
3.698AlaIle: 3.698 ± 1.114
9.104AlaLys: 9.104 ± 1.628
2.845AlaLeu: 2.845 ± 0.791
2.845AlaMet: 2.845 ± 0.789
2.56AlaAsn: 2.56 ± 0.959
0.569AlaPro: 0.569 ± 0.356
1.991AlaGln: 1.991 ± 0.633
2.845AlaArg: 2.845 ± 0.96
0.853AlaSer: 0.853 ± 0.439
4.267AlaThr: 4.267 ± 1.045
2.56AlaVal: 2.56 ± 0.816
0.284AlaTrp: 0.284 ± 0.297
2.56AlaTyr: 2.56 ± 0.733
0.0AlaXaa: 0.0 ± 0.0
Cys
1.138CysAla: 1.138 ± 0.532
0.0CysCys: 0.0 ± 0.0
0.284CysAsp: 0.284 ± 0.281
0.569CysGlu: 0.569 ± 0.407
0.0CysPhe: 0.0 ± 0.0
1.138CysGly: 1.138 ± 0.591
0.284CysHis: 0.284 ± 0.247
0.284CysIle: 0.284 ± 0.25
0.284CysLys: 0.284 ± 0.301
0.853CysLeu: 0.853 ± 0.463
0.284CysMet: 0.284 ± 0.261
0.284CysAsn: 0.284 ± 0.256
0.853CysPro: 0.853 ± 0.335
0.853CysGln: 0.853 ± 0.902
0.853CysArg: 0.853 ± 0.402
0.284CysSer: 0.284 ± 0.266
0.569CysThr: 0.569 ± 0.424
0.284CysVal: 0.284 ± 0.256
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.853AspAla: 0.853 ± 0.456
1.422AspCys: 1.422 ± 0.452
4.552AspAsp: 4.552 ± 1.052
3.983AspGlu: 3.983 ± 1.294
4.267AspPhe: 4.267 ± 0.989
4.267AspGly: 4.267 ± 1.088
0.0AspHis: 0.0 ± 0.0
6.259AspIle: 6.259 ± 1.233
6.543AspLys: 6.543 ± 1.685
6.259AspLeu: 6.259 ± 0.942
0.569AspMet: 0.569 ± 0.423
4.267AspAsn: 4.267 ± 0.831
1.138AspPro: 1.138 ± 0.483
0.853AspGln: 0.853 ± 0.454
1.422AspArg: 1.422 ± 0.608
5.405AspSer: 5.405 ± 1.588
2.276AspThr: 2.276 ± 0.712
3.129AspVal: 3.129 ± 0.779
0.284AspTrp: 0.284 ± 0.256
1.422AspTyr: 1.422 ± 0.793
0.0AspXaa: 0.0 ± 0.0
Glu
3.129GluAla: 3.129 ± 0.884
1.138GluCys: 1.138 ± 0.485
3.983GluAsp: 3.983 ± 1.255
5.974GluGlu: 5.974 ± 1.572
3.414GluPhe: 3.414 ± 0.727
3.698GluGly: 3.698 ± 0.967
1.422GluHis: 1.422 ± 0.964
8.819GluIle: 8.819 ± 1.327
9.957GluLys: 9.957 ± 1.798
11.949GluLeu: 11.949 ± 1.813
1.991GluMet: 1.991 ± 0.712
4.836GluAsn: 4.836 ± 1.086
1.422GluPro: 1.422 ± 0.68
3.698GluGln: 3.698 ± 0.737
3.129GluArg: 3.129 ± 0.953
7.397GluSer: 7.397 ± 1.533
5.121GluThr: 5.121 ± 0.998
3.129GluVal: 3.129 ± 0.955
0.284GluTrp: 0.284 ± 0.25
4.267GluTyr: 4.267 ± 1.265
0.0GluXaa: 0.0 ± 0.0
Phe
1.138PheAla: 1.138 ± 0.478
0.853PheCys: 0.853 ± 0.532
3.983PheAsp: 3.983 ± 1.061
4.267PheGlu: 4.267 ± 1.394
2.56PhePhe: 2.56 ± 0.894
1.422PheGly: 1.422 ± 0.547
0.284PheHis: 0.284 ± 0.247
3.129PheIle: 3.129 ± 0.79
5.405PheLys: 5.405 ± 1.062
4.552PheLeu: 4.552 ± 1.019
1.138PheMet: 1.138 ± 0.6
1.991PheAsn: 1.991 ± 0.977
0.853PhePro: 0.853 ± 0.525
0.569PheGln: 0.569 ± 0.403
1.707PheArg: 1.707 ± 0.628
3.698PheSer: 3.698 ± 1.136
1.991PheThr: 1.991 ± 0.672
2.276PheVal: 2.276 ± 1.122
0.853PheTrp: 0.853 ± 0.475
1.138PheTyr: 1.138 ± 0.724
0.0PheXaa: 0.0 ± 0.0
Gly
2.845GlyAla: 2.845 ± 0.953
0.284GlyCys: 0.284 ± 0.301
2.56GlyAsp: 2.56 ± 1.047
4.552GlyGlu: 4.552 ± 1.025
2.845GlyPhe: 2.845 ± 0.88
1.991GlyGly: 1.991 ± 0.611
1.707GlyHis: 1.707 ± 0.651
2.845GlyIle: 2.845 ± 0.731
5.974GlyLys: 5.974 ± 1.003
3.698GlyLeu: 3.698 ± 0.777
1.707GlyMet: 1.707 ± 0.774
1.138GlyAsn: 1.138 ± 0.631
0.0GlyPro: 0.0 ± 0.0
1.991GlyGln: 1.991 ± 0.734
2.56GlyArg: 2.56 ± 0.743
1.991GlySer: 1.991 ± 0.616
2.276GlyThr: 2.276 ± 0.771
3.698GlyVal: 3.698 ± 0.993
0.569GlyTrp: 0.569 ± 0.378
5.121GlyTyr: 5.121 ± 1.534
0.0GlyXaa: 0.0 ± 0.0
His
1.707HisAla: 1.707 ± 0.71
0.0HisCys: 0.0 ± 0.0
0.284HisAsp: 0.284 ± 0.266
0.569HisGlu: 0.569 ± 0.387
0.569HisPhe: 0.569 ± 0.447
0.284HisGly: 0.284 ± 0.281
0.569HisHis: 0.569 ± 0.399
1.138HisIle: 1.138 ± 0.527
1.138HisLys: 1.138 ± 0.569
2.845HisLeu: 2.845 ± 1.157
0.284HisMet: 0.284 ± 0.342
1.422HisAsn: 1.422 ± 0.712
0.569HisPro: 0.569 ± 0.347
0.569HisGln: 0.569 ± 0.371
0.284HisArg: 0.284 ± 0.297
0.853HisSer: 0.853 ± 0.524
1.422HisThr: 1.422 ± 0.544
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.414IleAla: 3.414 ± 1.349
0.0IleCys: 0.0 ± 0.0
3.698IleAsp: 3.698 ± 0.997
5.405IleGlu: 5.405 ± 1.391
3.129IlePhe: 3.129 ± 0.954
2.276IleGly: 2.276 ± 0.921
1.138IleHis: 1.138 ± 0.512
5.69IleIle: 5.69 ± 1.504
8.535IleLys: 8.535 ± 1.442
7.112IleLeu: 7.112 ± 1.648
0.853IleMet: 0.853 ± 0.414
3.414IleAsn: 3.414 ± 0.988
3.698IlePro: 3.698 ± 0.8
3.698IleGln: 3.698 ± 0.863
2.845IleArg: 2.845 ± 0.922
3.414IleSer: 3.414 ± 1.086
3.983IleThr: 3.983 ± 0.863
2.56IleVal: 2.56 ± 0.722
0.284IleTrp: 0.284 ± 0.281
3.698IleTyr: 3.698 ± 0.837
0.0IleXaa: 0.0 ± 0.0
Lys
8.819LysAla: 8.819 ± 1.557
0.853LysCys: 0.853 ± 0.444
5.974LysAsp: 5.974 ± 1.173
11.949LysGlu: 11.949 ± 1.459
3.983LysPhe: 3.983 ± 1.074
6.828LysGly: 6.828 ± 1.03
2.56LysHis: 2.56 ± 1.105
7.681LysIle: 7.681 ± 1.779
11.949LysLys: 11.949 ± 1.693
7.966LysLeu: 7.966 ± 1.487
2.56LysMet: 2.56 ± 0.655
7.112LysAsn: 7.112 ± 1.242
1.991LysPro: 1.991 ± 0.92
3.414LysGln: 3.414 ± 1.083
6.259LysArg: 6.259 ± 1.661
5.69LysSer: 5.69 ± 0.983
7.397LysThr: 7.397 ± 1.482
3.698LysVal: 3.698 ± 0.707
1.138LysTrp: 1.138 ± 0.499
4.552LysTyr: 4.552 ± 1.201
0.0LysXaa: 0.0 ± 0.0
Leu
6.259LeuAla: 6.259 ± 1.444
1.138LeuCys: 1.138 ± 0.467
9.104LeuAsp: 9.104 ± 1.342
10.811LeuGlu: 10.811 ± 1.471
3.129LeuPhe: 3.129 ± 0.969
5.405LeuGly: 5.405 ± 1.128
0.569LeuHis: 0.569 ± 0.436
5.974LeuIle: 5.974 ± 1.365
8.819LeuLys: 8.819 ± 1.271
7.966LeuLeu: 7.966 ± 2.076
2.276LeuMet: 2.276 ± 0.565
6.259LeuAsn: 6.259 ± 1.515
2.56LeuPro: 2.56 ± 0.9
3.129LeuGln: 3.129 ± 0.962
5.974LeuArg: 5.974 ± 0.901
5.405LeuSer: 5.405 ± 1.262
7.112LeuThr: 7.112 ± 1.665
5.121LeuVal: 5.121 ± 1.05
0.284LeuTrp: 0.284 ± 0.274
3.129LeuTyr: 3.129 ± 1.027
0.0LeuXaa: 0.0 ± 0.0
Met
1.422MetAla: 1.422 ± 0.797
0.284MetCys: 0.284 ± 0.3
2.276MetAsp: 2.276 ± 0.647
2.56MetGlu: 2.56 ± 0.924
0.569MetPhe: 0.569 ± 0.392
1.422MetGly: 1.422 ± 0.55
0.0MetHis: 0.0 ± 0.0
1.422MetIle: 1.422 ± 0.568
1.991MetLys: 1.991 ± 0.592
1.422MetLeu: 1.422 ± 0.464
0.569MetMet: 0.569 ± 0.354
1.422MetAsn: 1.422 ± 0.522
0.853MetPro: 0.853 ± 0.431
0.569MetGln: 0.569 ± 0.358
1.991MetArg: 1.991 ± 0.757
1.707MetSer: 1.707 ± 0.725
2.56MetThr: 2.56 ± 1.104
0.284MetVal: 0.284 ± 0.25
0.284MetTrp: 0.284 ± 0.231
0.284MetTyr: 0.284 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
3.698AsnAla: 3.698 ± 0.832
0.284AsnCys: 0.284 ± 0.247
1.707AsnAsp: 1.707 ± 0.629
1.707AsnGlu: 1.707 ± 0.688
2.56AsnPhe: 2.56 ± 0.741
2.56AsnGly: 2.56 ± 0.899
1.138AsnHis: 1.138 ± 0.632
2.276AsnIle: 2.276 ± 1.018
6.259AsnLys: 6.259 ± 1.692
5.69AsnLeu: 5.69 ± 1.4
2.56AsnMet: 2.56 ± 0.799
1.991AsnAsn: 1.991 ± 0.777
1.707AsnPro: 1.707 ± 0.862
2.276AsnGln: 2.276 ± 0.74
3.414AsnArg: 3.414 ± 0.931
3.414AsnSer: 3.414 ± 1.071
3.983AsnThr: 3.983 ± 1.436
1.991AsnVal: 1.991 ± 0.977
0.853AsnTrp: 0.853 ± 0.491
3.129AsnTyr: 3.129 ± 0.699
0.0AsnXaa: 0.0 ± 0.0
Pro
1.422ProAla: 1.422 ± 0.714
0.0ProCys: 0.0 ± 0.0
2.845ProAsp: 2.845 ± 0.784
2.845ProGlu: 2.845 ± 1.048
1.138ProPhe: 1.138 ± 0.353
0.569ProGly: 0.569 ± 0.367
0.569ProHis: 0.569 ± 0.398
0.853ProIle: 0.853 ± 0.502
4.267ProLys: 4.267 ± 1.137
0.853ProLeu: 0.853 ± 0.417
0.569ProMet: 0.569 ± 0.352
1.707ProAsn: 1.707 ± 0.612
0.853ProPro: 0.853 ± 0.593
0.853ProGln: 0.853 ± 0.433
1.991ProArg: 1.991 ± 0.622
0.569ProSer: 0.569 ± 0.349
0.853ProThr: 0.853 ± 0.448
0.853ProVal: 0.853 ± 0.408
0.0ProTrp: 0.0 ± 0.0
0.569ProTyr: 0.569 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
3.698GlnAla: 3.698 ± 1.028
0.569GlnCys: 0.569 ± 0.433
1.707GlnAsp: 1.707 ± 0.649
3.698GlnGlu: 3.698 ± 0.81
1.707GlnPhe: 1.707 ± 0.836
1.422GlnGly: 1.422 ± 0.589
0.284GlnHis: 0.284 ± 0.301
2.276GlnIle: 2.276 ± 0.553
4.267GlnLys: 4.267 ± 1.018
2.845GlnLeu: 2.845 ± 0.994
0.853GlnMet: 0.853 ± 0.428
1.422GlnAsn: 1.422 ± 0.687
0.853GlnPro: 0.853 ± 0.492
1.422GlnGln: 1.422 ± 0.686
1.422GlnArg: 1.422 ± 0.728
0.853GlnSer: 0.853 ± 0.415
1.707GlnThr: 1.707 ± 0.721
3.983GlnVal: 3.983 ± 0.842
0.569GlnTrp: 0.569 ± 0.361
0.853GlnTyr: 0.853 ± 0.601
0.0GlnXaa: 0.0 ± 0.0
Arg
1.707ArgAla: 1.707 ± 0.709
0.0ArgCys: 0.0 ± 0.0
2.56ArgAsp: 2.56 ± 0.653
4.552ArgGlu: 4.552 ± 1.184
2.276ArgPhe: 2.276 ± 0.811
2.56ArgGly: 2.56 ± 0.751
0.569ArgHis: 0.569 ± 0.338
2.845ArgIle: 2.845 ± 0.573
5.974ArgLys: 5.974 ± 1.387
5.69ArgLeu: 5.69 ± 1.398
1.138ArgMet: 1.138 ± 0.573
3.414ArgAsn: 3.414 ± 1.152
1.422ArgPro: 1.422 ± 0.57
3.414ArgGln: 3.414 ± 0.926
2.276ArgArg: 2.276 ± 0.837
1.991ArgSer: 1.991 ± 0.706
2.56ArgThr: 2.56 ± 1.033
1.422ArgVal: 1.422 ± 0.67
0.284ArgTrp: 0.284 ± 0.284
2.845ArgTyr: 2.845 ± 1.27
0.0ArgXaa: 0.0 ± 0.0
Ser
1.991SerAla: 1.991 ± 1.381
0.853SerCys: 0.853 ± 0.517
4.267SerAsp: 4.267 ± 0.933
6.828SerGlu: 6.828 ± 1.872
2.56SerPhe: 2.56 ± 1.203
3.129SerGly: 3.129 ± 0.703
0.853SerHis: 0.853 ± 0.352
3.698SerIle: 3.698 ± 1.145
5.405SerLys: 5.405 ± 1.103
7.681SerLeu: 7.681 ± 1.191
0.853SerMet: 0.853 ± 0.539
2.276SerAsn: 2.276 ± 0.712
1.707SerPro: 1.707 ± 0.603
1.991SerGln: 1.991 ± 0.627
2.845SerArg: 2.845 ± 0.875
3.698SerSer: 3.698 ± 1.21
2.845SerThr: 2.845 ± 0.897
2.276SerVal: 2.276 ± 0.491
0.284SerTrp: 0.284 ± 0.261
1.138SerTyr: 1.138 ± 0.56
0.0SerXaa: 0.0 ± 0.0
Thr
1.138ThrAla: 1.138 ± 0.688
0.0ThrCys: 0.0 ± 0.0
1.707ThrAsp: 1.707 ± 0.556
4.267ThrGlu: 4.267 ± 0.975
1.991ThrPhe: 1.991 ± 0.515
4.267ThrGly: 4.267 ± 1.05
1.991ThrHis: 1.991 ± 0.671
3.983ThrIle: 3.983 ± 1.205
6.259ThrLys: 6.259 ± 1.179
7.397ThrLeu: 7.397 ± 1.13
1.138ThrMet: 1.138 ± 0.466
2.56ThrAsn: 2.56 ± 0.721
1.422ThrPro: 1.422 ± 0.47
1.138ThrGln: 1.138 ± 0.668
2.845ThrArg: 2.845 ± 0.826
3.698ThrSer: 3.698 ± 0.995
3.414ThrThr: 3.414 ± 1.235
6.543ThrVal: 6.543 ± 1.512
0.569ThrTrp: 0.569 ± 0.362
3.129ThrTyr: 3.129 ± 1.206
0.0ThrXaa: 0.0 ± 0.0
Val
2.845ValAla: 2.845 ± 0.804
0.284ValCys: 0.284 ± 0.281
3.698ValAsp: 3.698 ± 1.307
3.983ValGlu: 3.983 ± 0.637
1.707ValPhe: 1.707 ± 0.934
3.414ValGly: 3.414 ± 1.129
0.0ValHis: 0.0 ± 0.0
3.414ValIle: 3.414 ± 1.023
5.69ValLys: 5.69 ± 1.324
3.698ValLeu: 3.698 ± 0.925
0.569ValMet: 0.569 ± 0.382
2.276ValAsn: 2.276 ± 0.512
0.853ValPro: 0.853 ± 0.437
1.991ValGln: 1.991 ± 1.032
1.707ValArg: 1.707 ± 0.673
3.129ValSer: 3.129 ± 0.905
2.845ValThr: 2.845 ± 0.923
2.56ValVal: 2.56 ± 0.686
0.569ValTrp: 0.569 ± 0.416
3.698ValTyr: 3.698 ± 0.959
0.0ValXaa: 0.0 ± 0.0
Trp
0.853TrpAla: 0.853 ± 0.335
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.991TrpGlu: 1.991 ± 0.72
0.0TrpPhe: 0.0 ± 0.0
0.284TrpGly: 0.284 ± 0.274
0.0TrpHis: 0.0 ± 0.0
0.284TrpIle: 0.284 ± 0.261
0.853TrpLys: 0.853 ± 0.485
1.422TrpLeu: 1.422 ± 0.602
0.0TrpMet: 0.0 ± 0.0
0.284TrpAsn: 0.284 ± 0.284
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.284TrpArg: 0.284 ± 0.345
0.569TrpSer: 0.569 ± 0.337
0.0TrpThr: 0.0 ± 0.0
0.853TrpVal: 0.853 ± 0.445
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.138TyrAla: 1.138 ± 0.634
0.853TyrCys: 0.853 ± 0.516
2.845TyrAsp: 2.845 ± 0.717
3.129TyrGlu: 3.129 ± 0.693
2.56TyrPhe: 2.56 ± 0.743
1.138TyrGly: 1.138 ± 0.495
0.284TyrHis: 0.284 ± 0.231
2.276TyrIle: 2.276 ± 0.952
3.698TyrLys: 3.698 ± 1.342
7.681TyrLeu: 7.681 ± 1.452
0.853TyrMet: 0.853 ± 0.432
2.56TyrAsn: 2.56 ± 0.693
0.853TyrPro: 0.853 ± 0.471
1.991TyrGln: 1.991 ± 0.621
2.845TyrArg: 2.845 ± 0.909
2.56TyrSer: 2.56 ± 0.852
2.276TyrThr: 2.276 ± 0.579
1.707TyrVal: 1.707 ± 0.742
0.284TyrTrp: 0.284 ± 0.256
0.853TyrTyr: 0.853 ± 0.442
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3516 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski