Amino acid dipepetide frequency for Streptococcus satellite phage Javan480

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.29AlaAla: 0.29 ± 0.293
1.161AlaCys: 1.161 ± 0.534
3.194AlaAsp: 3.194 ± 1.238
4.646AlaGlu: 4.646 ± 1.307
3.194AlaPhe: 3.194 ± 0.899
1.742AlaGly: 1.742 ± 0.66
0.0AlaHis: 0.0 ± 0.0
6.098AlaIle: 6.098 ± 1.299
5.226AlaLys: 5.226 ± 1.205
4.355AlaLeu: 4.355 ± 1.043
1.161AlaMet: 1.161 ± 0.533
4.355AlaAsn: 4.355 ± 0.917
0.581AlaPro: 0.581 ± 0.31
2.613AlaGln: 2.613 ± 0.589
2.323AlaArg: 2.323 ± 0.581
5.807AlaSer: 5.807 ± 1.584
4.065AlaThr: 4.065 ± 0.552
2.904AlaVal: 2.904 ± 0.933
0.871AlaTrp: 0.871 ± 0.536
1.452AlaTyr: 1.452 ± 0.567
0.0AlaXaa: 0.0 ± 0.0
Cys
0.871CysAla: 0.871 ± 0.451
0.29CysCys: 0.29 ± 0.272
0.581CysAsp: 0.581 ± 0.411
0.29CysGlu: 0.29 ± 0.272
0.0CysPhe: 0.0 ± 0.0
0.581CysGly: 0.581 ± 0.386
0.29CysHis: 0.29 ± 0.317
0.0CysIle: 0.0 ± 0.0
0.29CysLys: 0.29 ± 0.282
1.161CysLeu: 1.161 ± 0.559
0.0CysMet: 0.0 ± 0.0
0.581CysAsn: 0.581 ± 0.413
0.581CysPro: 0.581 ± 0.398
0.581CysGln: 0.581 ± 0.338
0.581CysArg: 0.581 ± 0.392
0.581CysSer: 0.581 ± 0.459
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.581CysTyr: 0.581 ± 0.386
0.0CysXaa: 0.0 ± 0.0
Asp
2.613AspAla: 2.613 ± 0.807
0.871AspCys: 0.871 ± 0.582
3.194AspAsp: 3.194 ± 0.984
4.936AspGlu: 4.936 ± 1.285
1.742AspPhe: 1.742 ± 0.521
2.033AspGly: 2.033 ± 0.794
1.161AspHis: 1.161 ± 0.692
7.259AspIle: 7.259 ± 1.219
6.388AspLys: 6.388 ± 1.231
6.678AspLeu: 6.678 ± 0.814
1.161AspMet: 1.161 ± 0.743
2.033AspAsn: 2.033 ± 0.686
1.161AspPro: 1.161 ± 0.716
1.742AspGln: 1.742 ± 0.707
2.904AspArg: 2.904 ± 0.685
2.323AspSer: 2.323 ± 0.943
3.194AspThr: 3.194 ± 1.06
0.871AspVal: 0.871 ± 0.516
0.29AspTrp: 0.29 ± 0.306
4.065AspTyr: 4.065 ± 1.421
0.0AspXaa: 0.0 ± 0.0
Glu
6.098GluAla: 6.098 ± 1.09
1.161GluCys: 1.161 ± 0.658
4.936GluAsp: 4.936 ± 1.248
5.807GluGlu: 5.807 ± 1.7
3.484GluPhe: 3.484 ± 1.372
2.904GluGly: 2.904 ± 0.9
1.742GluHis: 1.742 ± 0.749
6.098GluIle: 6.098 ± 1.42
5.517GluLys: 5.517 ± 0.778
9.292GluLeu: 9.292 ± 1.196
2.033GluMet: 2.033 ± 0.669
3.194GluAsn: 3.194 ± 0.839
1.161GluPro: 1.161 ± 0.417
4.646GluGln: 4.646 ± 1.517
3.775GluArg: 3.775 ± 1.063
1.452GluSer: 1.452 ± 0.623
4.936GluThr: 4.936 ± 1.14
4.646GluVal: 4.646 ± 1.334
0.581GluTrp: 0.581 ± 0.357
4.646GluTyr: 4.646 ± 1.17
0.0GluXaa: 0.0 ± 0.0
Phe
2.033PheAla: 2.033 ± 0.556
0.0PheCys: 0.0 ± 0.0
3.484PheAsp: 3.484 ± 0.683
2.323PheGlu: 2.323 ± 0.707
1.742PhePhe: 1.742 ± 0.614
1.161PheGly: 1.161 ± 0.376
2.033PheHis: 2.033 ± 0.519
2.613PheIle: 2.613 ± 0.624
4.355PheLys: 4.355 ± 1.128
2.613PheLeu: 2.613 ± 0.925
0.29PheMet: 0.29 ± 0.259
2.323PheAsn: 2.323 ± 0.844
0.871PhePro: 0.871 ± 0.496
1.452PheGln: 1.452 ± 0.52
2.033PheArg: 2.033 ± 0.624
3.194PheSer: 3.194 ± 0.707
3.484PheThr: 3.484 ± 0.833
2.613PheVal: 2.613 ± 0.651
0.29PheTrp: 0.29 ± 0.237
1.742PheTyr: 1.742 ± 0.818
0.0PheXaa: 0.0 ± 0.0
Gly
2.323GlyAla: 2.323 ± 0.974
0.0GlyCys: 0.0 ± 0.0
4.065GlyAsp: 4.065 ± 1.162
1.452GlyGlu: 1.452 ± 0.436
2.613GlyPhe: 2.613 ± 0.699
1.742GlyGly: 1.742 ± 0.569
1.452GlyHis: 1.452 ± 0.671
2.613GlyIle: 2.613 ± 0.961
3.775GlyLys: 3.775 ± 0.88
6.098GlyLeu: 6.098 ± 1.389
1.161GlyMet: 1.161 ± 0.423
2.323GlyAsn: 2.323 ± 0.744
0.581GlyPro: 0.581 ± 0.398
2.904GlyGln: 2.904 ± 1.397
2.613GlyArg: 2.613 ± 0.839
1.742GlySer: 1.742 ± 0.772
2.904GlyThr: 2.904 ± 0.731
2.613GlyVal: 2.613 ± 1.031
0.581GlyTrp: 0.581 ± 0.357
2.613GlyTyr: 2.613 ± 0.774
0.0GlyXaa: 0.0 ± 0.0
His
2.033HisAla: 2.033 ± 1.244
0.0HisCys: 0.0 ± 0.0
0.29HisAsp: 0.29 ± 0.293
0.581HisGlu: 0.581 ± 0.317
0.29HisPhe: 0.29 ± 0.35
1.161HisGly: 1.161 ± 0.444
0.29HisHis: 0.29 ± 0.237
1.452HisIle: 1.452 ± 0.583
1.742HisLys: 1.742 ± 0.87
2.323HisLeu: 2.323 ± 0.946
0.0HisMet: 0.0 ± 0.0
1.161HisAsn: 1.161 ± 0.754
0.581HisPro: 0.581 ± 0.375
1.161HisGln: 1.161 ± 0.654
0.581HisArg: 0.581 ± 0.383
0.0HisSer: 0.0 ± 0.0
1.161HisThr: 1.161 ± 0.453
0.871HisVal: 0.871 ± 0.339
0.581HisTrp: 0.581 ± 0.338
2.033HisTyr: 2.033 ± 0.798
0.0HisXaa: 0.0 ± 0.0
Ile
5.807IleAla: 5.807 ± 1.22
0.581IleCys: 0.581 ± 0.409
5.226IleAsp: 5.226 ± 1.181
5.517IleGlu: 5.517 ± 1.14
2.904IlePhe: 2.904 ± 0.88
1.742IleGly: 1.742 ± 0.534
1.161IleHis: 1.161 ± 0.724
4.065IleIle: 4.065 ± 1.164
11.324IleLys: 11.324 ± 1.602
5.226IleLeu: 5.226 ± 0.976
1.161IleMet: 1.161 ± 0.559
3.194IleAsn: 3.194 ± 0.795
2.613IlePro: 2.613 ± 0.781
2.323IleGln: 2.323 ± 0.702
2.904IleArg: 2.904 ± 0.886
4.065IleSer: 4.065 ± 1.05
4.936IleThr: 4.936 ± 1.159
3.194IleVal: 3.194 ± 0.774
0.0IleTrp: 0.0 ± 0.0
2.613IleTyr: 2.613 ± 0.908
0.0IleXaa: 0.0 ± 0.0
Lys
8.13LysAla: 8.13 ± 1.495
0.29LysCys: 0.29 ± 0.293
4.065LysAsp: 4.065 ± 1.121
12.776LysGlu: 12.776 ± 1.477
2.904LysPhe: 2.904 ± 0.684
4.355LysGly: 4.355 ± 1.548
2.033LysHis: 2.033 ± 0.668
4.936LysIle: 4.936 ± 1.247
7.84LysLys: 7.84 ± 1.779
5.807LysLeu: 5.807 ± 1.095
2.033LysMet: 2.033 ± 0.988
5.807LysAsn: 5.807 ± 1.161
4.646LysPro: 4.646 ± 1.191
4.646LysGln: 4.646 ± 0.999
5.807LysArg: 5.807 ± 1.085
3.775LysSer: 3.775 ± 1.018
7.259LysThr: 7.259 ± 1.54
6.678LysVal: 6.678 ± 1.179
1.161LysTrp: 1.161 ± 0.728
3.194LysTyr: 3.194 ± 0.803
0.0LysXaa: 0.0 ± 0.0
Leu
6.098LeuAla: 6.098 ± 1.333
0.871LeuCys: 0.871 ± 0.496
5.807LeuAsp: 5.807 ± 1.259
10.743LeuGlu: 10.743 ± 1.038
4.355LeuPhe: 4.355 ± 1.141
4.646LeuGly: 4.646 ± 1.323
1.452LeuHis: 1.452 ± 0.586
8.711LeuIle: 8.711 ± 1.555
9.292LeuLys: 9.292 ± 1.443
8.13LeuLeu: 8.13 ± 1.421
1.452LeuMet: 1.452 ± 0.703
4.355LeuAsn: 4.355 ± 0.937
4.646LeuPro: 4.646 ± 1.125
2.613LeuGln: 2.613 ± 0.598
2.033LeuArg: 2.033 ± 0.683
8.13LeuSer: 8.13 ± 1.992
4.065LeuThr: 4.065 ± 0.696
3.484LeuVal: 3.484 ± 1.008
0.871LeuTrp: 0.871 ± 0.412
5.807LeuTyr: 5.807 ± 0.978
0.0LeuXaa: 0.0 ± 0.0
Met
0.871MetAla: 0.871 ± 0.358
0.29MetCys: 0.29 ± 0.306
1.161MetAsp: 1.161 ± 0.488
0.29MetGlu: 0.29 ± 0.296
0.581MetPhe: 0.581 ± 0.356
0.871MetGly: 0.871 ± 0.577
0.29MetHis: 0.29 ± 0.35
0.581MetIle: 0.581 ± 0.437
3.775MetLys: 3.775 ± 0.915
2.904MetLeu: 2.904 ± 0.639
0.0MetMet: 0.0 ± 0.0
1.452MetAsn: 1.452 ± 0.58
0.29MetPro: 0.29 ± 0.282
0.29MetGln: 0.29 ± 0.333
1.161MetArg: 1.161 ± 0.572
1.452MetSer: 1.452 ± 0.634
4.065MetThr: 4.065 ± 1.043
0.581MetVal: 0.581 ± 0.398
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.065AsnAla: 4.065 ± 0.867
0.0AsnCys: 0.0 ± 0.0
1.452AsnAsp: 1.452 ± 0.647
3.194AsnGlu: 3.194 ± 0.937
1.161AsnPhe: 1.161 ± 0.66
4.355AsnGly: 4.355 ± 0.945
1.452AsnHis: 1.452 ± 0.561
3.194AsnIle: 3.194 ± 1.313
4.646AsnLys: 4.646 ± 1.121
4.646AsnLeu: 4.646 ± 1.032
2.033AsnMet: 2.033 ± 0.825
2.033AsnAsn: 2.033 ± 0.783
2.904AsnPro: 2.904 ± 0.755
3.194AsnGln: 3.194 ± 1.016
4.065AsnArg: 4.065 ± 0.782
2.323AsnSer: 2.323 ± 0.54
2.904AsnThr: 2.904 ± 0.947
2.613AsnVal: 2.613 ± 0.814
0.581AsnTrp: 0.581 ± 0.445
2.323AsnTyr: 2.323 ± 0.64
0.0AsnXaa: 0.0 ± 0.0
Pro
0.581ProAla: 0.581 ± 0.445
0.29ProCys: 0.29 ± 0.283
1.452ProAsp: 1.452 ± 0.561
4.065ProGlu: 4.065 ± 0.86
1.452ProPhe: 1.452 ± 0.567
0.29ProGly: 0.29 ± 0.272
0.0ProHis: 0.0 ± 0.0
1.742ProIle: 1.742 ± 0.664
4.936ProLys: 4.936 ± 1.223
2.613ProLeu: 2.613 ± 0.928
0.871ProMet: 0.871 ± 0.541
2.323ProAsn: 2.323 ± 0.778
0.871ProPro: 0.871 ± 0.507
0.581ProGln: 0.581 ± 0.378
2.613ProArg: 2.613 ± 0.864
1.161ProSer: 1.161 ± 0.591
2.904ProThr: 2.904 ± 0.693
2.323ProVal: 2.323 ± 0.799
0.0ProTrp: 0.0 ± 0.0
1.742ProTyr: 1.742 ± 1.001
0.0ProXaa: 0.0 ± 0.0
Gln
2.033GlnAla: 2.033 ± 0.75
0.0GlnCys: 0.0 ± 0.0
2.033GlnAsp: 2.033 ± 0.706
4.065GlnGlu: 4.065 ± 1.014
1.161GlnPhe: 1.161 ± 0.566
3.775GlnGly: 3.775 ± 0.953
0.871GlnHis: 0.871 ± 0.362
2.323GlnIle: 2.323 ± 0.75
4.355GlnLys: 4.355 ± 1.3
6.678GlnLeu: 6.678 ± 1.318
1.161GlnMet: 1.161 ± 0.869
2.323GlnAsn: 2.323 ± 0.767
1.452GlnPro: 1.452 ± 0.621
3.194GlnGln: 3.194 ± 1.193
2.904GlnArg: 2.904 ± 0.585
2.904GlnSer: 2.904 ± 0.778
1.161GlnThr: 1.161 ± 0.534
2.613GlnVal: 2.613 ± 0.947
0.29GlnTrp: 0.29 ± 0.306
0.581GlnTyr: 0.581 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
1.452ArgAla: 1.452 ± 0.473
0.581ArgCys: 0.581 ± 0.374
3.484ArgAsp: 3.484 ± 0.926
2.033ArgGlu: 2.033 ± 0.816
2.033ArgPhe: 2.033 ± 0.791
2.613ArgGly: 2.613 ± 0.816
1.452ArgHis: 1.452 ± 0.694
2.323ArgIle: 2.323 ± 0.76
5.517ArgLys: 5.517 ± 1.314
5.807ArgLeu: 5.807 ± 0.997
0.871ArgMet: 0.871 ± 0.454
2.613ArgAsn: 2.613 ± 1.047
2.033ArgPro: 2.033 ± 0.717
3.484ArgGln: 3.484 ± 0.81
1.742ArgArg: 1.742 ± 0.721
2.904ArgSer: 2.904 ± 0.853
2.033ArgThr: 2.033 ± 0.654
3.484ArgVal: 3.484 ± 0.953
0.581ArgTrp: 0.581 ± 0.378
3.194ArgTyr: 3.194 ± 0.901
0.0ArgXaa: 0.0 ± 0.0
Ser
3.194SerAla: 3.194 ± 1.009
0.581SerCys: 0.581 ± 0.386
4.065SerAsp: 4.065 ± 1.082
4.355SerGlu: 4.355 ± 1.3
2.613SerPhe: 2.613 ± 0.976
2.033SerGly: 2.033 ± 0.909
0.581SerHis: 0.581 ± 0.374
4.355SerIle: 4.355 ± 1.043
6.388SerLys: 6.388 ± 1.306
6.388SerLeu: 6.388 ± 1.219
1.452SerMet: 1.452 ± 0.563
2.033SerAsn: 2.033 ± 0.514
1.452SerPro: 1.452 ± 0.573
2.613SerGln: 2.613 ± 0.886
1.742SerArg: 1.742 ± 0.559
1.742SerSer: 1.742 ± 0.62
3.484SerThr: 3.484 ± 1.028
3.194SerVal: 3.194 ± 1.192
0.581SerTrp: 0.581 ± 0.383
2.323SerTyr: 2.323 ± 0.844
0.0SerXaa: 0.0 ± 0.0
Thr
3.775ThrAla: 3.775 ± 1.282
0.0ThrCys: 0.0 ± 0.0
2.323ThrAsp: 2.323 ± 0.76
3.775ThrGlu: 3.775 ± 1.134
3.484ThrPhe: 3.484 ± 1.218
5.226ThrGly: 5.226 ± 1.132
0.581ThrHis: 0.581 ± 0.332
5.517ThrIle: 5.517 ± 1.228
2.904ThrLys: 2.904 ± 0.874
6.678ThrLeu: 6.678 ± 1.343
0.581ThrMet: 0.581 ± 0.303
2.033ThrAsn: 2.033 ± 1.27
3.484ThrPro: 3.484 ± 0.888
2.904ThrGln: 2.904 ± 0.89
3.484ThrArg: 3.484 ± 1.108
4.065ThrSer: 4.065 ± 0.913
3.484ThrThr: 3.484 ± 1.391
2.904ThrVal: 2.904 ± 0.728
0.871ThrTrp: 0.871 ± 0.414
3.775ThrTyr: 3.775 ± 1.256
0.0ThrXaa: 0.0 ± 0.0
Val
2.033ValAla: 2.033 ± 0.588
0.581ValCys: 0.581 ± 0.392
2.613ValAsp: 2.613 ± 0.936
2.613ValGlu: 2.613 ± 0.944
2.613ValPhe: 2.613 ± 0.67
2.613ValGly: 2.613 ± 0.64
0.29ValHis: 0.29 ± 0.272
4.646ValIle: 4.646 ± 1.276
4.355ValLys: 4.355 ± 0.816
5.517ValLeu: 5.517 ± 1.326
1.742ValMet: 1.742 ± 0.617
3.775ValAsn: 3.775 ± 0.96
1.161ValPro: 1.161 ± 0.606
1.452ValGln: 1.452 ± 0.668
2.033ValArg: 2.033 ± 0.736
4.936ValSer: 4.936 ± 0.974
3.194ValThr: 3.194 ± 0.998
3.194ValVal: 3.194 ± 0.872
0.581ValTrp: 0.581 ± 0.475
1.452ValTyr: 1.452 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
0.29TrpAla: 0.29 ± 0.224
0.0TrpCys: 0.0 ± 0.0
0.581TrpAsp: 0.581 ± 0.423
0.581TrpGlu: 0.581 ± 0.401
0.0TrpPhe: 0.0 ± 0.0
0.581TrpGly: 0.581 ± 0.378
0.0TrpHis: 0.0 ± 0.0
0.29TrpIle: 0.29 ± 0.308
0.871TrpLys: 0.871 ± 0.474
2.033TrpLeu: 2.033 ± 0.678
0.0TrpMet: 0.0 ± 0.0
0.581TrpAsn: 0.581 ± 0.343
0.29TrpPro: 0.29 ± 0.237
0.581TrpGln: 0.581 ± 0.386
0.29TrpArg: 0.29 ± 0.272
0.29TrpSer: 0.29 ± 0.273
0.0TrpThr: 0.0 ± 0.0
1.452TrpVal: 1.452 ± 0.606
0.871TrpTrp: 0.871 ± 0.61
0.581TrpTyr: 0.581 ± 0.422
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.452TyrAla: 1.452 ± 0.601
0.29TyrCys: 0.29 ± 0.273
2.613TyrAsp: 2.613 ± 0.681
4.065TyrGlu: 4.065 ± 1.122
2.323TyrPhe: 2.323 ± 0.652
1.742TyrGly: 1.742 ± 0.753
1.161TyrHis: 1.161 ± 0.498
1.742TyrIle: 1.742 ± 0.657
4.936TyrLys: 4.936 ± 1.3
2.613TyrLeu: 2.613 ± 0.553
1.452TyrMet: 1.452 ± 0.701
4.646TyrAsn: 4.646 ± 1.038
1.452TyrPro: 1.452 ± 0.795
2.904TyrGln: 2.904 ± 0.86
4.646TyrArg: 4.646 ± 1.594
2.323TyrSer: 2.323 ± 0.7
2.613TyrThr: 2.613 ± 0.461
1.161TyrVal: 1.161 ± 0.488
0.581TyrTrp: 0.581 ± 0.544
3.775TyrTyr: 3.775 ± 0.932
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3445 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski