Amino acid dipepetide frequency for Streptococcus satellite phage Javan746

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.801AlaAla: 0.801 ± 0.5
0.0AlaCys: 0.0 ± 0.0
3.205AlaAsp: 3.205 ± 1.036
4.006AlaGlu: 4.006 ± 0.884
2.804AlaPhe: 2.804 ± 0.918
2.003AlaGly: 2.003 ± 0.577
0.401AlaHis: 0.401 ± 0.468
4.006AlaIle: 4.006 ± 1.165
5.609AlaLys: 5.609 ± 0.976
2.804AlaLeu: 2.804 ± 1.145
1.603AlaMet: 1.603 ± 0.886
2.003AlaAsn: 2.003 ± 0.837
0.801AlaPro: 0.801 ± 0.503
3.205AlaGln: 3.205 ± 1.056
3.606AlaArg: 3.606 ± 1.452
4.006AlaSer: 4.006 ± 1.318
3.205AlaThr: 3.205 ± 0.927
3.606AlaVal: 3.606 ± 1.127
0.801AlaTrp: 0.801 ± 0.544
2.404AlaTyr: 2.404 ± 0.729
0.0AlaXaa: 0.0 ± 0.0
Cys
0.401CysAla: 0.401 ± 0.349
0.0CysCys: 0.0 ± 0.0
0.401CysAsp: 0.401 ± 0.349
0.801CysGlu: 0.801 ± 0.386
0.0CysPhe: 0.0 ± 0.0
0.401CysGly: 0.401 ± 0.359
0.401CysHis: 0.401 ± 0.311
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.801CysLeu: 0.801 ± 0.437
0.0CysMet: 0.0 ± 0.0
0.401CysAsn: 0.401 ± 0.398
0.401CysPro: 0.401 ± 0.311
0.0CysGln: 0.0 ± 0.0
0.401CysArg: 0.401 ± 0.423
0.0CysSer: 0.0 ± 0.0
0.401CysThr: 0.401 ± 0.499
0.401CysVal: 0.401 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.401CysTyr: 0.401 ± 0.369
0.0CysXaa: 0.0 ± 0.0
Asp
0.801AspAla: 0.801 ± 0.402
0.401AspCys: 0.401 ± 0.311
3.205AspAsp: 3.205 ± 1.204
6.811AspGlu: 6.811 ± 2.006
2.003AspPhe: 2.003 ± 0.85
3.205AspGly: 3.205 ± 0.828
0.401AspHis: 0.401 ± 0.299
7.212AspIle: 7.212 ± 1.161
6.41AspLys: 6.41 ± 1.499
7.612AspLeu: 7.612 ± 1.502
1.603AspMet: 1.603 ± 0.614
3.205AspAsn: 3.205 ± 0.871
0.801AspPro: 0.801 ± 0.599
0.801AspGln: 0.801 ± 0.738
2.804AspArg: 2.804 ± 1.327
2.003AspSer: 2.003 ± 0.937
2.003AspThr: 2.003 ± 0.673
0.801AspVal: 0.801 ± 0.402
0.0AspTrp: 0.0 ± 0.0
3.205AspTyr: 3.205 ± 0.907
0.0AspXaa: 0.0 ± 0.0
Glu
3.205GluAla: 3.205 ± 1.018
0.801GluCys: 0.801 ± 0.437
2.804GluAsp: 2.804 ± 1.089
8.814GluGlu: 8.814 ± 2.304
2.404GluPhe: 2.404 ± 1.156
2.804GluGly: 2.804 ± 0.633
1.603GluHis: 1.603 ± 0.736
6.811GluIle: 6.811 ± 0.747
11.218GluLys: 11.218 ± 2.517
11.619GluLeu: 11.619 ± 2.326
2.804GluMet: 2.804 ± 1.234
4.407GluAsn: 4.407 ± 1.397
0.401GluPro: 0.401 ± 0.398
2.404GluGln: 2.404 ± 0.832
4.808GluArg: 4.808 ± 1.132
6.41GluSer: 6.41 ± 1.399
4.006GluThr: 4.006 ± 0.781
6.01GluVal: 6.01 ± 1.374
0.801GluTrp: 0.801 ± 0.529
5.208GluTyr: 5.208 ± 1.18
0.0GluXaa: 0.0 ± 0.0
Phe
1.603PheAla: 1.603 ± 0.821
0.0PheCys: 0.0 ± 0.0
4.006PheAsp: 4.006 ± 1.396
3.205PheGlu: 3.205 ± 1.035
1.603PhePhe: 1.603 ± 0.871
3.606PheGly: 3.606 ± 1.082
0.801PheHis: 0.801 ± 0.491
2.404PheIle: 2.404 ± 0.644
2.003PheLys: 2.003 ± 1.196
5.208PheLeu: 5.208 ± 1.404
0.801PheMet: 0.801 ± 0.611
1.603PheAsn: 1.603 ± 0.729
0.801PhePro: 0.801 ± 0.409
0.0PheGln: 0.0 ± 0.0
1.603PheArg: 1.603 ± 0.593
3.205PheSer: 3.205 ± 0.899
3.606PheThr: 3.606 ± 1.301
3.205PheVal: 3.205 ± 1.19
0.401PheTrp: 0.401 ± 0.299
2.804PheTyr: 2.804 ± 1.177
0.0PheXaa: 0.0 ± 0.0
Gly
4.006GlyAla: 4.006 ± 1.534
0.0GlyCys: 0.0 ± 0.0
2.804GlyAsp: 2.804 ± 1.222
3.606GlyGlu: 3.606 ± 0.931
2.804GlyPhe: 2.804 ± 0.685
1.603GlyGly: 1.603 ± 0.782
1.202GlyHis: 1.202 ± 0.425
2.804GlyIle: 2.804 ± 0.838
5.208GlyLys: 5.208 ± 1.551
7.212GlyLeu: 7.212 ± 1.137
1.202GlyMet: 1.202 ± 0.614
1.202GlyAsn: 1.202 ± 0.574
0.401GlyPro: 0.401 ± 0.369
1.603GlyGln: 1.603 ± 0.888
3.205GlyArg: 3.205 ± 0.935
2.003GlySer: 2.003 ± 0.701
1.603GlyThr: 1.603 ± 0.526
5.208GlyVal: 5.208 ± 1.428
2.404GlyTrp: 2.404 ± 0.841
3.205GlyTyr: 3.205 ± 0.912
0.0GlyXaa: 0.0 ± 0.0
His
1.202HisAla: 1.202 ± 0.933
0.0HisCys: 0.0 ± 0.0
1.202HisAsp: 1.202 ± 0.61
1.202HisGlu: 1.202 ± 0.599
0.801HisPhe: 0.801 ± 0.634
1.603HisGly: 1.603 ± 0.799
0.0HisHis: 0.0 ± 0.0
0.401HisIle: 0.401 ± 0.369
2.003HisLys: 2.003 ± 0.756
2.404HisLeu: 2.404 ± 0.984
0.401HisMet: 0.401 ± 0.299
1.202HisAsn: 1.202 ± 0.547
0.0HisPro: 0.0 ± 0.0
0.801HisGln: 0.801 ± 0.6
0.401HisArg: 0.401 ± 0.353
0.801HisSer: 0.801 ± 0.544
1.202HisThr: 1.202 ± 0.679
0.401HisVal: 0.401 ± 0.349
0.0HisTrp: 0.0 ± 0.0
1.603HisTyr: 1.603 ± 1.068
0.0HisXaa: 0.0 ± 0.0
Ile
3.606IleAla: 3.606 ± 1.194
0.401IleCys: 0.401 ± 0.398
2.404IleAsp: 2.404 ± 1.262
4.006IleGlu: 4.006 ± 1.062
1.202IlePhe: 1.202 ± 0.694
2.404IleGly: 2.404 ± 0.823
0.401IleHis: 0.401 ± 0.349
4.808IleIle: 4.808 ± 1.029
8.013IleLys: 8.013 ± 1.871
5.609IleLeu: 5.609 ± 1.408
1.202IleMet: 1.202 ± 0.629
2.003IleAsn: 2.003 ± 0.755
3.606IlePro: 3.606 ± 0.928
2.804IleGln: 2.804 ± 0.775
3.205IleArg: 3.205 ± 0.868
4.808IleSer: 4.808 ± 0.857
3.205IleThr: 3.205 ± 0.796
4.407IleVal: 4.407 ± 1.175
0.401IleTrp: 0.401 ± 0.468
4.006IleTyr: 4.006 ± 0.981
0.0IleXaa: 0.0 ± 0.0
Lys
5.609LysAla: 5.609 ± 1.256
0.0LysCys: 0.0 ± 0.0
8.413LysAsp: 8.413 ± 1.118
11.619LysGlu: 11.619 ± 1.735
5.609LysPhe: 5.609 ± 1.383
6.41LysGly: 6.41 ± 1.43
1.603LysHis: 1.603 ± 0.79
7.212LysIle: 7.212 ± 1.688
8.814LysLys: 8.814 ± 1.772
11.619LysLeu: 11.619 ± 1.981
0.801LysMet: 0.801 ± 0.452
5.609LysAsn: 5.609 ± 0.882
2.003LysPro: 2.003 ± 0.681
3.606LysGln: 3.606 ± 0.976
6.01LysArg: 6.01 ± 1.406
6.01LysSer: 6.01 ± 1.041
6.811LysThr: 6.811 ± 1.949
3.205LysVal: 3.205 ± 0.841
1.202LysTrp: 1.202 ± 0.534
3.205LysTyr: 3.205 ± 1.099
0.0LysXaa: 0.0 ± 0.0
Leu
6.811LeuAla: 6.811 ± 1.729
0.401LeuCys: 0.401 ± 0.359
6.01LeuAsp: 6.01 ± 1.596
10.817LeuGlu: 10.817 ± 1.897
4.808LeuPhe: 4.808 ± 1.583
5.208LeuGly: 5.208 ± 1.562
1.603LeuHis: 1.603 ± 0.822
5.208LeuIle: 5.208 ± 1.441
12.42LeuLys: 12.42 ± 1.788
7.212LeuLeu: 7.212 ± 1.985
2.404LeuMet: 2.404 ± 1.032
7.612LeuAsn: 7.612 ± 1.59
4.407LeuPro: 4.407 ± 0.913
3.606LeuGln: 3.606 ± 1.374
4.808LeuArg: 4.808 ± 1.266
4.407LeuSer: 4.407 ± 0.829
8.013LeuThr: 8.013 ± 1.941
5.208LeuVal: 5.208 ± 1.547
0.0LeuTrp: 0.0 ± 0.0
3.205LeuTyr: 3.205 ± 1.108
0.0LeuXaa: 0.0 ± 0.0
Met
1.202MetAla: 1.202 ± 0.521
0.401MetCys: 0.401 ± 0.499
0.801MetAsp: 0.801 ± 0.386
1.202MetGlu: 1.202 ± 1.001
0.801MetPhe: 0.801 ± 0.577
2.003MetGly: 2.003 ± 0.734
0.0MetHis: 0.0 ± 0.0
1.603MetIle: 1.603 ± 0.639
2.804MetLys: 2.804 ± 0.635
2.003MetLeu: 2.003 ± 0.978
0.401MetMet: 0.401 ± 0.369
2.404MetAsn: 2.404 ± 0.797
0.0MetPro: 0.0 ± 0.0
0.801MetGln: 0.801 ± 0.409
1.202MetArg: 1.202 ± 0.689
1.202MetSer: 1.202 ± 0.662
0.801MetThr: 0.801 ± 0.445
3.606MetVal: 3.606 ± 1.24
0.0MetTrp: 0.0 ± 0.0
0.801MetTyr: 0.801 ± 0.409
0.0MetXaa: 0.0 ± 0.0
Asn
3.205AsnAla: 3.205 ± 1.091
0.801AsnCys: 0.801 ± 0.437
3.606AsnAsp: 3.606 ± 1.104
1.603AsnGlu: 1.603 ± 0.622
1.202AsnPhe: 1.202 ± 0.505
3.205AsnGly: 3.205 ± 1.162
2.003AsnHis: 2.003 ± 0.58
1.603AsnIle: 1.603 ± 0.707
6.811AsnLys: 6.811 ± 1.155
6.41AsnLeu: 6.41 ± 1.865
0.801AsnMet: 0.801 ± 0.399
0.801AsnAsn: 0.801 ± 0.538
1.202AsnPro: 1.202 ± 0.501
5.208AsnGln: 5.208 ± 1.388
2.804AsnArg: 2.804 ± 1.027
3.205AsnSer: 3.205 ± 1.025
2.404AsnThr: 2.404 ± 1.018
1.603AsnVal: 1.603 ± 0.736
0.801AsnTrp: 0.801 ± 0.386
1.603AsnTyr: 1.603 ± 0.802
0.0AsnXaa: 0.0 ± 0.0
Pro
1.202ProAla: 1.202 ± 0.633
0.0ProCys: 0.0 ± 0.0
2.804ProAsp: 2.804 ± 1.159
3.205ProGlu: 3.205 ± 1.199
1.603ProPhe: 1.603 ± 0.63
0.401ProGly: 0.401 ± 0.369
0.0ProHis: 0.0 ± 0.0
0.801ProIle: 0.801 ± 0.519
1.202ProLys: 1.202 ± 0.616
1.603ProLeu: 1.603 ± 0.762
0.801ProMet: 0.801 ± 0.409
0.801ProAsn: 0.801 ± 0.491
2.003ProPro: 2.003 ± 0.904
0.801ProGln: 0.801 ± 0.536
1.202ProArg: 1.202 ± 0.674
0.801ProSer: 0.801 ± 0.47
1.603ProThr: 1.603 ± 0.595
0.801ProVal: 0.801 ± 0.402
0.0ProTrp: 0.0 ± 0.0
1.202ProTyr: 1.202 ± 0.616
0.0ProXaa: 0.0 ± 0.0
Gln
3.205GlnAla: 3.205 ± 0.811
0.0GlnCys: 0.0 ± 0.0
1.202GlnAsp: 1.202 ± 0.547
4.808GlnGlu: 4.808 ± 1.878
1.202GlnPhe: 1.202 ± 0.685
1.603GlnGly: 1.603 ± 0.764
2.003GlnHis: 2.003 ± 1.747
3.205GlnIle: 3.205 ± 1.125
5.208GlnLys: 5.208 ± 1.683
4.006GlnLeu: 4.006 ± 1.181
0.801GlnMet: 0.801 ± 0.634
2.404GlnAsn: 2.404 ± 0.618
0.401GlnPro: 0.401 ± 0.468
2.804GlnGln: 2.804 ± 1.051
1.603GlnArg: 1.603 ± 0.751
2.404GlnSer: 2.404 ± 1.207
1.603GlnThr: 1.603 ± 0.488
3.606GlnVal: 3.606 ± 1.178
0.401GlnTrp: 0.401 ± 0.311
0.401GlnTyr: 0.401 ± 0.299
0.0GlnXaa: 0.0 ± 0.0
Arg
3.606ArgAla: 3.606 ± 1.431
0.0ArgCys: 0.0 ± 0.0
3.606ArgAsp: 3.606 ± 0.675
4.407ArgGlu: 4.407 ± 1.137
2.404ArgPhe: 2.404 ± 1.043
4.407ArgGly: 4.407 ± 1.345
0.801ArgHis: 0.801 ± 0.437
2.404ArgIle: 2.404 ± 0.588
2.804ArgLys: 2.804 ± 0.723
7.212ArgLeu: 7.212 ± 1.911
1.202ArgMet: 1.202 ± 0.597
2.003ArgAsn: 2.003 ± 0.952
0.401ArgPro: 0.401 ± 0.299
2.804ArgGln: 2.804 ± 0.886
2.404ArgArg: 2.404 ± 0.996
2.003ArgSer: 2.003 ± 0.809
2.003ArgThr: 2.003 ± 0.706
3.205ArgVal: 3.205 ± 0.852
0.401ArgTrp: 0.401 ± 0.418
3.606ArgTyr: 3.606 ± 1.337
0.0ArgXaa: 0.0 ± 0.0
Ser
2.404SerAla: 2.404 ± 1.093
1.202SerCys: 1.202 ± 0.479
2.804SerAsp: 2.804 ± 1.005
5.208SerGlu: 5.208 ± 0.902
2.404SerPhe: 2.404 ± 0.905
2.404SerGly: 2.404 ± 0.538
0.401SerHis: 0.401 ± 0.311
3.606SerIle: 3.606 ± 0.908
5.208SerLys: 5.208 ± 1.296
5.208SerLeu: 5.208 ± 0.895
3.606SerMet: 3.606 ± 1.358
2.404SerAsn: 2.404 ± 0.619
1.202SerPro: 1.202 ± 0.875
3.205SerGln: 3.205 ± 1.147
3.205SerArg: 3.205 ± 1.093
2.804SerSer: 2.804 ± 0.982
2.003SerThr: 2.003 ± 0.529
1.603SerVal: 1.603 ± 0.663
0.0SerTrp: 0.0 ± 0.0
2.804SerTyr: 2.804 ± 0.772
0.0SerXaa: 0.0 ± 0.0
Thr
1.603ThrAla: 1.603 ± 0.64
0.0ThrCys: 0.0 ± 0.0
2.404ThrAsp: 2.404 ± 1.011
3.606ThrGlu: 3.606 ± 1.149
2.404ThrPhe: 2.404 ± 0.533
4.006ThrGly: 4.006 ± 1.543
2.003ThrHis: 2.003 ± 0.705
4.006ThrIle: 4.006 ± 1.303
6.01ThrLys: 6.01 ± 1.196
4.407ThrLeu: 4.407 ± 1.274
1.603ThrMet: 1.603 ± 1.051
2.404ThrAsn: 2.404 ± 0.996
1.603ThrPro: 1.603 ± 0.607
2.804ThrGln: 2.804 ± 0.76
2.003ThrArg: 2.003 ± 0.903
2.404ThrSer: 2.404 ± 0.79
2.404ThrThr: 2.404 ± 0.924
5.609ThrVal: 5.609 ± 1.979
0.401ThrTrp: 0.401 ± 0.349
2.404ThrTyr: 2.404 ± 1.522
0.0ThrXaa: 0.0 ± 0.0
Val
4.808ValAla: 4.808 ± 1.62
0.401ValCys: 0.401 ± 0.349
2.003ValAsp: 2.003 ± 0.708
5.208ValGlu: 5.208 ± 0.871
2.804ValPhe: 2.804 ± 0.678
2.804ValGly: 2.804 ± 1.125
0.801ValHis: 0.801 ± 0.409
2.404ValIle: 2.404 ± 1.035
5.208ValLys: 5.208 ± 1.434
5.208ValLeu: 5.208 ± 0.842
1.603ValMet: 1.603 ± 0.741
5.208ValAsn: 5.208 ± 1.117
0.801ValPro: 0.801 ± 0.402
2.003ValGln: 2.003 ± 0.84
2.804ValArg: 2.804 ± 0.957
2.003ValSer: 2.003 ± 1.287
4.006ValThr: 4.006 ± 0.793
3.205ValVal: 3.205 ± 1.147
0.401ValTrp: 0.401 ± 0.398
3.205ValTyr: 3.205 ± 1.119
0.0ValXaa: 0.0 ± 0.0
Trp
0.401TrpAla: 0.401 ± 0.311
0.401TrpCys: 0.401 ± 0.299
0.0TrpAsp: 0.0 ± 0.0
2.003TrpGlu: 2.003 ± 0.988
0.401TrpPhe: 0.401 ± 0.349
0.401TrpGly: 0.401 ± 0.299
0.0TrpHis: 0.0 ± 0.0
0.401TrpIle: 0.401 ± 0.444
1.603TrpLys: 1.603 ± 0.823
0.801TrpLeu: 0.801 ± 0.437
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.401TrpGln: 0.401 ± 0.369
0.401TrpArg: 0.401 ± 0.299
0.401TrpSer: 0.401 ± 0.311
0.801TrpThr: 0.801 ± 0.409
0.401TrpVal: 0.401 ± 0.398
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.202TyrAla: 1.202 ± 0.633
0.401TyrCys: 0.401 ± 0.311
1.603TyrAsp: 1.603 ± 0.356
3.606TyrGlu: 3.606 ± 1.345
3.205TyrPhe: 3.205 ± 1.05
2.804TyrGly: 2.804 ± 0.949
1.202TyrHis: 1.202 ± 0.519
1.603TyrIle: 1.603 ± 0.73
6.811TyrLys: 6.811 ± 1.863
5.208TyrLeu: 5.208 ± 1.748
0.0TyrMet: 0.0 ± 0.0
3.205TyrAsn: 3.205 ± 1.135
1.603TyrPro: 1.603 ± 0.602
3.205TyrGln: 3.205 ± 1.224
3.205TyrArg: 3.205 ± 1.258
2.804TyrSer: 2.804 ± 0.824
2.404TyrThr: 2.404 ± 0.978
0.801TyrVal: 0.801 ± 0.519
0.401TyrTrp: 0.401 ± 0.369
2.404TyrTyr: 2.404 ± 0.983
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2497 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski