Amino acid dipeptide frequency for Streptococcus satellite phage Javan402

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
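The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given as one-letter strings, and frequencies are normalized by the total number of overlapping residue pairs, which may differ from the normalization used for the table.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2  # 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs never span two proteins, so each sequence is scanned separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MKKLL", "AKK"]  # toy sequences, not taken from the proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the values from the table, `classify(14.44)` (LeuLeu) gives "over" and `classify(0.328)` (AlaHis) gives "under", which corresponds to the red and blue marking described above.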

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
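As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and are used here only for illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


def fold_over_uniform(value, expected=2.5):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / expected


if __name__ == "__main__":
    leu_leu = 14.44  # LeuLeu value from the table, in per mille
    print(permille_to_fraction(leu_leu))
    print(permille_to_percent(leu_leu))
    print(fold_over_uniform(leu_leu))
```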

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.985 ± 0.456
AlaAsp: 2.626 ± 0.852
AlaGlu: 3.61 ± 1.462
AlaPhe: 2.626 ± 0.962
AlaGly: 2.954 ± 1.09
AlaHis: 0.328 ± 0.376
AlaIle: 8.205 ± 1.352
AlaLys: 3.61 ± 1.249
AlaLeu: 6.892 ± 1.295
AlaMet: 1.641 ± 0.576
AlaAsn: 2.297 ± 0.694
AlaPro: 1.641 ± 0.987
AlaGln: 2.626 ± 1.0
AlaArg: 2.297 ± 0.748
AlaSer: 0.656 ± 0.374
AlaThr: 3.938 ± 1.167
AlaVal: 0.656 ± 0.555
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.641 ± 0.77
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.313 ± 0.749
CysCys: 0.0 ± 0.0
CysAsp: 1.313 ± 0.726
CysGlu: 0.328 ± 0.277
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.328 ± 0.277
CysIle: 0.656 ± 0.329
CysLys: 0.328 ± 0.302
CysLeu: 0.656 ± 0.439
CysMet: 0.0 ± 0.0
CysAsn: 0.656 ± 0.441
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.328 ± 0.287
CysTrp: 0.0 ± 0.0
CysTyr: 0.328 ± 0.258
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.656 ± 0.443
AspCys: 1.641 ± 0.958
AspAsp: 5.579 ± 1.759
AspGlu: 4.266 ± 1.366
AspPhe: 2.626 ± 0.887
AspGly: 2.626 ± 0.797
AspHis: 0.656 ± 0.652
AspIle: 8.205 ± 1.366
AspLys: 6.236 ± 1.757
AspLeu: 6.236 ± 1.038
AspMet: 2.626 ± 1.028
AspAsn: 4.266 ± 1.422
AspPro: 0.328 ± 0.336
AspGln: 2.297 ± 0.866
AspArg: 1.641 ± 0.753
AspSer: 1.641 ± 0.567
AspThr: 4.923 ± 1.893
AspVal: 1.313 ± 0.664
AspTrp: 0.328 ± 0.362
AspTyr: 5.579 ± 1.288
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.251 ± 1.353
GluCys: 0.985 ± 0.568
GluAsp: 2.626 ± 1.066
GluGlu: 3.282 ± 1.329
GluPhe: 2.297 ± 0.984
GluGly: 0.328 ± 0.295
GluHis: 0.656 ± 0.409
GluIle: 3.282 ± 0.893
GluLys: 7.22 ± 1.682
GluLeu: 13.128 ± 1.629
GluMet: 1.641 ± 0.653
GluAsn: 6.236 ± 1.331
GluPro: 3.282 ± 1.03
GluGln: 3.282 ± 1.392
GluArg: 3.938 ± 0.843
GluSer: 1.313 ± 0.629
GluThr: 3.938 ± 0.861
GluVal: 3.282 ± 1.167
GluTrp: 0.328 ± 0.277
GluTyr: 3.61 ± 0.954
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.656 ± 0.429
PheCys: 0.0 ± 0.0
PheAsp: 1.641 ± 0.52
PheGlu: 3.938 ± 1.931
PhePhe: 1.969 ± 0.856
PheGly: 1.313 ± 0.509
PheHis: 0.656 ± 0.555
PheIle: 2.626 ± 0.82
PheLys: 3.938 ± 1.198
PheLeu: 3.282 ± 1.258
PheMet: 0.985 ± 0.668
PheAsn: 2.297 ± 0.871
PhePro: 0.328 ± 0.295
PheGln: 1.313 ± 0.737
PheArg: 2.626 ± 0.896
PheSer: 2.626 ± 0.815
PheThr: 3.282 ± 1.155
PheVal: 1.641 ± 0.466
PheTrp: 0.328 ± 0.287
PheTyr: 2.297 ± 0.893
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.969 ± 0.829
GlyCys: 0.0 ± 0.0
GlyAsp: 2.626 ± 1.199
GlyGlu: 0.985 ± 0.617
GlyPhe: 1.641 ± 0.822
GlyGly: 2.626 ± 0.79
GlyHis: 0.328 ± 0.277
GlyIle: 6.564 ± 2.129
GlyLys: 4.595 ± 1.073
GlyLeu: 5.251 ± 1.325
GlyMet: 0.328 ± 0.314
GlyAsn: 3.61 ± 0.97
GlyPro: 0.328 ± 0.362
GlyGln: 1.641 ± 0.696
GlyArg: 2.954 ± 1.043
GlySer: 1.313 ± 0.858
GlyThr: 2.626 ± 0.61
GlyVal: 1.313 ± 0.891
GlyTrp: 0.328 ± 0.287
GlyTyr: 1.969 ± 0.727
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.969 ± 0.716
HisCys: 0.0 ± 0.0
HisAsp: 1.313 ± 0.647
HisGlu: 0.985 ± 0.511
HisPhe: 0.328 ± 0.336
HisGly: 0.656 ± 0.484
HisHis: 0.656 ± 0.409
HisIle: 0.328 ± 0.309
HisLys: 0.656 ± 0.44
HisLeu: 1.969 ± 0.725
HisMet: 0.0 ± 0.0
HisAsn: 0.328 ± 0.362
HisPro: 0.328 ± 0.295
HisGln: 0.328 ± 0.302
HisArg: 0.328 ± 0.287
HisSer: 0.656 ± 0.375
HisThr: 1.313 ± 0.411
HisVal: 0.985 ± 0.572
HisTrp: 0.0 ± 0.0
HisTyr: 0.328 ± 0.287
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.236 ± 1.45
IleCys: 0.328 ± 0.379
IleAsp: 6.236 ± 1.515
IleGlu: 4.266 ± 1.158
IlePhe: 2.626 ± 1.521
IleGly: 5.251 ± 1.726
IleHis: 0.985 ± 0.547
IleIle: 7.548 ± 2.226
IleLys: 10.174 ± 2.065
IleLeu: 6.236 ± 1.913
IleMet: 3.282 ± 1.183
IleAsn: 7.548 ± 1.269
IlePro: 3.282 ± 1.124
IleGln: 2.297 ± 0.86
IleArg: 1.969 ± 0.679
IleSer: 5.907 ± 1.519
IleThr: 5.907 ± 1.095
IleVal: 1.641 ± 0.95
IleTrp: 0.328 ± 0.379
IleTyr: 3.282 ± 0.729
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.22 ± 1.891
LysCys: 0.0 ± 0.0
LysAsp: 5.907 ± 1.527
LysGlu: 10.502 ± 1.87
LysPhe: 1.969 ± 0.695
LysGly: 3.282 ± 1.175
LysHis: 1.969 ± 0.676
LysIle: 4.923 ± 1.37
LysLys: 9.518 ± 3.315
LysLeu: 9.189 ± 1.897
LysMet: 4.266 ± 1.467
LysAsn: 6.892 ± 1.491
LysPro: 3.282 ± 1.141
LysGln: 4.595 ± 1.138
LysArg: 2.954 ± 1.117
LysSer: 6.892 ± 1.744
LysThr: 5.907 ± 1.205
LysVal: 2.954 ± 1.045
LysTrp: 0.656 ± 0.423
LysTyr: 3.282 ± 0.987
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.564 ± 1.429
LeuCys: 0.0 ± 0.0
LeuAsp: 6.564 ± 2.472
LeuGlu: 8.533 ± 1.72
LeuPhe: 3.938 ± 1.198
LeuGly: 6.236 ± 1.333
LeuHis: 0.985 ± 0.533
LeuIle: 9.518 ± 2.085
LeuLys: 8.205 ± 1.445
LeuLeu: 14.44 ± 1.846
LeuMet: 1.969 ± 0.756
LeuAsn: 7.22 ± 1.573
LeuPro: 4.923 ± 0.984
LeuGln: 4.923 ± 1.08
LeuArg: 3.61 ± 0.845
LeuSer: 6.564 ± 1.579
LeuThr: 8.205 ± 1.442
LeuVal: 6.236 ± 0.932
LeuTrp: 0.328 ± 0.277
LeuTyr: 3.938 ± 0.897
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.297 ± 1.058
MetCys: 0.328 ± 0.376
MetAsp: 1.313 ± 0.658
MetGlu: 0.656 ± 0.4
MetPhe: 0.656 ± 0.372
MetGly: 0.328 ± 0.312
MetHis: 0.328 ± 0.379
MetIle: 2.626 ± 1.005
MetLys: 3.61 ± 0.871
MetLeu: 1.969 ± 0.903
MetMet: 0.0 ± 0.0
MetAsn: 2.626 ± 0.813
MetPro: 0.0 ± 0.0
MetGln: 1.313 ± 0.587
MetArg: 1.641 ± 0.787
MetSer: 0.328 ± 0.375
MetThr: 2.954 ± 0.99
MetVal: 1.969 ± 0.798
MetTrp: 0.0 ± 0.0
MetTyr: 0.985 ± 0.489
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.282 ± 1.051
AsnCys: 1.313 ± 0.643
AsnAsp: 4.266 ± 1.047
AsnGlu: 7.22 ± 1.533
AsnPhe: 3.61 ± 0.995
AsnGly: 3.61 ± 0.932
AsnHis: 0.0 ± 0.0
AsnIle: 2.954 ± 0.753
AsnLys: 5.579 ± 1.003
AsnLeu: 7.548 ± 1.277
AsnMet: 2.297 ± 0.971
AsnAsn: 3.938 ± 0.858
AsnPro: 1.969 ± 0.793
AsnGln: 1.641 ± 0.67
AsnArg: 0.985 ± 0.62
AsnSer: 3.938 ± 0.928
AsnThr: 3.61 ± 1.023
AsnVal: 2.954 ± 1.071
AsnTrp: 0.328 ± 0.376
AsnTyr: 3.938 ± 1.173
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.328 ± 0.277
ProCys: 0.0 ± 0.0
ProAsp: 2.954 ± 1.123
ProGlu: 2.297 ± 0.84
ProPhe: 1.313 ± 0.433
ProGly: 0.0 ± 0.0
ProHis: 0.328 ± 0.287
ProIle: 1.313 ± 0.589
ProLys: 2.954 ± 0.894
ProLeu: 3.61 ± 1.11
ProMet: 0.985 ± 0.642
ProAsn: 1.969 ± 0.733
ProPro: 1.641 ± 0.988
ProGln: 1.313 ± 0.755
ProArg: 1.313 ± 0.757
ProSer: 0.985 ± 0.406
ProThr: 3.282 ± 0.947
ProVal: 1.641 ± 0.684
ProTrp: 0.0 ± 0.0
ProTyr: 2.954 ± 0.803
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.954 ± 1.108
GlnCys: 0.0 ± 0.0
GlnAsp: 0.985 ± 0.463
GlnGlu: 2.626 ± 0.695
GlnPhe: 1.641 ± 0.672
GlnGly: 2.297 ± 0.985
GlnHis: 0.328 ± 0.326
GlnIle: 2.954 ± 0.903
GlnLys: 2.297 ± 0.937
GlnLeu: 4.266 ± 0.93
GlnMet: 1.313 ± 0.728
GlnAsn: 0.985 ± 0.427
GlnPro: 1.313 ± 0.724
GlnGln: 1.969 ± 0.715
GlnArg: 2.626 ± 0.66
GlnSer: 3.61 ± 0.925
GlnThr: 2.626 ± 0.927
GlnVal: 2.297 ± 1.018
GlnTrp: 0.328 ± 0.258
GlnTyr: 2.626 ± 0.814
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.656 ± 0.409
ArgCys: 0.328 ± 0.277
ArgAsp: 3.282 ± 0.982
ArgGlu: 2.297 ± 0.909
ArgPhe: 0.985 ± 0.46
ArgGly: 2.297 ± 0.845
ArgHis: 0.656 ± 0.436
ArgIle: 1.969 ± 0.515
ArgLys: 5.907 ± 1.673
ArgLeu: 7.877 ± 1.381
ArgMet: 1.313 ± 0.601
ArgAsn: 0.656 ± 0.499
ArgPro: 0.656 ± 0.42
ArgGln: 1.969 ± 0.677
ArgArg: 0.985 ± 0.418
ArgSer: 1.969 ± 0.723
ArgThr: 2.626 ± 0.935
ArgVal: 2.954 ± 0.875
ArgTrp: 0.328 ± 0.312
ArgTyr: 1.313 ± 0.677
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.969 ± 0.817
SerCys: 0.0 ± 0.0
SerAsp: 4.923 ± 1.413
SerGlu: 2.297 ± 0.981
SerPhe: 2.954 ± 0.913
SerGly: 1.969 ± 0.727
SerHis: 1.641 ± 0.583
SerIle: 5.907 ± 1.892
SerLys: 5.579 ± 1.196
SerLeu: 3.938 ± 1.004
SerMet: 0.328 ± 0.295
SerAsn: 2.954 ± 0.791
SerPro: 2.297 ± 0.917
SerGln: 1.641 ± 0.847
SerArg: 3.282 ± 0.959
SerSer: 3.61 ± 1.453
SerThr: 0.656 ± 0.394
SerVal: 2.954 ± 1.151
SerTrp: 0.328 ± 0.312
SerTyr: 3.282 ± 0.74
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.641 ± 0.562
ThrCys: 0.0 ± 0.0
ThrAsp: 4.595 ± 1.121
ThrGlu: 4.923 ± 0.999
ThrPhe: 3.938 ± 1.335
ThrGly: 3.61 ± 0.923
ThrHis: 0.656 ± 0.375
ThrIle: 6.236 ± 1.242
ThrLys: 5.907 ± 1.2
ThrLeu: 7.548 ± 1.909
ThrMet: 0.656 ± 0.499
ThrAsn: 3.282 ± 1.111
ThrPro: 2.626 ± 0.645
ThrGln: 2.297 ± 0.799
ThrArg: 3.282 ± 0.993
ThrSer: 2.626 ± 0.773
ThrThr: 4.923 ± 1.032
ThrVal: 4.595 ± 1.016
ThrTrp: 0.985 ± 0.498
ThrTyr: 2.626 ± 1.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.938 ± 0.835
ValCys: 0.0 ± 0.0
ValAsp: 1.641 ± 0.738
ValGlu: 2.297 ± 0.878
ValPhe: 1.641 ± 0.641
ValGly: 1.641 ± 0.466
ValHis: 0.656 ± 0.329
ValIle: 3.61 ± 0.866
ValLys: 4.266 ± 1.144
ValLeu: 3.282 ± 1.023
ValMet: 0.985 ± 0.593
ValAsn: 3.61 ± 1.259
ValPro: 0.656 ± 0.429
ValGln: 1.313 ± 0.541
ValArg: 0.985 ± 0.471
ValSer: 3.282 ± 1.313
ValThr: 2.626 ± 0.936
ValVal: 1.969 ± 0.948
ValTrp: 1.313 ± 0.883
ValTyr: 3.282 ± 1.088
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.656 ± 0.471
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.328 ± 0.287
TrpIle: 0.328 ± 0.258
TrpLys: 0.656 ± 0.409
TrpLeu: 1.969 ± 0.724
TrpMet: 0.0 ± 0.0
TrpAsn: 0.328 ± 0.287
TrpPro: 0.0 ± 0.0
TrpGln: 0.328 ± 0.312
TrpArg: 0.328 ± 0.379
TrpSer: 1.313 ± 0.45
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.328 ± 0.312
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.328 ± 0.287
TyrCys: 0.328 ± 0.295
TyrAsp: 3.282 ± 1.058
TyrGlu: 3.938 ± 1.416
TyrPhe: 1.313 ± 0.564
TyrGly: 1.969 ± 0.729
TyrHis: 0.985 ± 0.498
TyrIle: 5.579 ± 1.266
TyrLys: 4.923 ± 1.579
TyrLeu: 3.61 ± 0.9
TyrMet: 0.985 ± 0.577
TyrAsn: 3.282 ± 1.001
TyrPro: 2.297 ± 0.772
TyrGln: 2.954 ± 1.055
TyrArg: 3.282 ± 1.299
TyrSer: 3.282 ± 1.105
TyrThr: 3.61 ± 1.045
TyrVal: 1.313 ± 0.701
TyrTrp: 0.328 ± 0.287
TyrTyr: 2.626 ± 0.823
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3048 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
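A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is normalized by the number of overlapping residue pairs, and the sample standard deviation over the replicates is taken as the error.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    pairs = Counter()
    for seq in proteins:
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(pairs.values())
    return 1000.0 * pairs[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLLA", "AKKA", "LLLL"]  # toy sequences, not the real proteome
    print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a frequency depends on which proteins are included in a small proteome such as this one.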

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski