Amino acid dipepetide frequency for Streptococcus satellite phage Javan390

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.281AlaAla: 0.281 ± 0.283
1.405AlaCys: 1.405 ± 0.614
2.81AlaAsp: 2.81 ± 1.059
3.653AlaGlu: 3.653 ± 1.465
2.529AlaPhe: 2.529 ± 0.654
1.124AlaGly: 1.124 ± 0.565
0.281AlaHis: 0.281 ± 0.295
5.058AlaIle: 5.058 ± 0.981
5.058AlaLys: 5.058 ± 1.86
3.934AlaLeu: 3.934 ± 1.018
1.686AlaMet: 1.686 ± 0.597
1.967AlaAsn: 1.967 ± 1.063
0.0AlaPro: 0.0 ± 0.0
0.843AlaGln: 0.843 ± 0.464
2.529AlaArg: 2.529 ± 1.171
3.091AlaSer: 3.091 ± 1.037
2.529AlaThr: 2.529 ± 0.809
3.091AlaVal: 3.091 ± 0.778
0.281AlaTrp: 0.281 ± 0.239
1.967AlaTyr: 1.967 ± 0.585
0.0AlaXaa: 0.0 ± 0.0
Cys
1.124CysAla: 1.124 ± 0.606
0.0CysCys: 0.0 ± 0.0
0.562CysAsp: 0.562 ± 0.45
0.281CysGlu: 0.281 ± 0.324
0.562CysPhe: 0.562 ± 0.38
0.562CysGly: 0.562 ± 0.385
0.281CysHis: 0.281 ± 0.272
0.843CysIle: 0.843 ± 0.545
0.562CysLys: 0.562 ± 0.424
0.843CysLeu: 0.843 ± 0.655
0.281CysMet: 0.281 ± 0.27
0.843CysAsn: 0.843 ± 0.358
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.281CysArg: 0.281 ± 0.268
0.281CysSer: 0.281 ± 0.268
0.0CysThr: 0.0 ± 0.0
0.281CysVal: 0.281 ± 0.324
0.0CysTrp: 0.0 ± 0.0
0.281CysTyr: 0.281 ± 0.239
0.0CysXaa: 0.0 ± 0.0
Asp
2.248AspAla: 2.248 ± 0.817
0.281AspCys: 0.281 ± 0.272
3.934AspAsp: 3.934 ± 1.085
5.62AspGlu: 5.62 ± 1.344
2.81AspPhe: 2.81 ± 1.079
2.81AspGly: 2.81 ± 0.8
1.686AspHis: 1.686 ± 0.426
6.743AspIle: 6.743 ± 1.616
3.934AspLys: 3.934 ± 0.946
5.058AspLeu: 5.058 ± 1.021
1.124AspMet: 1.124 ± 0.467
5.058AspAsn: 5.058 ± 0.984
0.843AspPro: 0.843 ± 0.54
1.124AspGln: 1.124 ± 0.748
1.686AspArg: 1.686 ± 0.598
2.81AspSer: 2.81 ± 1.058
3.091AspThr: 3.091 ± 0.761
2.81AspVal: 2.81 ± 0.968
0.562AspTrp: 0.562 ± 0.434
3.934AspTyr: 3.934 ± 1.089
0.0AspXaa: 0.0 ± 0.0
Glu
3.934GluAla: 3.934 ± 1.182
0.843GluCys: 0.843 ± 0.465
4.215GluAsp: 4.215 ± 1.125
8.429GluGlu: 8.429 ± 2.195
3.091GluPhe: 3.091 ± 1.155
0.843GluGly: 0.843 ± 0.463
1.124GluHis: 1.124 ± 0.501
4.777GluIle: 4.777 ± 1.229
8.71GluLys: 8.71 ± 1.437
11.801GluLeu: 11.801 ± 1.43
3.653GluMet: 3.653 ± 0.911
5.62GluAsn: 5.62 ± 1.289
1.967GluPro: 1.967 ± 0.64
3.372GluGln: 3.372 ± 1.026
3.653GluArg: 3.653 ± 1.141
3.372GluSer: 3.372 ± 1.239
2.81GluThr: 2.81 ± 0.882
2.81GluVal: 2.81 ± 0.838
0.843GluTrp: 0.843 ± 0.503
4.777GluTyr: 4.777 ± 1.219
0.0GluXaa: 0.0 ± 0.0
Phe
2.248PheAla: 2.248 ± 0.881
0.0PheCys: 0.0 ± 0.0
3.091PheAsp: 3.091 ± 0.745
3.934PheGlu: 3.934 ± 1.07
3.372PhePhe: 3.372 ± 1.17
2.248PheGly: 2.248 ± 0.567
0.562PheHis: 0.562 ± 0.338
3.934PheIle: 3.934 ± 1.108
3.653PheLys: 3.653 ± 0.773
5.058PheLeu: 5.058 ± 1.541
0.843PheMet: 0.843 ± 0.482
3.653PheAsn: 3.653 ± 0.954
0.562PhePro: 0.562 ± 0.388
0.562PheGln: 0.562 ± 0.341
1.405PheArg: 1.405 ± 0.7
3.653PheSer: 3.653 ± 1.051
1.967PheThr: 1.967 ± 0.58
4.777PheVal: 4.777 ± 1.255
0.843PheTrp: 0.843 ± 0.41
1.124PheTyr: 1.124 ± 0.675
0.0PheXaa: 0.0 ± 0.0
Gly
2.81GlyAla: 2.81 ± 0.61
0.843GlyCys: 0.843 ± 0.553
1.686GlyAsp: 1.686 ± 0.642
3.653GlyGlu: 3.653 ± 1.068
1.686GlyPhe: 1.686 ± 0.605
2.529GlyGly: 2.529 ± 0.838
0.843GlyHis: 0.843 ± 0.53
3.653GlyIle: 3.653 ± 0.972
4.215GlyLys: 4.215 ± 1.033
6.182GlyLeu: 6.182 ± 1.635
1.967GlyMet: 1.967 ± 0.706
2.248GlyAsn: 2.248 ± 0.67
0.0GlyPro: 0.0 ± 0.0
0.843GlyGln: 0.843 ± 0.406
1.686GlyArg: 1.686 ± 0.625
2.529GlySer: 2.529 ± 0.789
3.372GlyThr: 3.372 ± 1.05
3.372GlyVal: 3.372 ± 0.912
0.281GlyTrp: 0.281 ± 0.283
1.686GlyTyr: 1.686 ± 0.655
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.816
0.0HisCys: 0.0 ± 0.0
0.843HisAsp: 0.843 ± 0.6
0.843HisGlu: 0.843 ± 0.402
1.124HisPhe: 1.124 ± 0.464
0.562HisGly: 0.562 ± 0.449
0.562HisHis: 0.562 ± 0.45
1.124HisIle: 1.124 ± 0.599
0.281HisLys: 0.281 ± 0.258
2.248HisLeu: 2.248 ± 0.837
0.843HisMet: 0.843 ± 0.451
0.843HisAsn: 0.843 ± 0.476
0.562HisPro: 0.562 ± 0.329
0.281HisGln: 0.281 ± 0.268
0.281HisArg: 0.281 ± 0.263
0.562HisSer: 0.562 ± 0.374
0.281HisThr: 0.281 ± 0.272
0.562HisVal: 0.562 ± 0.332
0.0HisTrp: 0.0 ± 0.0
0.281HisTyr: 0.281 ± 0.263
0.0HisXaa: 0.0 ± 0.0
Ile
4.777IleAla: 4.777 ± 1.493
0.562IleCys: 0.562 ± 0.398
5.901IleAsp: 5.901 ± 1.159
6.743IleGlu: 6.743 ± 0.826
6.182IlePhe: 6.182 ± 1.901
4.496IleGly: 4.496 ± 1.094
0.562IleHis: 0.562 ± 0.345
8.71IleIle: 8.71 ± 2.073
4.496IleLys: 4.496 ± 1.293
6.462IleLeu: 6.462 ± 1.765
0.843IleMet: 0.843 ± 0.508
7.024IleAsn: 7.024 ± 1.262
1.405IlePro: 1.405 ± 0.558
4.777IleGln: 4.777 ± 1.036
4.496IleArg: 4.496 ± 1.073
8.71IleSer: 8.71 ± 1.606
3.934IleThr: 3.934 ± 1.198
5.058IleVal: 5.058 ± 1.498
0.562IleTrp: 0.562 ± 0.372
2.248IleTyr: 2.248 ± 0.556
0.0IleXaa: 0.0 ± 0.0
Lys
5.339LysAla: 5.339 ± 1.679
0.0LysCys: 0.0 ± 0.0
5.058LysAsp: 5.058 ± 1.119
8.71LysGlu: 8.71 ± 1.358
2.529LysPhe: 2.529 ± 0.79
3.372LysGly: 3.372 ± 1.01
1.124LysHis: 1.124 ± 0.666
9.272LysIle: 9.272 ± 1.322
11.52LysLys: 11.52 ± 2.089
7.305LysLeu: 7.305 ± 1.633
2.529LysMet: 2.529 ± 0.695
5.058LysAsn: 5.058 ± 1.122
1.686LysPro: 1.686 ± 0.511
5.339LysGln: 5.339 ± 1.155
3.372LysArg: 3.372 ± 0.912
6.462LysSer: 6.462 ± 1.09
7.305LysThr: 7.305 ± 1.543
7.305LysVal: 7.305 ± 1.428
0.281LysTrp: 0.281 ± 0.264
3.653LysTyr: 3.653 ± 0.83
0.0LysXaa: 0.0 ± 0.0
Leu
5.058LeuAla: 5.058 ± 0.923
0.562LeuCys: 0.562 ± 0.349
6.743LeuAsp: 6.743 ± 1.672
6.743LeuGlu: 6.743 ± 1.387
4.777LeuPhe: 4.777 ± 1.218
7.305LeuGly: 7.305 ± 1.968
1.405LeuHis: 1.405 ± 0.651
10.396LeuIle: 10.396 ± 1.944
8.148LeuLys: 8.148 ± 1.252
10.115LeuLeu: 10.115 ± 2.113
3.091LeuMet: 3.091 ± 1.042
6.182LeuAsn: 6.182 ± 1.02
2.529LeuPro: 2.529 ± 0.652
6.182LeuGln: 6.182 ± 1.269
3.091LeuArg: 3.091 ± 0.885
4.777LeuSer: 4.777 ± 0.979
3.934LeuThr: 3.934 ± 1.017
5.62LeuVal: 5.62 ± 1.287
0.281LeuTrp: 0.281 ± 0.272
3.372LeuTyr: 3.372 ± 0.943
0.0LeuXaa: 0.0 ± 0.0
Met
1.405MetAla: 1.405 ± 0.546
0.281MetCys: 0.281 ± 0.272
1.124MetAsp: 1.124 ± 0.586
1.967MetGlu: 1.967 ± 0.887
0.843MetPhe: 0.843 ± 0.531
0.843MetGly: 0.843 ± 0.451
0.0MetHis: 0.0 ± 0.0
3.091MetIle: 3.091 ± 1.591
4.496MetLys: 4.496 ± 1.232
2.248MetLeu: 2.248 ± 0.8
1.124MetMet: 1.124 ± 0.46
2.248MetAsn: 2.248 ± 0.569
0.562MetPro: 0.562 ± 0.372
2.529MetGln: 2.529 ± 0.716
0.843MetArg: 0.843 ± 0.421
0.562MetSer: 0.562 ± 0.613
0.843MetThr: 0.843 ± 0.486
1.967MetVal: 1.967 ± 0.65
0.0MetTrp: 0.0 ± 0.0
1.405MetTyr: 1.405 ± 0.531
0.0MetXaa: 0.0 ± 0.0
Asn
3.091AsnAla: 3.091 ± 1.45
0.0AsnCys: 0.0 ± 0.0
3.372AsnAsp: 3.372 ± 0.941
6.462AsnGlu: 6.462 ± 1.6
2.529AsnPhe: 2.529 ± 0.709
3.372AsnGly: 3.372 ± 0.908
0.562AsnHis: 0.562 ± 0.386
5.058AsnIle: 5.058 ± 1.285
9.553AsnLys: 9.553 ± 1.1
6.743AsnLeu: 6.743 ± 1.394
1.124AsnMet: 1.124 ± 0.594
2.529AsnAsn: 2.529 ± 0.8
1.967AsnPro: 1.967 ± 0.834
2.529AsnGln: 2.529 ± 0.904
3.091AsnArg: 3.091 ± 0.694
4.777AsnSer: 4.777 ± 1.492
2.248AsnThr: 2.248 ± 0.931
1.967AsnVal: 1.967 ± 0.656
1.405AsnTrp: 1.405 ± 0.621
1.405AsnTyr: 1.405 ± 0.488
0.0AsnXaa: 0.0 ± 0.0
Pro
1.405ProAla: 1.405 ± 0.677
0.0ProCys: 0.0 ± 0.0
1.686ProAsp: 1.686 ± 0.656
1.686ProGlu: 1.686 ± 0.574
0.562ProPhe: 0.562 ± 0.359
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
0.843ProIle: 0.843 ± 0.352
1.405ProLys: 1.405 ± 0.837
2.248ProLeu: 2.248 ± 0.997
0.0ProMet: 0.0 ± 0.0
2.81ProAsn: 2.81 ± 0.798
1.967ProPro: 1.967 ± 0.74
0.562ProGln: 0.562 ± 0.368
1.124ProArg: 1.124 ± 0.541
1.405ProSer: 1.405 ± 0.503
1.967ProThr: 1.967 ± 0.678
1.124ProVal: 1.124 ± 0.746
0.0ProTrp: 0.0 ± 0.0
1.405ProTyr: 1.405 ± 0.554
0.0ProXaa: 0.0 ± 0.0
Gln
1.967GlnAla: 1.967 ± 0.611
0.843GlnCys: 0.843 ± 0.435
1.967GlnAsp: 1.967 ± 0.674
1.967GlnGlu: 1.967 ± 0.627
1.405GlnPhe: 1.405 ± 0.567
2.81GlnGly: 2.81 ± 0.866
0.562GlnHis: 0.562 ± 0.343
3.653GlnIle: 3.653 ± 1.213
5.339GlnLys: 5.339 ± 1.091
4.215GlnLeu: 4.215 ± 1.093
1.124GlnMet: 1.124 ± 0.664
1.405GlnAsn: 1.405 ± 0.487
0.562GlnPro: 0.562 ± 0.341
1.124GlnGln: 1.124 ± 0.411
2.248GlnArg: 2.248 ± 0.825
2.81GlnSer: 2.81 ± 0.891
2.248GlnThr: 2.248 ± 1.12
1.686GlnVal: 1.686 ± 0.6
0.281GlnTrp: 0.281 ± 0.302
1.405GlnTyr: 1.405 ± 0.475
0.0GlnXaa: 0.0 ± 0.0
Arg
1.124ArgAla: 1.124 ± 0.735
0.281ArgCys: 0.281 ± 0.268
1.686ArgAsp: 1.686 ± 0.885
4.215ArgGlu: 4.215 ± 1.312
2.248ArgPhe: 2.248 ± 0.794
1.686ArgGly: 1.686 ± 0.66
0.562ArgHis: 0.562 ± 0.3
3.653ArgIle: 3.653 ± 1.001
2.81ArgLys: 2.81 ± 0.771
3.091ArgLeu: 3.091 ± 0.735
1.124ArgMet: 1.124 ± 0.598
2.529ArgAsn: 2.529 ± 0.82
1.405ArgPro: 1.405 ± 0.722
3.653ArgGln: 3.653 ± 0.945
0.562ArgArg: 0.562 ± 0.357
2.248ArgSer: 2.248 ± 0.621
2.81ArgThr: 2.81 ± 0.754
2.529ArgVal: 2.529 ± 0.628
0.843ArgTrp: 0.843 ± 0.429
1.405ArgTyr: 1.405 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
3.091SerAla: 3.091 ± 0.897
0.562SerCys: 0.562 ± 0.436
4.215SerAsp: 4.215 ± 0.96
2.529SerGlu: 2.529 ± 0.849
2.529SerPhe: 2.529 ± 0.981
2.81SerGly: 2.81 ± 0.756
1.124SerHis: 1.124 ± 0.482
5.901SerIle: 5.901 ± 1.512
6.182SerLys: 6.182 ± 1.786
6.462SerLeu: 6.462 ± 1.46
3.934SerMet: 3.934 ± 0.884
3.653SerAsn: 3.653 ± 0.787
0.843SerPro: 0.843 ± 0.444
1.686SerGln: 1.686 ± 0.562
2.529SerArg: 2.529 ± 0.604
2.81SerSer: 2.81 ± 1.057
2.248SerThr: 2.248 ± 0.794
2.81SerVal: 2.81 ± 0.935
0.843SerTrp: 0.843 ± 0.599
4.777SerTyr: 4.777 ± 1.219
0.0SerXaa: 0.0 ± 0.0
Thr
1.686ThrAla: 1.686 ± 0.784
0.281ThrCys: 0.281 ± 0.239
2.529ThrAsp: 2.529 ± 1.19
5.058ThrGlu: 5.058 ± 1.16
1.967ThrPhe: 1.967 ± 0.775
2.81ThrGly: 2.81 ± 0.897
0.281ThrHis: 0.281 ± 0.272
3.653ThrIle: 3.653 ± 0.954
3.372ThrLys: 3.372 ± 1.222
6.182ThrLeu: 6.182 ± 1.196
0.843ThrMet: 0.843 ± 0.459
3.653ThrAsn: 3.653 ± 1.122
1.124ThrPro: 1.124 ± 0.446
1.405ThrGln: 1.405 ± 0.628
2.529ThrArg: 2.529 ± 0.841
1.405ThrSer: 1.405 ± 0.523
3.372ThrThr: 3.372 ± 1.022
3.934ThrVal: 3.934 ± 0.921
1.124ThrTrp: 1.124 ± 0.616
2.529ThrTyr: 2.529 ± 0.686
0.0ThrXaa: 0.0 ± 0.0
Val
1.405ValAla: 1.405 ± 0.476
0.562ValCys: 0.562 ± 0.428
4.215ValAsp: 4.215 ± 1.317
4.496ValGlu: 4.496 ± 0.971
1.686ValPhe: 1.686 ± 0.737
2.248ValGly: 2.248 ± 0.662
0.843ValHis: 0.843 ± 0.595
3.372ValIle: 3.372 ± 1.214
5.62ValLys: 5.62 ± 1.043
5.62ValLeu: 5.62 ± 1.046
0.843ValMet: 0.843 ± 0.476
3.372ValAsn: 3.372 ± 0.762
2.81ValPro: 2.81 ± 1.029
1.124ValGln: 1.124 ± 0.549
1.967ValArg: 1.967 ± 1.09
5.339ValSer: 5.339 ± 0.859
3.091ValThr: 3.091 ± 0.65
3.934ValVal: 3.934 ± 0.993
0.281ValTrp: 0.281 ± 0.239
5.058ValTyr: 5.058 ± 1.418
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.281TrpAsp: 0.281 ± 0.302
1.686TrpGlu: 1.686 ± 0.566
0.843TrpPhe: 0.843 ± 0.59
0.843TrpGly: 0.843 ± 0.43
0.0TrpHis: 0.0 ± 0.0
1.124TrpIle: 1.124 ± 0.571
0.281TrpLys: 0.281 ± 0.283
1.124TrpLeu: 1.124 ± 0.536
0.281TrpMet: 0.281 ± 0.302
0.281TrpAsn: 0.281 ± 0.345
0.0TrpPro: 0.0 ± 0.0
0.281TrpGln: 0.281 ± 0.263
0.562TrpArg: 0.562 ± 0.329
0.562TrpSer: 0.562 ± 0.3
0.281TrpThr: 0.281 ± 0.258
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.562TrpTyr: 0.562 ± 0.359
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.281TyrAla: 0.281 ± 0.263
0.562TyrCys: 0.562 ± 0.349
2.248TyrAsp: 2.248 ± 0.735
2.248TyrGlu: 2.248 ± 0.914
4.215TyrPhe: 4.215 ± 1.053
2.81TyrGly: 2.81 ± 0.711
0.562TyrHis: 0.562 ± 0.419
3.091TyrIle: 3.091 ± 0.956
7.586TyrLys: 7.586 ± 1.195
3.372TyrLeu: 3.372 ± 0.808
0.843TyrMet: 0.843 ± 0.464
2.81TyrAsn: 2.81 ± 0.904
1.405TyrPro: 1.405 ± 0.693
1.405TyrGln: 1.405 ± 0.61
2.529TyrArg: 2.529 ± 0.957
3.372TyrSer: 3.372 ± 0.933
1.405TyrThr: 1.405 ± 0.797
2.248TyrVal: 2.248 ± 0.691
0.281TyrTrp: 0.281 ± 0.283
1.405TyrTyr: 1.405 ± 0.52
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3560 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski