Amino acid dipeptide frequency for Arthrobacter phage Idaho

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
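The counting and unit conversion described above can be sketched in a few lines of Python. This is a minimal illustration, not the code behind the database: the function names, the use of one-letter codes with X for Xaa, the overlapping-pair counting, and the normalisation by the total number of pairs are all assumptions of this sketch. The 2.5‰ threshold is simply the uniform expectation of 0.25% expressed in per mille.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption of this sketch)
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides: 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is treated as Xaa.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        freqs[a + b] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


def classify(per_mille):
    """Label a value relative to the uniform expectation of 2.5 per mille."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


def to_fraction(per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


# Toy input (not real phage proteins): three AA pairs and one AC pair.
freqs = dipeptide_per_mille(["AAAA", "AC"])
print(freqs["AA"], classify(freqs["AA"]))
# The AlaAla value from the table, expressed as a fraction.
print(to_fraction(23.08))
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0 rather than being absent.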

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 23.08 ± 2.74 | 0.613 ± 0.331 | 6.127 ± 1.178 | 8.987 ± 2.234 | 3.064 ± 0.743 | 14.297 ± 1.814 | 2.655 ± 0.66 | 4.289 ± 0.854 | 5.31 ± 1.302 | 12.255 ± 1.09 | 3.268 ± 0.658 | 6.944 ± 0.985 | 6.944 ± 1.123 | 6.536 ± 1.425 | 6.74 ± 1.559 | 6.74 ± 1.607 | 8.17 ± 1.901 | 8.783 ± 1.448 | 1.838 ± 0.632 | 1.634 ± 0.548 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.204 ± 0.224 | 0.204 ± 0.215 | 0.408 ± 0.386 | 0.0 ± 0.0 | 0.613 ± 0.332 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.613 ± 0.36 | 0.204 ± 0.181 | 0.408 ± 0.254 | 0.408 ± 0.309 | 0.408 ± 0.38 | 0.408 ± 0.354 | 0.204 ± 0.181 | 0.204 ± 0.224 | 0.408 ± 0.311 | 0.204 ± 0.177 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 9.6 ± 1.589 | 0.204 ± 0.244 | 5.106 ± 1.084 | 2.655 ± 0.804 | 0.817 ± 0.438 | 7.353 ± 1.235 | 0.613 ± 0.283 | 3.472 ± 0.734 | 3.676 ± 1.411 | 7.353 ± 1.447 | 1.634 ± 0.465 | 1.838 ± 0.555 | 3.064 ± 0.789 | 2.859 ± 0.757 | 2.859 ± 0.804 | 3.472 ± 0.76 | 4.902 ± 1.157 | 6.127 ± 0.818 | 1.43 ± 0.556 | 1.634 ± 0.614 | 0.0 ± 0.0 |
| Glu | 8.17 ± 2.564 | 0.613 ± 0.343 | 3.881 ± 0.995 | 3.268 ± 1.153 | 1.634 ± 0.704 | 2.655 ± 0.643 | 0.408 ± 0.255 | 2.451 ± 0.559 | 1.021 ± 0.425 | 7.557 ± 1.007 | 1.021 ± 0.319 | 1.634 ± 0.413 | 2.042 ± 0.481 | 1.43 ± 0.642 | 3.268 ± 1.06 | 2.859 ± 0.613 | 1.838 ± 0.462 | 3.676 ± 0.628 | 1.634 ± 0.54 | 1.225 ± 0.609 | 0.0 ± 0.0 |
| Phe | 2.655 ± 0.912 | 0.204 ± 0.259 | 2.247 ± 0.553 | 1.43 ± 0.443 | 0.613 ± 0.386 | 1.634 ± 0.503 | 0.613 ± 0.296 | 1.021 ± 0.503 | 1.43 ± 0.449 | 1.43 ± 0.613 | 0.408 ± 0.254 | 1.021 ± 0.614 | 1.021 ± 0.488 | 1.021 ± 0.549 | 1.021 ± 0.304 | 0.613 ± 0.312 | 1.021 ± 0.39 | 1.43 ± 0.458 | 0.613 ± 0.438 | 0.613 ± 0.322 | 0.0 ± 0.0 |
| Gly | 9.804 ± 1.33 | 0.817 ± 0.319 | 5.31 ± 1.258 | 4.493 ± 0.71 | 1.838 ± 0.475 | 6.944 ± 1.111 | 1.634 ± 0.549 | 2.655 ± 0.977 | 2.859 ± 0.802 | 8.987 ± 1.611 | 2.247 ± 0.479 | 2.451 ± 0.717 | 3.676 ± 0.833 | 2.859 ± 0.807 | 6.127 ± 1.265 | 4.902 ± 1.266 | 9.804 ± 1.422 | 5.923 ± 1.015 | 2.247 ± 0.713 | 2.655 ± 0.585 | 0.0 ± 0.0 |
| His | 1.838 ± 0.51 | 0.0 ± 0.0 | 0.613 ± 0.285 | 0.817 ± 0.356 | 0.0 ± 0.0 | 1.43 ± 0.604 | 0.204 ± 0.193 | 0.817 ± 0.585 | 0.0 ± 0.0 | 2.042 ± 0.799 | 0.408 ± 0.262 | 0.408 ± 0.282 | 0.817 ± 0.51 | 1.021 ± 0.432 | 0.817 ± 0.694 | 0.204 ± 0.186 | 0.613 ± 0.402 | 1.021 ± 0.6 | 0.204 ± 0.184 | 0.204 ± 0.181 | 0.0 ± 0.0 |
| Ile | 3.676 ± 0.777 | 0.0 ± 0.0 | 3.268 ± 0.83 | 1.43 ± 0.435 | 0.817 ± 0.319 | 4.493 ± 0.906 | 0.817 ± 0.338 | 1.225 ± 0.452 | 1.838 ± 0.348 | 1.838 ± 0.829 | 0.817 ± 0.356 | 2.451 ± 0.859 | 2.042 ± 0.633 | 3.676 ± 0.881 | 2.655 ± 0.46 | 2.655 ± 0.63 | 2.655 ± 1.018 | 1.43 ± 0.568 | 0.0 ± 0.0 | 2.042 ± 0.72 | 0.0 ± 0.0 |
| Lys | 6.944 ± 1.48 | 0.0 ± 0.0 | 1.838 ± 0.604 | 2.247 ± 0.585 | 0.613 ± 0.304 | 4.289 ± 1.357 | 0.817 ± 0.444 | 1.225 ± 0.65 | 1.838 ± 0.758 | 3.676 ± 1.274 | 1.021 ± 0.331 | 1.634 ± 0.444 | 1.838 ± 0.772 | 1.021 ± 0.482 | 4.289 ± 1.231 | 3.881 ± 1.127 | 2.451 ± 0.535 | 1.838 ± 0.482 | 1.021 ± 0.639 | 0.408 ± 0.262 | 0.0 ± 0.0 |
| Leu | 11.642 ± 1.242 | 0.204 ± 0.193 | 10.212 ± 1.673 | 6.332 ± 1.205 | 1.838 ± 0.531 | 7.557 ± 1.278 | 1.43 ± 0.506 | 3.064 ± 0.789 | 2.859 ± 0.767 | 6.127 ± 1.182 | 1.225 ± 0.608 | 1.225 ± 0.525 | 5.515 ± 1.028 | 2.655 ± 1.061 | 6.127 ± 1.013 | 5.719 ± 1.091 | 4.085 ± 0.74 | 7.761 ± 1.532 | 2.042 ± 0.559 | 1.021 ± 0.473 | 0.0 ± 0.0 |
| Met | 3.881 ± 0.712 | 0.0 ± 0.0 | 2.859 ± 0.654 | 1.634 ± 0.482 | 0.408 ± 0.314 | 2.042 ± 0.6 | 0.204 ± 0.205 | 1.021 ± 0.439 | 1.021 ± 0.555 | 2.247 ± 0.743 | 0.408 ± 0.253 | 0.817 ± 0.338 | 1.838 ± 0.865 | 0.0 ± 0.0 | 1.021 ± 0.624 | 1.225 ± 0.478 | 2.247 ± 0.586 | 1.838 ± 0.6 | 0.408 ± 0.309 | 0.408 ± 0.262 | 0.0 ± 0.0 |
| Asn | 3.676 ± 0.65 | 0.0 ± 0.0 | 2.042 ± 0.576 | 1.43 ± 0.593 | 1.021 ± 0.488 | 2.859 ± 0.752 | 0.613 ± 0.367 | 0.613 ± 0.327 | 1.634 ± 0.519 | 4.085 ± 0.844 | 1.225 ± 0.425 | 0.817 ± 0.39 | 2.247 ± 0.784 | 0.408 ± 0.288 | 2.655 ± 0.792 | 0.613 ± 0.348 | 2.247 ± 0.697 | 2.655 ± 0.777 | 0.204 ± 0.193 | 1.021 ± 0.412 | 0.0 ± 0.0 |
| Pro | 5.106 ± 1.113 | 0.204 ± 0.215 | 4.698 ± 0.835 | 3.676 ± 0.791 | 0.817 ± 0.294 | 5.923 ± 1.245 | 0.408 ± 0.31 | 2.247 ± 0.661 | 2.655 ± 0.722 | 3.676 ± 0.92 | 2.042 ± 0.714 | 1.225 ± 0.561 | 2.655 ± 0.957 | 1.634 ± 0.573 | 2.042 ± 0.584 | 3.676 ± 0.835 | 2.042 ± 0.852 | 3.064 ± 0.687 | 0.817 ± 0.509 | 1.021 ± 0.394 | 0.0 ± 0.0 |
| Gln | 7.966 ± 1.479 | 0.408 ± 0.286 | 3.472 ± 0.625 | 1.838 ± 0.615 | 1.225 ± 0.458 | 4.289 ± 1.125 | 0.204 ± 0.186 | 1.838 ± 0.604 | 1.225 ± 0.609 | 2.655 ± 0.78 | 0.817 ± 0.592 | 0.0 ± 0.0 | 0.613 ± 0.415 | 0.408 ± 0.262 | 1.838 ± 0.495 | 1.43 ± 0.709 | 2.042 ± 0.504 | 4.085 ± 0.979 | 0.204 ± 0.181 | 0.408 ± 0.304 | 0.0 ± 0.0 |
| Arg | 8.987 ± 1.251 | 0.613 ± 0.314 | 3.676 ± 0.912 | 2.859 ± 0.92 | 1.225 ± 0.522 | 4.289 ± 0.879 | 0.817 ± 0.644 | 2.859 ± 0.731 | 2.655 ± 0.741 | 6.332 ± 0.928 | 1.225 ± 0.509 | 2.042 ± 0.518 | 3.676 ± 1.037 | 2.042 ± 0.768 | 5.31 ± 1.035 | 2.042 ± 0.665 | 3.472 ± 0.857 | 3.881 ± 1.102 | 1.225 ± 0.474 | 1.838 ± 0.688 | 0.0 ± 0.0 |
| Ser | 7.149 ± 1.354 | 0.0 ± 0.0 | 1.634 ± 0.531 | 1.634 ± 0.575 | 1.838 ± 0.592 | 6.944 ± 1.028 | 0.613 ± 0.48 | 2.859 ± 0.959 | 2.655 ± 0.735 | 4.698 ± 0.841 | 1.225 ± 0.482 | 2.042 ± 0.538 | 3.268 ± 0.943 | 1.634 ± 0.418 | 2.655 ± 0.636 | 2.247 ± 0.783 | 4.698 ± 0.867 | 3.676 ± 0.755 | 1.43 ± 0.348 | 1.838 ± 0.581 | 0.0 ± 0.0 |
| Thr | 10.008 ± 1.182 | 0.204 ± 0.177 | 4.289 ± 0.812 | 3.064 ± 0.759 | 1.43 ± 0.484 | 4.289 ± 1.153 | 0.0 ± 0.0 | 3.881 ± 0.862 | 3.676 ± 0.826 | 3.881 ± 0.865 | 2.655 ± 0.443 | 1.634 ± 0.429 | 4.085 ± 1.227 | 1.838 ± 0.627 | 3.676 ± 0.845 | 4.289 ± 0.799 | 5.31 ± 1.445 | 5.31 ± 0.853 | 0.817 ± 0.384 | 2.042 ± 0.835 | 0.0 ± 0.0 |
| Val | 8.987 ± 1.555 | 0.408 ± 0.38 | 5.719 ± 1.223 | 2.655 ± 0.709 | 1.838 ± 0.714 | 3.268 ± 0.719 | 1.021 ± 0.462 | 3.472 ± 0.774 | 3.676 ± 0.842 | 6.127 ± 1.21 | 2.655 ± 0.513 | 2.451 ± 0.748 | 2.451 ± 0.852 | 3.676 ± 0.986 | 3.881 ± 1.144 | 5.515 ± 0.963 | 6.74 ± 1.044 | 4.698 ± 1.111 | 0.817 ± 0.51 | 1.021 ± 0.472 | 0.0 ± 0.0 |
| Trp | 1.43 ± 0.626 | 0.204 ± 0.177 | 1.225 ± 0.445 | 0.408 ± 0.252 | 0.408 ± 0.233 | 1.021 ± 0.355 | 0.204 ± 0.177 | 0.613 ± 0.309 | 1.838 ± 0.604 | 1.021 ± 0.475 | 0.204 ± 0.286 | 0.613 ± 0.336 | 1.021 ± 0.426 | 1.021 ± 0.399 | 1.838 ± 0.748 | 1.021 ± 0.444 | 0.817 ± 0.351 | 1.43 ± 0.464 | 0.817 ± 0.357 | 1.225 ± 0.547 | 0.0 ± 0.0 |
| Tyr | 3.676 ± 0.819 | 0.0 ± 0.0 | 2.247 ± 0.769 | 1.021 ± 0.395 | 0.613 ± 0.393 | 2.042 ± 0.725 | 0.204 ± 0.186 | 0.204 ± 0.25 | 1.021 ± 0.437 | 2.042 ± 0.534 | 0.613 ± 0.335 | 0.408 ± 0.28 | 0.408 ± 0.304 | 1.021 ± 0.364 | 1.838 ± 0.602 | 1.43 ± 0.539 | 1.021 ± 0.395 | 2.042 ± 0.626 | 0.408 ± 0.28 | 0.613 ± 0.427 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 22 proteins (4897 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
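Bootstrapping at the protein level means resampling whole proteins with replacement, not individual residues or dipeptides. The sketch below illustrates the idea under stated assumptions: the function names are hypothetical, sequences use one-letter codes, frequencies are normalised by the number of overlapping pairs, and the reported error is taken to be the standard deviation of the bootstrap estimates. The page does not specify these details, so the database may compute the error differently.

```python
import random
import statistics


def pair_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over all overlapping residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples.

    n_boot must be at least 2, because a standard deviation needs two values.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (not real phage sequences).
toy = ["AAAA", "CCCC", "AACC"]
print(pair_frequency(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps each dipeptide inside its original sequence context, so the error reflects variation between proteins. With only 22 proteins in this proteome, that variation can be large relative to the frequencies themselves.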

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski