Amino acid dipeptide frequency for Xanthomonas phage Xf109

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
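As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter protein sequences, counts overlapping pairs within each protein, maps any non-standard letter to X (Xaa), and compares each frequency with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical, and the database itself may count dipeptides differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq, expected=EXPECTED_PER_MILLE):
    """Classify a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # would be marked red
    if freq < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not the phage proteome.
freqs = dipeptide_per_mille(["AAL", "ALA"])
print(freqs["AL"], label(freqs["AL"]))
```

With this classification, the AlaAla value of 14.993‰ from the table would count as overrepresented and the CysGln value of 0.454‰ as underrepresented.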

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
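The unit conversion can be written out as a short Python sketch. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 14.993  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
```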

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.993 ± 2.655
AlaCys: 2.272 ± 0.836
AlaAsp: 6.361 ± 1.305
AlaGlu: 3.635 ± 1.368
AlaPhe: 4.998 ± 2.273
AlaGly: 6.361 ± 1.408
AlaHis: 1.817 ± 0.747
AlaIle: 6.361 ± 2.059
AlaLys: 6.361 ± 2.039
AlaLeu: 15.448 ± 4.548
AlaMet: 4.543 ± 1.46
AlaAsn: 1.817 ± 1.157
AlaPro: 4.089 ± 1.173
AlaGln: 6.361 ± 1.635
AlaArg: 8.632 ± 2.334
AlaSer: 7.269 ± 1.413
AlaThr: 5.452 ± 1.226
AlaVal: 6.815 ± 2.989
AlaTrp: 4.089 ± 1.356
AlaTyr: 5.906 ± 1.512
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.635 ± 1.67
CysCys: 0.0 ± 0.0
CysAsp: 2.272 ± 1.688
CysGlu: 0.909 ± 0.551
CysPhe: 0.0 ± 0.0
CysGly: 1.817 ± 0.771
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.909 ± 0.675
CysLeu: 1.363 ± 0.83
CysMet: 1.363 ± 0.842
CysAsn: 0.909 ± 0.675
CysPro: 2.272 ± 1.688
CysGln: 0.454 ± 0.338
CysArg: 1.363 ± 0.572
CysSer: 2.272 ± 1.075
CysThr: 1.817 ± 0.718
CysVal: 1.363 ± 0.843
CysTrp: 0.454 ± 0.435
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.269 ± 1.278
AspCys: 1.817 ± 1.351
AspAsp: 2.726 ± 1.312
AspGlu: 1.817 ± 0.71
AspPhe: 1.817 ± 0.71
AspGly: 8.178 ± 2.432
AspHis: 0.454 ± 0.435
AspIle: 1.363 ± 0.589
AspLys: 1.363 ± 0.595
AspLeu: 3.18 ± 1.416
AspMet: 0.0 ± 0.0
AspAsn: 0.909 ± 0.613
AspPro: 3.18 ± 0.859
AspGln: 2.726 ± 0.855
AspArg: 2.272 ± 0.645
AspSer: 1.817 ± 0.771
AspThr: 3.635 ± 1.301
AspVal: 4.089 ± 1.333
AspTrp: 0.454 ± 0.338
AspTyr: 1.817 ± 1.042
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.543 ± 1.507
GluCys: 0.909 ± 0.675
GluAsp: 0.909 ± 0.56
GluGlu: 1.817 ± 0.922
GluPhe: 1.363 ± 1.003
GluGly: 3.18 ± 1.264
GluHis: 0.909 ± 0.468
GluIle: 1.363 ± 0.827
GluLys: 1.817 ± 0.771
GluLeu: 4.543 ± 1.337
GluMet: 0.454 ± 0.439
GluAsn: 1.817 ± 0.702
GluPro: 2.272 ± 1.368
GluGln: 2.726 ± 0.864
GluArg: 2.726 ± 1.588
GluSer: 2.726 ± 0.911
GluThr: 0.454 ± 0.338
GluVal: 1.363 ± 0.572
GluTrp: 0.454 ± 0.493
GluTyr: 0.909 ± 0.468
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.18 ± 1.233
PheCys: 0.909 ± 0.477
PheAsp: 3.18 ± 1.086
PheGlu: 0.454 ± 0.338
PhePhe: 0.909 ± 0.572
PheGly: 8.178 ± 1.687
PheHis: 0.454 ± 0.435
PheIle: 1.363 ± 0.811
PheLys: 2.272 ± 1.355
PheLeu: 1.817 ± 0.96
PheMet: 0.0 ± 0.0
PheAsn: 0.909 ± 0.503
PhePro: 3.18 ± 1.087
PheGln: 0.909 ± 0.764
PheArg: 4.998 ± 1.725
PheSer: 1.363 ± 0.756
PheThr: 0.909 ± 0.572
PheVal: 1.817 ± 1.308
PheTrp: 0.454 ± 0.43
PheTyr: 0.909 ± 0.597
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.632 ± 1.647
GlyCys: 2.272 ± 1.688
GlyAsp: 5.906 ± 2.135
GlyGlu: 4.998 ± 1.912
GlyPhe: 4.543 ± 1.354
GlyGly: 10.904 ± 2.686
GlyHis: 1.817 ± 0.652
GlyIle: 3.18 ± 1.101
GlyLys: 3.635 ± 0.758
GlyLeu: 8.632 ± 1.803
GlyMet: 2.272 ± 1.061
GlyAsn: 3.18 ± 1.07
GlyPro: 2.726 ± 1.013
GlyGln: 4.998 ± 2.012
GlyArg: 5.452 ± 1.761
GlySer: 7.269 ± 1.409
GlyThr: 7.724 ± 2.077
GlyVal: 4.089 ± 1.278
GlyTrp: 2.726 ± 0.759
GlyTyr: 4.998 ± 2.306
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.817 ± 1.147
HisCys: 0.0 ± 0.0
HisAsp: 0.909 ± 0.503
HisGlu: 0.454 ± 0.514
HisPhe: 1.363 ± 0.63
HisGly: 1.817 ± 0.953
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.454 ± 0.439
HisLeu: 1.363 ± 0.905
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.909 ± 0.705
HisGln: 0.0 ± 0.0
HisArg: 2.272 ± 1.112
HisSer: 0.0 ± 0.0
HisThr: 1.363 ± 0.935
HisVal: 2.272 ± 1.037
HisTrp: 0.454 ± 0.435
HisTyr: 0.909 ± 0.86
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.906 ± 1.228
IleCys: 1.363 ± 0.636
IleAsp: 4.089 ± 1.083
IleGlu: 1.817 ± 0.891
IlePhe: 1.817 ± 1.244
IleGly: 4.543 ± 1.173
IleHis: 0.454 ± 0.514
IleIle: 1.363 ± 1.046
IleLys: 1.817 ± 1.002
IleLeu: 1.363 ± 0.801
IleMet: 1.817 ± 1.144
IleAsn: 0.454 ± 0.518
IlePro: 1.817 ± 0.769
IleGln: 1.817 ± 0.916
IleArg: 2.726 ± 1.174
IleSer: 0.909 ± 0.731
IleThr: 2.726 ± 0.893
IleVal: 1.817 ± 0.833
IleTrp: 0.454 ± 0.526
IleTyr: 0.454 ± 0.493
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.543 ± 1.573
LysCys: 0.454 ± 0.338
LysAsp: 2.726 ± 0.741
LysGlu: 0.454 ± 0.439
LysPhe: 1.817 ± 0.759
LysGly: 2.726 ± 1.154
LysHis: 1.363 ± 0.941
LysIle: 0.454 ± 0.526
LysLys: 1.817 ± 0.853
LysLeu: 1.363 ± 1.232
LysMet: 0.909 ± 0.619
LysAsn: 1.817 ± 0.997
LysPro: 2.272 ± 0.898
LysGln: 0.454 ± 0.439
LysArg: 4.543 ± 1.477
LysSer: 4.089 ± 0.817
LysThr: 2.726 ± 1.16
LysVal: 3.18 ± 1.367
LysTrp: 1.817 ± 0.957
LysTyr: 0.909 ± 0.705
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.085 ± 3.036
LeuCys: 2.272 ± 1.508
LeuAsp: 4.998 ± 1.699
LeuGlu: 2.272 ± 0.873
LeuPhe: 3.18 ± 1.954
LeuGly: 5.452 ± 1.433
LeuHis: 2.272 ± 0.874
LeuIle: 4.089 ± 1.494
LeuLys: 2.726 ± 1.037
LeuLeu: 5.906 ± 2.334
LeuMet: 2.726 ± 0.641
LeuAsn: 0.909 ± 0.558
LeuPro: 4.543 ± 1.829
LeuGln: 2.272 ± 0.724
LeuArg: 4.543 ± 1.221
LeuSer: 2.272 ± 0.933
LeuThr: 6.361 ± 2.646
LeuVal: 9.995 ± 3.138
LeuTrp: 2.272 ± 1.012
LeuTyr: 1.363 ± 0.905
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.543 ± 1.147
MetCys: 0.454 ± 0.338
MetAsp: 0.0 ± 0.0
MetGlu: 0.454 ± 0.439
MetPhe: 0.0 ± 0.0
MetGly: 1.363 ± 0.746
MetHis: 0.0 ± 0.0
MetIle: 0.454 ± 0.439
MetLys: 0.454 ± 0.443
MetLeu: 1.817 ± 0.757
MetMet: 1.817 ± 1.018
MetAsn: 0.0 ± 0.0
MetPro: 2.272 ± 0.894
MetGln: 1.817 ± 0.907
MetArg: 1.363 ± 0.831
MetSer: 3.635 ± 0.844
MetThr: 2.726 ± 1.086
MetVal: 1.817 ± 1.13
MetTrp: 0.0 ± 0.0
MetTyr: 0.454 ± 0.43
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.18 ± 1.07
AsnCys: 0.909 ± 0.477
AsnAsp: 1.363 ± 0.688
AsnGlu: 1.363 ± 0.935
AsnPhe: 0.454 ± 0.518
AsnGly: 4.543 ± 1.664
AsnHis: 0.0 ± 0.0
AsnIle: 0.909 ± 0.477
AsnLys: 1.817 ± 0.81
AsnLeu: 0.454 ± 0.435
AsnMet: 0.0 ± 0.0
AsnAsn: 1.363 ± 0.728
AsnPro: 1.817 ± 1.033
AsnGln: 0.454 ± 0.439
AsnArg: 0.909 ± 0.86
AsnSer: 0.454 ± 0.338
AsnThr: 0.454 ± 0.338
AsnVal: 1.817 ± 0.993
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.909 ± 0.63
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.543 ± 1.509
ProCys: 0.454 ± 0.493
ProAsp: 3.18 ± 1.649
ProGlu: 3.18 ± 1.378
ProPhe: 1.363 ± 0.66
ProGly: 4.543 ± 1.565
ProHis: 0.909 ± 0.86
ProIle: 1.817 ± 1.021
ProLys: 2.726 ± 0.881
ProLeu: 2.272 ± 1.082
ProMet: 0.909 ± 0.595
ProAsn: 0.909 ± 0.879
ProPro: 1.363 ± 1.013
ProGln: 0.909 ± 0.629
ProArg: 3.635 ± 1.985
ProSer: 3.18 ± 1.157
ProThr: 5.452 ± 1.298
ProVal: 3.635 ± 1.612
ProTrp: 3.18 ± 0.811
ProTyr: 0.909 ± 0.589
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.18 ± 1.018
GlnCys: 0.909 ± 0.468
GlnAsp: 0.909 ± 0.468
GlnGlu: 1.363 ± 0.843
GlnPhe: 1.363 ± 0.783
GlnGly: 4.998 ± 1.738
GlnHis: 0.909 ± 0.558
GlnIle: 1.817 ± 0.72
GlnLys: 0.909 ± 0.595
GlnLeu: 2.726 ± 0.969
GlnMet: 0.454 ± 0.511
GlnAsn: 0.909 ± 0.589
GlnPro: 4.089 ± 1.683
GlnGln: 3.635 ± 3.036
GlnArg: 3.18 ± 1.789
GlnSer: 1.817 ± 0.75
GlnThr: 1.817 ± 0.71
GlnVal: 1.817 ± 0.872
GlnTrp: 2.272 ± 1.201
GlnTyr: 0.454 ± 0.557
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.815 ± 2.259
ArgCys: 0.0 ± 0.0
ArgAsp: 5.452 ± 1.728
ArgGlu: 4.998 ± 2.249
ArgPhe: 1.817 ± 0.877
ArgGly: 6.361 ± 1.328
ArgHis: 0.909 ± 0.558
ArgIle: 4.998 ± 0.862
ArgLys: 2.726 ± 1.101
ArgLeu: 7.724 ± 1.342
ArgMet: 0.909 ± 0.86
ArgAsn: 2.726 ± 1.087
ArgPro: 1.363 ± 0.596
ArgGln: 1.817 ± 1.308
ArgArg: 6.815 ± 2.365
ArgSer: 4.089 ± 1.45
ArgThr: 4.089 ± 1.194
ArgVal: 2.726 ± 0.915
ArgTrp: 2.726 ± 1.085
ArgTyr: 1.363 ± 0.74
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.45 ± 2.19
SerCys: 2.726 ± 1.613
SerAsp: 2.272 ± 1.212
SerGlu: 1.817 ± 1.05
SerPhe: 2.272 ± 1.32
SerGly: 6.361 ± 2.186
SerHis: 0.454 ± 0.557
SerIle: 2.272 ± 0.925
SerLys: 2.726 ± 1.154
SerLeu: 4.998 ± 1.131
SerMet: 1.363 ± 0.639
SerAsn: 0.454 ± 0.338
SerPro: 3.635 ± 1.011
SerGln: 2.272 ± 1.688
SerArg: 2.272 ± 0.848
SerSer: 4.543 ± 1.119
SerThr: 1.817 ± 1.346
SerVal: 2.272 ± 1.282
SerTrp: 0.909 ± 1.028
SerTyr: 0.454 ± 0.523
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 10.45 ± 2.058
ThrCys: 3.635 ± 1.252
ThrAsp: 0.454 ± 0.523
ThrGlu: 3.18 ± 1.006
ThrPhe: 1.817 ± 0.933
ThrGly: 6.815 ± 1.819
ThrHis: 1.817 ± 0.579
ThrIle: 2.726 ± 0.826
ThrLys: 0.909 ± 0.879
ThrLeu: 3.18 ± 1.023
ThrMet: 1.817 ± 0.876
ThrAsn: 0.454 ± 0.523
ThrPro: 2.272 ± 1.338
ThrGln: 3.18 ± 1.383
ThrArg: 4.543 ± 1.06
ThrSer: 2.272 ± 1.242
ThrThr: 3.18 ± 2.527
ThrVal: 4.089 ± 1.688
ThrTrp: 0.454 ± 0.338
ThrTyr: 1.363 ± 0.589
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.815 ± 2.232
ValCys: 0.909 ± 0.675
ValAsp: 2.272 ± 0.817
ValGlu: 1.363 ± 0.595
ValPhe: 3.635 ± 1.138
ValGly: 6.815 ± 2.437
ValHis: 1.363 ± 0.565
ValIle: 2.726 ± 0.976
ValLys: 0.909 ± 0.607
ValLeu: 10.904 ± 4.563
ValMet: 2.726 ± 0.81
ValAsn: 0.454 ± 0.338
ValPro: 2.726 ± 0.824
ValGln: 2.272 ± 1.14
ValArg: 4.089 ± 1.376
ValSer: 3.635 ± 1.674
ValThr: 2.726 ± 1.29
ValVal: 5.452 ± 1.806
ValTrp: 3.18 ± 1.276
ValTyr: 1.363 ± 0.811
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.726 ± 1.684
TrpCys: 0.454 ± 0.338
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.454 ± 0.439
TrpPhe: 1.817 ± 0.727
TrpGly: 1.817 ± 0.848
TrpHis: 0.0 ± 0.0
TrpIle: 1.817 ± 1.061
TrpLys: 2.726 ± 1.228
TrpLeu: 3.635 ± 2.018
TrpMet: 0.909 ± 0.551
TrpAsn: 1.817 ± 0.624
TrpPro: 1.363 ± 0.909
TrpGln: 0.0 ± 0.0
TrpArg: 1.817 ± 0.906
TrpSer: 0.909 ± 0.579
TrpThr: 1.363 ± 0.905
TrpVal: 1.817 ± 1.182
TrpTrp: 1.363 ± 0.909
TrpTyr: 1.817 ± 0.465
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.726 ± 1.01
TyrCys: 0.454 ± 0.514
TyrAsp: 0.909 ± 0.63
TyrGlu: 0.454 ± 0.493
TyrPhe: 1.817 ± 1.251
TyrGly: 3.18 ± 0.938
TyrHis: 0.454 ± 0.43
TyrIle: 0.454 ± 0.443
TyrLys: 1.363 ± 0.687
TyrLeu: 1.817 ± 0.771
TyrMet: 0.0 ± 0.0
TyrAsn: 1.363 ± 0.899
TyrPro: 0.909 ± 0.468
TyrGln: 0.454 ± 0.435
TyrArg: 2.272 ± 0.708
TyrSer: 1.817 ± 0.977
TyrThr: 1.817 ± 0.859
TyrVal: 4.089 ± 1.261
TyrTrp: 0.909 ± 0.661
TyrTyr: 1.363 ± 0.624
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (2202 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
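One way such a protein-level bootstrap can be set up is sketched below in Python. This is an assumption about the general technique, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation across replicates is taken as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per-mille frequency of each observed dipeptide (overlapping pairs)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freqs(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


# Toy input, not the phage proteome.
toy = ["MAALGA", "MKAAVL", "MGGSAA"]
print(dipeptide_freqs(toy)["AA"], "+/-", bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the dependence between neighbouring residues within each protein intact. With only 12 proteins, as here, the resulting error estimates are necessarily coarse.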

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski