Amino acid dipeptide frequency for Flavobacterium phage FLiP

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random with equal probability, each of the 400 would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
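As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping dipeptides within each protein, expresses them in per mille, and compares them with the uniform expectation of 1/400 (2.5‰). The function name, the toy sequences, and the choice to normalize by the total number of dipeptides are assumptions made for this example; they are not necessarily the procedure used by Proteome-pI.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are counted as overlapping windows inside each protein,
    never across protein boundaries. Non-standard residues become 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# 2.5 per mille if all 400 standard dipeptides were equally likely.
EXPECTED_UNIFORM = 1000.0 / 400

# Toy input (hypothetical sequences, not taken from the FLiP proteome).
toy = ["MKKLAV", "MKKA"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_UNIFORM else "under"
    print(f"{pair}: {value:.1f} per mille ({label}represented), "
          f"fraction {value * 1e-3:.4f}")
```

With only eight dipeptides in the toy input every observed pair is far above 2.5‰; on a real proteome the same comparison separates over- from underrepresented pairs.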

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.381 ± 0.376
AlaCys: 0.763 ± 0.5
AlaAsp: 2.67 ± 0.686
AlaGlu: 3.814 ± 1.238
AlaPhe: 4.195 ± 1.373
AlaGly: 3.051 ± 1.714
AlaHis: 1.907 ± 0.711
AlaIle: 2.288 ± 1.088
AlaLys: 6.102 ± 1.795
AlaLeu: 4.958 ± 1.234
AlaMet: 1.144 ± 0.619
AlaAsn: 3.814 ± 1.139
AlaPro: 0.381 ± 0.406
AlaGln: 1.526 ± 0.706
AlaArg: 1.144 ± 0.537
AlaSer: 3.432 ± 1.342
AlaThr: 3.051 ± 1.12
AlaVal: 6.102 ± 1.359
AlaTrp: 0.381 ± 0.371
AlaTyr: 2.67 ± 0.638
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.381 ± 0.371
CysCys: 0.0 ± 0.0
CysAsp: 0.381 ± 0.376
CysGlu: 0.0 ± 0.0
CysPhe: 1.144 ± 0.558
CysGly: 0.381 ± 0.376
CysHis: 0.0 ± 0.0
CysIle: 1.144 ± 0.853
CysLys: 0.381 ± 0.376
CysLeu: 0.763 ± 0.481
CysMet: 0.0 ± 0.0
CysAsn: 0.763 ± 0.431
CysPro: 0.381 ± 0.344
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.381 ± 0.378
CysThr: 1.144 ± 0.571
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.763 ± 0.393
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.526 ± 0.757
AspCys: 0.763 ± 0.602
AspAsp: 2.288 ± 0.769
AspGlu: 1.144 ± 0.687
AspPhe: 4.195 ± 1.199
AspGly: 1.907 ± 1.099
AspHis: 0.381 ± 0.344
AspIle: 6.865 ± 0.858
AspLys: 6.484 ± 1.808
AspLeu: 7.246 ± 1.563
AspMet: 1.144 ± 0.619
AspAsn: 2.67 ± 1.059
AspPro: 2.288 ± 0.805
AspGln: 2.67 ± 0.772
AspArg: 3.432 ± 1.632
AspSer: 4.577 ± 1.348
AspThr: 2.288 ± 1.241
AspVal: 6.102 ± 2.423
AspTrp: 1.144 ± 0.611
AspTyr: 3.814 ± 1.346
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.67 ± 1.05
GluCys: 0.381 ± 0.376
GluAsp: 2.288 ± 1.017
GluGlu: 5.721 ± 1.794
GluPhe: 3.051 ± 1.106
GluGly: 1.526 ± 0.53
GluHis: 0.381 ± 0.376
GluIle: 4.958 ± 1.175
GluLys: 5.339 ± 1.224
GluLeu: 5.339 ± 1.087
GluMet: 2.67 ± 0.875
GluAsn: 4.195 ± 1.08
GluPro: 2.67 ± 1.391
GluGln: 1.907 ± 0.775
GluArg: 2.288 ± 0.73
GluSer: 4.577 ± 1.39
GluThr: 1.526 ± 0.573
GluVal: 2.288 ± 0.749
GluTrp: 0.763 ± 0.572
GluTyr: 2.67 ± 0.664
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.432 ± 1.212
PheCys: 0.763 ± 0.512
PheAsp: 5.721 ± 1.445
PheGlu: 1.907 ± 0.758
PhePhe: 3.432 ± 0.904
PheGly: 1.526 ± 0.84
PheHis: 0.381 ± 0.378
PheIle: 5.721 ± 1.144
PheLys: 6.865 ± 1.687
PheLeu: 5.721 ± 1.442
PheMet: 1.526 ± 0.577
PheAsn: 5.721 ± 1.301
PhePro: 1.526 ± 1.14
PheGln: 1.907 ± 0.618
PheArg: 2.67 ± 1.04
PheSer: 4.195 ± 0.962
PheThr: 3.814 ± 1.521
PheVal: 5.339 ± 1.392
PheTrp: 1.144 ± 0.488
PheTyr: 2.288 ± 0.852
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.051 ± 1.666
GlyCys: 0.0 ± 0.0
GlyAsp: 2.67 ± 1.071
GlyGlu: 2.288 ± 1.714
GlyPhe: 3.432 ± 1.534
GlyGly: 4.577 ± 1.82
GlyHis: 0.0 ± 0.0
GlyIle: 3.814 ± 0.96
GlyLys: 3.051 ± 0.749
GlyLeu: 3.814 ± 0.826
GlyMet: 1.144 ± 0.602
GlyAsn: 1.907 ± 0.709
GlyPro: 0.381 ± 0.406
GlyGln: 1.526 ± 0.806
GlyArg: 0.763 ± 0.532
GlySer: 3.432 ± 0.798
GlyThr: 1.907 ± 1.134
GlyVal: 6.484 ± 1.379
GlyTrp: 1.144 ± 0.578
GlyTyr: 2.67 ± 1.104
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.907 ± 0.719
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.763 ± 0.455
HisPhe: 2.67 ± 0.856
HisGly: 0.381 ± 0.355
HisHis: 0.381 ± 0.301
HisIle: 1.144 ± 0.79
HisLys: 0.381 ± 0.301
HisLeu: 3.432 ± 0.778
HisMet: 0.0 ± 0.0
HisAsn: 2.288 ± 1.2
HisPro: 0.0 ± 0.0
HisGln: 0.381 ± 0.376
HisArg: 0.381 ± 0.371
HisSer: 0.763 ± 0.462
HisThr: 1.144 ± 0.611
HisVal: 0.381 ± 0.378
HisTrp: 0.0 ± 0.0
HisTyr: 0.381 ± 0.301
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.051 ± 1.357
IleCys: 0.763 ± 0.507
IleAsp: 6.484 ± 1.341
IleGlu: 4.958 ± 1.274
IlePhe: 2.67 ± 1.141
IleGly: 3.814 ± 1.0
IleHis: 1.907 ± 0.677
IleIle: 4.577 ± 1.304
IleLys: 4.577 ± 1.622
IleLeu: 6.102 ± 1.37
IleMet: 1.526 ± 0.652
IleAsn: 3.814 ± 0.84
IlePro: 4.577 ± 1.149
IleGln: 2.288 ± 1.1
IleArg: 2.288 ± 0.712
IleSer: 4.195 ± 0.826
IleThr: 2.67 ± 0.667
IleVal: 3.432 ± 1.488
IleTrp: 0.381 ± 0.344
IleTyr: 2.288 ± 0.745
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.484 ± 1.874
LysCys: 0.763 ± 0.455
LysAsp: 4.577 ± 1.258
LysGlu: 6.102 ± 1.396
LysPhe: 5.339 ± 1.712
LysGly: 3.814 ± 1.488
LysHis: 1.144 ± 0.683
LysIle: 5.721 ± 1.839
LysLys: 9.535 ± 1.986
LysLeu: 5.339 ± 0.946
LysMet: 3.051 ± 0.87
LysAsn: 6.484 ± 1.35
LysPro: 1.907 ± 0.763
LysGln: 1.144 ± 0.583
LysArg: 3.814 ± 0.87
LysSer: 4.195 ± 1.545
LysThr: 4.958 ± 1.547
LysVal: 6.102 ± 1.35
LysTrp: 1.144 ± 0.642
LysTyr: 3.814 ± 1.065
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.339 ± 1.279
LeuCys: 1.526 ± 0.699
LeuAsp: 6.102 ± 1.266
LeuGlu: 6.102 ± 1.2
LeuPhe: 4.195 ± 1.123
LeuGly: 6.484 ± 1.484
LeuHis: 1.144 ± 0.904
LeuIle: 4.958 ± 1.268
LeuLys: 8.772 ± 1.925
LeuLeu: 4.958 ± 1.502
LeuMet: 4.195 ± 1.018
LeuAsn: 3.051 ± 0.902
LeuPro: 2.288 ± 0.622
LeuGln: 1.526 ± 0.57
LeuArg: 3.051 ± 1.056
LeuSer: 8.009 ± 2.597
LeuThr: 5.721 ± 1.212
LeuVal: 6.102 ± 1.472
LeuTrp: 0.381 ± 0.371
LeuTyr: 3.432 ± 1.196
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.288 ± 1.099
MetCys: 0.0 ± 0.0
MetAsp: 0.763 ± 0.481
MetGlu: 3.051 ± 0.984
MetPhe: 0.381 ± 0.344
MetGly: 0.763 ± 0.493
MetHis: 0.381 ± 0.301
MetIle: 0.763 ± 0.456
MetLys: 3.051 ± 0.908
MetLeu: 1.907 ± 0.668
MetMet: 0.0 ± 0.0
MetAsn: 2.288 ± 0.908
MetPro: 0.381 ± 0.301
MetGln: 1.526 ± 0.598
MetArg: 0.763 ± 0.478
MetSer: 2.67 ± 0.823
MetThr: 1.907 ± 1.199
MetVal: 0.381 ± 0.376
MetTrp: 0.0 ± 0.0
MetTyr: 1.144 ± 0.625
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.67 ± 0.747
AsnCys: 0.381 ± 0.364
AsnAsp: 4.577 ± 1.302
AsnGlu: 2.288 ± 0.872
AsnPhe: 3.814 ± 0.639
AsnGly: 4.195 ± 2.117
AsnHis: 1.144 ± 0.489
AsnIle: 4.195 ± 1.823
AsnLys: 4.195 ± 1.093
AsnLeu: 7.628 ± 1.192
AsnMet: 0.763 ± 0.575
AsnAsn: 1.526 ± 0.891
AsnPro: 1.907 ± 0.551
AsnGln: 4.958 ± 1.185
AsnArg: 3.051 ± 0.873
AsnSer: 3.432 ± 0.756
AsnThr: 4.958 ± 1.05
AsnVal: 3.051 ± 0.781
AsnTrp: 0.763 ± 0.462
AsnTyr: 2.67 ± 0.943
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.67 ± 1.237
ProCys: 0.0 ± 0.0
ProAsp: 2.67 ± 0.813
ProGlu: 1.526 ± 0.587
ProPhe: 4.577 ± 1.272
ProGly: 0.0 ± 0.0
ProHis: 0.381 ± 0.301
ProIle: 1.144 ± 0.565
ProLys: 1.526 ± 0.567
ProLeu: 3.051 ± 1.028
ProMet: 0.763 ± 0.729
ProAsn: 2.67 ± 0.849
ProPro: 0.763 ± 0.711
ProGln: 0.381 ± 0.4
ProArg: 1.526 ± 0.884
ProSer: 2.67 ± 1.026
ProThr: 1.144 ± 0.666
ProVal: 1.907 ± 0.668
ProTrp: 0.0 ± 0.0
ProTyr: 1.144 ± 0.792
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.526 ± 0.79
GlnCys: 0.0 ± 0.0
GlnAsp: 2.67 ± 1.235
GlnGlu: 2.288 ± 0.856
GlnPhe: 2.67 ± 0.976
GlnGly: 1.907 ± 0.797
GlnHis: 0.0 ± 0.0
GlnIle: 2.288 ± 0.846
GlnLys: 3.814 ± 0.86
GlnLeu: 4.195 ± 1.056
GlnMet: 0.763 ± 0.572
GlnAsn: 1.526 ± 0.907
GlnPro: 0.0 ± 0.0
GlnGln: 1.526 ± 0.866
GlnArg: 2.288 ± 1.099
GlnSer: 1.907 ± 0.701
GlnThr: 2.288 ± 1.134
GlnVal: 1.907 ± 0.935
GlnTrp: 0.763 ± 0.455
GlnTyr: 1.526 ± 0.903
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.144 ± 0.644
ArgCys: 0.0 ± 0.0
ArgAsp: 1.907 ± 0.64
ArgGlu: 1.907 ± 0.623
ArgPhe: 1.907 ± 1.041
ArgGly: 3.051 ± 0.845
ArgHis: 0.763 ± 0.393
ArgIle: 1.526 ± 0.521
ArgLys: 2.288 ± 1.242
ArgLeu: 2.288 ± 0.711
ArgMet: 1.526 ± 0.669
ArgAsn: 1.526 ± 0.701
ArgPro: 1.907 ± 0.937
ArgGln: 0.763 ± 0.575
ArgArg: 3.432 ± 3.057
ArgSer: 3.814 ± 1.077
ArgThr: 3.432 ± 1.539
ArgVal: 3.432 ± 1.361
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.432 ± 1.076
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.958 ± 1.437
SerCys: 0.381 ± 0.4
SerAsp: 4.958 ± 1.183
SerGlu: 4.195 ± 1.251
SerPhe: 6.102 ± 1.496
SerGly: 4.195 ± 1.213
SerHis: 3.051 ± 0.898
SerIle: 2.288 ± 0.846
SerLys: 4.577 ± 1.436
SerLeu: 3.432 ± 0.742
SerMet: 1.144 ± 0.758
SerAsn: 5.721 ± 1.406
SerPro: 3.814 ± 1.207
SerGln: 2.67 ± 1.061
SerArg: 0.381 ± 0.364
SerSer: 3.051 ± 1.064
SerThr: 4.577 ± 1.108
SerVal: 5.721 ± 1.207
SerTrp: 0.381 ± 0.371
SerTyr: 4.195 ± 1.261
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.577 ± 1.479
ThrCys: 0.0 ± 0.0
ThrAsp: 2.67 ± 0.715
ThrGlu: 1.144 ± 0.758
ThrPhe: 3.051 ± 1.077
ThrGly: 2.67 ± 1.166
ThrHis: 0.381 ± 0.344
ThrIle: 5.339 ± 1.553
ThrLys: 6.865 ± 1.458
ThrLeu: 4.195 ± 1.703
ThrMet: 0.381 ± 0.376
ThrAsn: 3.432 ± 1.129
ThrPro: 1.907 ± 0.547
ThrGln: 2.67 ± 0.686
ThrArg: 3.051 ± 0.882
ThrSer: 3.051 ± 1.407
ThrThr: 4.195 ± 1.561
ThrVal: 2.67 ± 0.729
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.432 ± 1.427
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.195 ± 1.477
ValCys: 1.144 ± 0.518
ValAsp: 6.102 ± 1.041
ValGlu: 3.051 ± 0.802
ValPhe: 1.907 ± 0.952
ValGly: 1.526 ± 0.933
ValHis: 1.526 ± 1.143
ValIle: 3.051 ± 1.099
ValLys: 5.339 ± 1.367
ValLeu: 6.102 ± 1.114
ValMet: 1.526 ± 0.672
ValAsn: 5.721 ± 0.917
ValPro: 1.526 ± 0.782
ValGln: 4.195 ± 1.668
ValArg: 4.195 ± 1.293
ValSer: 6.484 ± 1.062
ValThr: 3.051 ± 0.896
ValVal: 4.577 ± 1.7
ValTrp: 0.381 ± 0.301
ValTyr: 4.958 ± 1.181
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.381 ± 0.301
TrpGlu: 0.381 ± 0.376
TrpPhe: 2.288 ± 0.946
TrpGly: 1.144 ± 0.588
TrpHis: 0.0 ± 0.0
TrpIle: 0.763 ± 0.543
TrpLys: 0.0 ± 0.0
TrpLeu: 1.526 ± 0.861
TrpMet: 0.0 ± 0.0
TrpAsn: 1.144 ± 0.812
TrpPro: 0.381 ± 0.376
TrpGln: 0.381 ± 0.364
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.763 ± 0.431
TrpVal: 0.381 ± 0.371
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.381 ± 0.406
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.526 ± 0.846
TyrCys: 0.381 ± 0.344
TyrAsp: 3.051 ± 0.784
TyrGlu: 4.577 ± 1.097
TyrPhe: 4.577 ± 1.099
TyrGly: 1.144 ± 0.486
TyrHis: 1.526 ± 0.617
TyrIle: 3.814 ± 0.771
TyrLys: 2.67 ± 1.016
TyrLeu: 5.339 ± 1.113
TyrMet: 0.763 ± 0.411
TyrAsn: 1.907 ± 0.629
TyrPro: 1.526 ± 0.851
TyrGln: 1.907 ± 0.861
TyrArg: 1.526 ± 0.671
TyrSer: 4.958 ± 0.938
TyrThr: 1.144 ± 0.655
TyrVal: 4.195 ± 1.275
TyrTrp: 1.144 ± 0.797
TyrTyr: 1.907 ± 0.75
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
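For readers who want to work with these values programmatically, here is a minimal sketch of parsing rows of the form `AlaAla: 0.381 ± 0.376` into one-letter dipeptides with a mean and an error. The regular expression also tolerates a repeated value fused in front of the dipeptide name, as some text exports of this page produce. The names `ROW`, `THREE_TO_ONE`, and `parse_row` are hypothetical helpers for this example only.

```python
import re

# Three-letter to one-letter amino acid codes; Xaa (unknown) maps to X.
THREE_TO_ONE = {
    "Ala": "A", "Cys": "C", "Asp": "D", "Glu": "E", "Phe": "F", "Gly": "G",
    "His": "H", "Ile": "I", "Lys": "K", "Leu": "L", "Met": "M", "Asn": "N",
    "Pro": "P", "Gln": "Q", "Arg": "R", "Ser": "S", "Thr": "T", "Val": "V",
    "Trp": "W", "Tyr": "Y", "Xaa": "X",
}

# Optional leading number, two three-letter codes, then "mean ± error".
ROW = re.compile(
    r"^(?:\d+(?:\.\d+)?)?([A-Z][a-z]{2})([A-Z][a-z]{2}): "
    r"(\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (dipeptide, mean_per_mille, error_per_mille) or None.

    Lines that are not data rows (for example the row labels 'Ala', 'Cys')
    or that use an unknown three-letter code yield None.
    """
    match = ROW.match(line.strip())
    if match is None:
        return None
    first, second, mean, err = match.groups()
    if first not in THREE_TO_ONE or second not in THREE_TO_ONE:
        return None
    return THREE_TO_ONE[first] + THREE_TO_ONE[second], float(mean), float(err)


print(parse_row("LysLys: 9.535 ± 1.986"))
print(parse_row("Ala"))
```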
Statistics based on 16 proteins (2623 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
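A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread over the replicates is reported as the error. Taking the standard deviation as the error measure, the function names, and the toy sequences are all assumptions made for illustration; the page does not state which statistic Proteome-pI reports.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_frequency(sample, pair))
    return statistics.pstdev(values)


# Toy input (hypothetical sequences, not taken from the FLiP proteome).
toy = ["MKKLAV", "MKKA", "AVLLKK", "GGSKK"]
mean = dipeptide_frequency(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {mean:.1f} ± {error:.1f} per mille")
```

A dipeptide that never occurs has zero frequency in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.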

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski