Amino acid dipepetide frequency for Pseudoalteromonas phage C5a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.651AlaAla: 9.651 ± 1.293
0.682AlaCys: 0.682 ± 0.32
4.874AlaAsp: 4.874 ± 0.676
4.972AlaGlu: 4.972 ± 0.853
3.022AlaPhe: 3.022 ± 0.634
7.311AlaGly: 7.311 ± 0.912
1.657AlaHis: 1.657 ± 0.465
5.752AlaIle: 5.752 ± 0.765
5.362AlaLys: 5.362 ± 0.659
8.286AlaLeu: 8.286 ± 1.029
3.217AlaMet: 3.217 ± 0.534
5.947AlaAsn: 5.947 ± 0.847
2.827AlaPro: 2.827 ± 0.386
4.972AlaGln: 4.972 ± 0.78
3.022AlaArg: 3.022 ± 0.665
5.752AlaSer: 5.752 ± 0.856
5.362AlaThr: 5.362 ± 0.924
5.264AlaVal: 5.264 ± 1.003
1.56AlaTrp: 1.56 ± 0.491
2.535AlaTyr: 2.535 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
0.585CysAla: 0.585 ± 0.255
0.097CysCys: 0.097 ± 0.093
0.682CysAsp: 0.682 ± 0.214
0.78CysGlu: 0.78 ± 0.256
0.195CysPhe: 0.195 ± 0.147
0.682CysGly: 0.682 ± 0.261
0.097CysHis: 0.097 ± 0.11
0.682CysIle: 0.682 ± 0.263
1.17CysLys: 1.17 ± 0.327
0.682CysLeu: 0.682 ± 0.223
0.195CysMet: 0.195 ± 0.125
0.195CysAsn: 0.195 ± 0.117
0.487CysPro: 0.487 ± 0.241
0.292CysGln: 0.292 ± 0.17
0.292CysArg: 0.292 ± 0.23
0.487CysSer: 0.487 ± 0.199
0.78CysThr: 0.78 ± 0.284
0.195CysVal: 0.195 ± 0.125
0.0CysTrp: 0.0 ± 0.0
0.195CysTyr: 0.195 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.557AspAla: 5.557 ± 0.573
0.585AspCys: 0.585 ± 0.26
3.412AspAsp: 3.412 ± 0.57
5.069AspGlu: 5.069 ± 0.712
2.827AspPhe: 2.827 ± 0.427
2.535AspGly: 2.535 ± 0.629
1.95AspHis: 1.95 ± 0.36
5.752AspIle: 5.752 ± 0.702
3.607AspLys: 3.607 ± 0.575
5.459AspLeu: 5.459 ± 0.648
1.365AspMet: 1.365 ± 0.267
2.73AspAsn: 2.73 ± 0.495
2.632AspPro: 2.632 ± 0.712
2.632AspGln: 2.632 ± 0.595
1.56AspArg: 1.56 ± 0.337
3.314AspSer: 3.314 ± 0.64
3.12AspThr: 3.12 ± 0.489
3.997AspVal: 3.997 ± 0.708
0.585AspTrp: 0.585 ± 0.208
2.242AspTyr: 2.242 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
3.509GluAla: 3.509 ± 0.498
0.292GluCys: 0.292 ± 0.174
3.314GluAsp: 3.314 ± 0.533
3.509GluGlu: 3.509 ± 0.528
2.925GluPhe: 2.925 ± 0.577
3.704GluGly: 3.704 ± 0.722
1.267GluHis: 1.267 ± 0.32
3.802GluIle: 3.802 ± 0.593
3.412GluLys: 3.412 ± 0.603
6.434GluLeu: 6.434 ± 0.662
1.56GluMet: 1.56 ± 0.361
3.022GluAsn: 3.022 ± 0.662
2.632GluPro: 2.632 ± 0.621
3.704GluGln: 3.704 ± 0.623
2.925GluArg: 2.925 ± 0.708
3.217GluSer: 3.217 ± 0.607
3.12GluThr: 3.12 ± 0.591
2.827GluVal: 2.827 ± 0.36
0.585GluTrp: 0.585 ± 0.218
2.242GluTyr: 2.242 ± 0.372
0.0GluXaa: 0.0 ± 0.0
Phe
3.509PheAla: 3.509 ± 0.483
0.39PheCys: 0.39 ± 0.238
3.12PheAsp: 3.12 ± 0.573
2.145PheGlu: 2.145 ± 0.39
1.462PhePhe: 1.462 ± 0.465
2.535PheGly: 2.535 ± 0.4
0.585PheHis: 0.585 ± 0.311
3.217PheIle: 3.217 ± 0.699
2.535PheLys: 2.535 ± 0.459
2.925PheLeu: 2.925 ± 0.744
0.78PheMet: 0.78 ± 0.227
2.145PheAsn: 2.145 ± 0.533
1.267PhePro: 1.267 ± 0.335
0.78PheGln: 0.78 ± 0.24
1.462PheArg: 1.462 ± 0.412
3.412PheSer: 3.412 ± 0.837
3.022PheThr: 3.022 ± 0.815
3.607PheVal: 3.607 ± 0.611
0.682PheTrp: 0.682 ± 0.283
1.17PheTyr: 1.17 ± 0.258
0.0PheXaa: 0.0 ± 0.0
Gly
4.777GlyAla: 4.777 ± 0.631
0.78GlyCys: 0.78 ± 0.24
3.997GlyAsp: 3.997 ± 0.543
4.484GlyGlu: 4.484 ± 0.58
2.827GlyPhe: 2.827 ± 0.458
4.679GlyGly: 4.679 ± 0.849
1.365GlyHis: 1.365 ± 0.426
3.899GlyIle: 3.899 ± 0.767
4.289GlyLys: 4.289 ± 0.736
6.531GlyLeu: 6.531 ± 1.178
1.56GlyMet: 1.56 ± 0.472
3.607GlyAsn: 3.607 ± 0.645
1.365GlyPro: 1.365 ± 0.355
2.827GlyGln: 2.827 ± 0.491
3.314GlyArg: 3.314 ± 0.641
3.899GlySer: 3.899 ± 0.596
3.412GlyThr: 3.412 ± 0.761
4.484GlyVal: 4.484 ± 0.705
0.78GlyTrp: 0.78 ± 0.275
2.34GlyTyr: 2.34 ± 0.569
0.0GlyXaa: 0.0 ± 0.0
His
1.56HisAla: 1.56 ± 0.436
0.097HisCys: 0.097 ± 0.09
1.267HisAsp: 1.267 ± 0.443
0.585HisGlu: 0.585 ± 0.221
1.072HisPhe: 1.072 ± 0.347
1.56HisGly: 1.56 ± 0.434
0.487HisHis: 0.487 ± 0.248
0.975HisIle: 0.975 ± 0.325
2.34HisLys: 2.34 ± 0.573
1.852HisLeu: 1.852 ± 0.478
0.682HisMet: 0.682 ± 0.188
0.682HisAsn: 0.682 ± 0.331
0.585HisPro: 0.585 ± 0.262
0.682HisGln: 0.682 ± 0.253
1.462HisArg: 1.462 ± 0.336
1.17HisSer: 1.17 ± 0.342
1.56HisThr: 1.56 ± 0.301
1.365HisVal: 1.365 ± 0.49
0.487HisTrp: 0.487 ± 0.183
0.682HisTyr: 0.682 ± 0.27
0.0HisXaa: 0.0 ± 0.0
Ile
6.726IleAla: 6.726 ± 0.814
0.292IleCys: 0.292 ± 0.188
5.167IleAsp: 5.167 ± 0.872
4.679IleGlu: 4.679 ± 0.757
2.047IlePhe: 2.047 ± 0.766
3.997IleGly: 3.997 ± 0.591
0.877IleHis: 0.877 ± 0.239
2.535IleIle: 2.535 ± 0.618
3.704IleLys: 3.704 ± 0.406
5.167IleLeu: 5.167 ± 0.731
1.462IleMet: 1.462 ± 0.455
3.607IleAsn: 3.607 ± 0.666
1.95IlePro: 1.95 ± 0.486
1.56IleGln: 1.56 ± 0.334
3.314IleArg: 3.314 ± 0.459
4.484IleSer: 4.484 ± 0.692
4.484IleThr: 4.484 ± 0.501
3.704IleVal: 3.704 ± 0.667
0.487IleTrp: 0.487 ± 0.213
1.852IleTyr: 1.852 ± 0.387
0.0IleXaa: 0.0 ± 0.0
Lys
6.531LysAla: 6.531 ± 0.953
0.097LysCys: 0.097 ± 0.085
3.12LysAsp: 3.12 ± 0.499
3.022LysGlu: 3.022 ± 0.483
2.632LysPhe: 2.632 ± 0.612
4.874LysGly: 4.874 ± 0.634
2.047LysHis: 2.047 ± 0.377
3.412LysIle: 3.412 ± 0.56
4.484LysLys: 4.484 ± 0.765
5.167LysLeu: 5.167 ± 0.611
1.56LysMet: 1.56 ± 0.38
3.802LysAsn: 3.802 ± 0.555
2.925LysPro: 2.925 ± 0.562
3.509LysGln: 3.509 ± 0.592
2.632LysArg: 2.632 ± 0.589
4.289LysSer: 4.289 ± 0.609
4.972LysThr: 4.972 ± 0.802
3.802LysVal: 3.802 ± 0.601
0.487LysTrp: 0.487 ± 0.186
2.34LysTyr: 2.34 ± 0.542
0.0LysXaa: 0.0 ± 0.0
Leu
8.481LeuAla: 8.481 ± 1.184
0.975LeuCys: 0.975 ± 0.223
4.094LeuAsp: 4.094 ± 0.659
4.094LeuGlu: 4.094 ± 0.505
3.802LeuPhe: 3.802 ± 0.686
5.752LeuGly: 5.752 ± 0.966
1.365LeuHis: 1.365 ± 0.296
5.752LeuIle: 5.752 ± 0.863
6.629LeuLys: 6.629 ± 0.71
6.434LeuLeu: 6.434 ± 0.896
2.242LeuMet: 2.242 ± 0.538
5.947LeuAsn: 5.947 ± 0.604
3.12LeuPro: 3.12 ± 0.911
3.607LeuGln: 3.607 ± 0.559
4.777LeuArg: 4.777 ± 0.97
5.459LeuSer: 5.459 ± 0.848
5.557LeuThr: 5.557 ± 0.592
5.654LeuVal: 5.654 ± 0.722
0.487LeuTrp: 0.487 ± 0.226
1.17LeuTyr: 1.17 ± 0.4
0.0LeuXaa: 0.0 ± 0.0
Met
2.047MetAla: 2.047 ± 0.518
0.195MetCys: 0.195 ± 0.141
1.072MetAsp: 1.072 ± 0.303
1.072MetGlu: 1.072 ± 0.282
0.682MetPhe: 0.682 ± 0.257
1.365MetGly: 1.365 ± 0.408
0.78MetHis: 0.78 ± 0.303
1.17MetIle: 1.17 ± 0.251
1.072MetLys: 1.072 ± 0.251
1.95MetLeu: 1.95 ± 0.362
0.097MetMet: 0.097 ± 0.107
1.95MetAsn: 1.95 ± 0.492
1.657MetPro: 1.657 ± 0.38
0.682MetGln: 0.682 ± 0.213
1.267MetArg: 1.267 ± 0.356
2.73MetSer: 2.73 ± 0.591
1.267MetThr: 1.267 ± 0.378
1.852MetVal: 1.852 ± 0.376
0.292MetTrp: 0.292 ± 0.144
0.78MetTyr: 0.78 ± 0.283
0.0MetXaa: 0.0 ± 0.0
Asn
5.459AsnAla: 5.459 ± 0.647
0.39AsnCys: 0.39 ± 0.197
3.217AsnAsp: 3.217 ± 0.502
3.802AsnGlu: 3.802 ± 0.52
2.632AsnPhe: 2.632 ± 0.501
3.607AsnGly: 3.607 ± 0.667
1.17AsnHis: 1.17 ± 0.423
3.217AsnIle: 3.217 ± 0.487
3.12AsnLys: 3.12 ± 0.508
4.289AsnLeu: 4.289 ± 0.657
1.365AsnMet: 1.365 ± 0.372
3.12AsnAsn: 3.12 ± 0.561
2.73AsnPro: 2.73 ± 0.496
2.535AsnGln: 2.535 ± 0.493
2.34AsnArg: 2.34 ± 0.38
4.874AsnSer: 4.874 ± 1.053
3.022AsnThr: 3.022 ± 0.892
3.022AsnVal: 3.022 ± 0.479
1.267AsnTrp: 1.267 ± 0.376
0.682AsnTyr: 0.682 ± 0.251
0.0AsnXaa: 0.0 ± 0.0
Pro
3.899ProAla: 3.899 ± 0.566
0.682ProCys: 0.682 ± 0.227
2.242ProAsp: 2.242 ± 0.452
2.437ProGlu: 2.437 ± 0.54
1.852ProPhe: 1.852 ± 0.498
1.755ProGly: 1.755 ± 0.372
0.877ProHis: 0.877 ± 0.272
2.242ProIle: 2.242 ± 0.473
2.242ProLys: 2.242 ± 0.578
2.827ProLeu: 2.827 ± 0.638
1.072ProMet: 1.072 ± 0.339
2.047ProAsn: 2.047 ± 0.39
1.755ProPro: 1.755 ± 0.392
1.267ProGln: 1.267 ± 0.326
1.267ProArg: 1.267 ± 0.329
2.827ProSer: 2.827 ± 0.592
2.145ProThr: 2.145 ± 0.502
3.022ProVal: 3.022 ± 0.674
0.292ProTrp: 0.292 ± 0.149
1.267ProTyr: 1.267 ± 0.392
0.0ProXaa: 0.0 ± 0.0
Gln
4.582GlnAla: 4.582 ± 0.806
0.292GlnCys: 0.292 ± 0.208
2.047GlnAsp: 2.047 ± 0.358
2.145GlnGlu: 2.145 ± 0.445
2.047GlnPhe: 2.047 ± 0.55
2.242GlnGly: 2.242 ± 0.51
1.072GlnHis: 1.072 ± 0.318
3.12GlnIle: 3.12 ± 0.499
2.632GlnLys: 2.632 ± 0.467
3.802GlnLeu: 3.802 ± 0.79
0.877GlnMet: 0.877 ± 0.323
2.34GlnAsn: 2.34 ± 0.44
1.56GlnPro: 1.56 ± 0.409
3.022GlnGln: 3.022 ± 0.535
1.852GlnArg: 1.852 ± 0.404
3.607GlnSer: 3.607 ± 0.601
2.437GlnThr: 2.437 ± 0.575
2.827GlnVal: 2.827 ± 0.574
0.682GlnTrp: 0.682 ± 0.278
1.755GlnTyr: 1.755 ± 0.461
0.0GlnXaa: 0.0 ± 0.0
Arg
3.899ArgAla: 3.899 ± 0.596
0.585ArgCys: 0.585 ± 0.244
2.34ArgAsp: 2.34 ± 0.583
2.827ArgGlu: 2.827 ± 0.757
2.34ArgPhe: 2.34 ± 0.413
2.535ArgGly: 2.535 ± 0.393
0.682ArgHis: 0.682 ± 0.268
2.535ArgIle: 2.535 ± 0.442
3.314ArgLys: 3.314 ± 0.588
3.704ArgLeu: 3.704 ± 0.473
0.975ArgMet: 0.975 ± 0.302
2.145ArgAsn: 2.145 ± 0.499
1.365ArgPro: 1.365 ± 0.478
1.852ArgGln: 1.852 ± 0.438
2.242ArgArg: 2.242 ± 0.587
2.242ArgSer: 2.242 ± 0.48
3.12ArgThr: 3.12 ± 0.508
3.217ArgVal: 3.217 ± 0.614
0.585ArgTrp: 0.585 ± 0.277
1.462ArgTyr: 1.462 ± 0.494
0.0ArgXaa: 0.0 ± 0.0
Ser
5.752SerAla: 5.752 ± 0.907
0.78SerCys: 0.78 ± 0.3
4.582SerAsp: 4.582 ± 0.801
3.704SerGlu: 3.704 ± 0.719
2.145SerPhe: 2.145 ± 0.505
3.802SerGly: 3.802 ± 0.584
1.17SerHis: 1.17 ± 0.452
5.069SerIle: 5.069 ± 0.635
4.972SerLys: 4.972 ± 0.725
4.874SerLeu: 4.874 ± 0.724
1.365SerMet: 1.365 ± 0.362
3.704SerAsn: 3.704 ± 0.701
1.852SerPro: 1.852 ± 0.452
2.632SerGln: 2.632 ± 0.434
2.925SerArg: 2.925 ± 0.574
3.412SerSer: 3.412 ± 0.766
3.899SerThr: 3.899 ± 0.473
4.874SerVal: 4.874 ± 0.566
1.365SerTrp: 1.365 ± 0.336
1.852SerTyr: 1.852 ± 0.427
0.0SerXaa: 0.0 ± 0.0
Thr
6.142ThrAla: 6.142 ± 0.787
0.487ThrCys: 0.487 ± 0.221
5.557ThrAsp: 5.557 ± 0.543
2.535ThrGlu: 2.535 ± 0.501
2.047ThrPhe: 2.047 ± 0.506
4.192ThrGly: 4.192 ± 0.519
0.975ThrHis: 0.975 ± 0.225
3.997ThrIle: 3.997 ± 0.433
3.704ThrLys: 3.704 ± 0.547
4.972ThrLeu: 4.972 ± 0.668
1.657ThrMet: 1.657 ± 0.546
2.632ThrAsn: 2.632 ± 0.447
2.827ThrPro: 2.827 ± 0.512
3.022ThrGln: 3.022 ± 0.452
2.047ThrArg: 2.047 ± 0.512
3.607ThrSer: 3.607 ± 0.589
2.632ThrThr: 2.632 ± 0.469
4.387ThrVal: 4.387 ± 0.649
0.975ThrTrp: 0.975 ± 0.323
1.365ThrTyr: 1.365 ± 0.409
0.0ThrXaa: 0.0 ± 0.0
Val
5.557ValAla: 5.557 ± 0.89
0.585ValCys: 0.585 ± 0.2
3.997ValAsp: 3.997 ± 0.738
4.679ValGlu: 4.679 ± 0.761
2.145ValPhe: 2.145 ± 0.523
5.167ValGly: 5.167 ± 0.746
1.462ValHis: 1.462 ± 0.457
3.412ValIle: 3.412 ± 0.567
4.484ValLys: 4.484 ± 0.543
5.069ValLeu: 5.069 ± 0.789
1.267ValMet: 1.267 ± 0.343
4.192ValAsn: 4.192 ± 0.617
2.73ValPro: 2.73 ± 0.553
3.509ValGln: 3.509 ± 0.655
2.73ValArg: 2.73 ± 0.538
3.314ValSer: 3.314 ± 0.513
3.314ValThr: 3.314 ± 0.576
4.192ValVal: 4.192 ± 0.787
0.78ValTrp: 0.78 ± 0.251
2.34ValTyr: 2.34 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
1.17TrpAla: 1.17 ± 0.291
0.0TrpCys: 0.0 ± 0.0
1.072TrpAsp: 1.072 ± 0.252
0.585TrpGlu: 0.585 ± 0.219
0.682TrpPhe: 0.682 ± 0.225
0.78TrpGly: 0.78 ± 0.323
0.585TrpHis: 0.585 ± 0.239
0.682TrpIle: 0.682 ± 0.302
0.78TrpLys: 0.78 ± 0.274
1.56TrpLeu: 1.56 ± 0.413
0.195TrpMet: 0.195 ± 0.115
0.39TrpAsn: 0.39 ± 0.239
0.877TrpPro: 0.877 ± 0.257
0.585TrpGln: 0.585 ± 0.221
1.17TrpArg: 1.17 ± 0.544
0.487TrpSer: 0.487 ± 0.278
0.78TrpThr: 0.78 ± 0.318
0.585TrpVal: 0.585 ± 0.228
0.292TrpTrp: 0.292 ± 0.196
0.487TrpTyr: 0.487 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.632TyrAla: 2.632 ± 0.469
0.487TyrCys: 0.487 ± 0.183
1.755TyrAsp: 1.755 ± 0.513
0.877TyrGlu: 0.877 ± 0.294
1.365TyrPhe: 1.365 ± 0.26
2.242TyrGly: 2.242 ± 0.453
0.682TyrHis: 0.682 ± 0.236
0.877TyrIle: 0.877 ± 0.298
1.852TyrLys: 1.852 ± 0.335
3.412TyrLeu: 3.412 ± 0.547
0.39TyrMet: 0.39 ± 0.228
1.657TyrAsn: 1.657 ± 0.283
0.877TyrPro: 0.877 ± 0.24
1.267TyrGln: 1.267 ± 0.334
1.365TyrArg: 1.365 ± 0.355
2.145TyrSer: 2.145 ± 0.589
1.755TyrThr: 1.755 ± 0.449
2.047TyrVal: 2.047 ± 0.541
1.072TyrTrp: 1.072 ± 0.315
1.267TyrTyr: 1.267 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10259 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski