Amino acid dipepetide frequency for Enterobacter phage LAU1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.806AlaAla: 10.806 ± 2.793
0.629AlaCys: 0.629 ± 0.302
5.77AlaAsp: 5.77 ± 1.03
6.609AlaGlu: 6.609 ± 0.658
2.623AlaPhe: 2.623 ± 0.476
7.973AlaGly: 7.973 ± 0.907
1.679AlaHis: 1.679 ± 0.523
5.56AlaIle: 5.56 ± 0.756
4.721AlaLys: 4.721 ± 0.75
8.183AlaLeu: 8.183 ± 1.196
3.042AlaMet: 3.042 ± 0.73
5.665AlaAsn: 5.665 ± 0.777
1.574AlaPro: 1.574 ± 0.381
5.875AlaGln: 5.875 ± 1.037
4.931AlaArg: 4.931 ± 0.804
8.078AlaSer: 8.078 ± 1.428
5.245AlaThr: 5.245 ± 0.944
6.295AlaVal: 6.295 ± 1.124
1.259AlaTrp: 1.259 ± 0.387
1.679AlaTyr: 1.679 ± 0.466
0.0AlaXaa: 0.0 ± 0.0
Cys
0.525CysAla: 0.525 ± 0.268
0.21CysCys: 0.21 ± 0.146
0.734CysAsp: 0.734 ± 0.248
0.42CysGlu: 0.42 ± 0.182
0.21CysPhe: 0.21 ± 0.117
0.839CysGly: 0.839 ± 0.39
0.42CysHis: 0.42 ± 0.222
0.21CysIle: 0.21 ± 0.153
0.734CysLys: 0.734 ± 0.252
0.629CysLeu: 0.629 ± 0.291
0.21CysMet: 0.21 ± 0.146
0.629CysAsn: 0.629 ± 0.31
0.21CysPro: 0.21 ± 0.179
0.0CysGln: 0.0 ± 0.0
0.42CysArg: 0.42 ± 0.244
0.525CysSer: 0.525 ± 0.275
0.944CysThr: 0.944 ± 0.308
0.105CysVal: 0.105 ± 0.098
0.105CysTrp: 0.105 ± 0.103
0.105CysTyr: 0.105 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
5.455AspAla: 5.455 ± 0.882
0.315AspCys: 0.315 ± 0.184
3.777AspAsp: 3.777 ± 0.685
4.616AspGlu: 4.616 ± 0.767
2.098AspPhe: 2.098 ± 0.58
3.987AspGly: 3.987 ± 0.75
0.525AspHis: 0.525 ± 0.3
3.147AspIle: 3.147 ± 0.542
1.993AspLys: 1.993 ± 0.477
4.721AspLeu: 4.721 ± 0.708
1.574AspMet: 1.574 ± 0.381
3.147AspAsn: 3.147 ± 0.509
1.993AspPro: 1.993 ± 0.347
2.098AspGln: 2.098 ± 0.498
3.462AspArg: 3.462 ± 0.728
4.301AspSer: 4.301 ± 0.885
2.833AspThr: 2.833 ± 0.684
5.141AspVal: 5.141 ± 1.025
0.42AspTrp: 0.42 ± 0.213
2.203AspTyr: 2.203 ± 0.358
0.0AspXaa: 0.0 ± 0.0
Glu
5.665GluAla: 5.665 ± 0.863
0.42GluCys: 0.42 ± 0.242
1.469GluAsp: 1.469 ± 0.295
4.091GluGlu: 4.091 ± 0.899
2.833GluPhe: 2.833 ± 0.69
4.091GluGly: 4.091 ± 0.568
0.839GluHis: 0.839 ± 0.348
4.406GluIle: 4.406 ± 0.688
3.987GluLys: 3.987 ± 0.991
5.35GluLeu: 5.35 ± 0.632
1.154GluMet: 1.154 ± 0.316
3.147GluAsn: 3.147 ± 0.552
3.357GluPro: 3.357 ± 0.666
3.882GluGln: 3.882 ± 0.85
3.987GluArg: 3.987 ± 0.649
3.252GluSer: 3.252 ± 0.611
3.462GluThr: 3.462 ± 0.526
3.672GluVal: 3.672 ± 0.651
0.734GluTrp: 0.734 ± 0.271
2.098GluTyr: 2.098 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
2.833PheAla: 2.833 ± 0.644
0.315PheCys: 0.315 ± 0.2
2.728PheAsp: 2.728 ± 0.856
1.679PheGlu: 1.679 ± 0.384
1.154PhePhe: 1.154 ± 0.423
2.098PheGly: 2.098 ± 0.416
0.734PheHis: 0.734 ± 0.327
2.098PheIle: 2.098 ± 0.724
1.993PheLys: 1.993 ± 0.635
2.203PheLeu: 2.203 ± 0.703
0.21PheMet: 0.21 ± 0.141
1.993PheAsn: 1.993 ± 0.5
1.469PhePro: 1.469 ± 0.417
1.679PheGln: 1.679 ± 0.47
2.728PheArg: 2.728 ± 0.673
3.567PheSer: 3.567 ± 0.629
1.993PheThr: 1.993 ± 0.588
2.623PheVal: 2.623 ± 0.729
0.734PheTrp: 0.734 ± 0.283
0.839PheTyr: 0.839 ± 0.346
0.0PheXaa: 0.0 ± 0.0
Gly
6.399GlyAla: 6.399 ± 1.052
0.839GlyCys: 0.839 ± 0.356
4.826GlyAsp: 4.826 ± 0.809
4.091GlyGlu: 4.091 ± 0.49
3.042GlyPhe: 3.042 ± 0.588
6.19GlyGly: 6.19 ± 0.966
1.259GlyHis: 1.259 ± 0.466
4.091GlyIle: 4.091 ± 0.786
3.777GlyLys: 3.777 ± 0.679
6.714GlyLeu: 6.714 ± 0.986
1.993GlyMet: 1.993 ± 0.44
4.301GlyAsn: 4.301 ± 0.694
2.308GlyPro: 2.308 ± 0.575
3.987GlyGln: 3.987 ± 0.541
3.672GlyArg: 3.672 ± 0.594
4.511GlySer: 4.511 ± 0.892
6.399GlyThr: 6.399 ± 1.255
5.141GlyVal: 5.141 ± 1.128
1.049GlyTrp: 1.049 ± 0.479
2.623GlyTyr: 2.623 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
1.574HisAla: 1.574 ± 0.522
0.21HisCys: 0.21 ± 0.135
0.629HisAsp: 0.629 ± 0.239
0.629HisGlu: 0.629 ± 0.311
0.315HisPhe: 0.315 ± 0.197
1.049HisGly: 1.049 ± 0.362
0.315HisHis: 0.315 ± 0.184
0.734HisIle: 0.734 ± 0.314
0.21HisLys: 0.21 ± 0.191
0.839HisLeu: 0.839 ± 0.364
0.21HisMet: 0.21 ± 0.157
0.525HisAsn: 0.525 ± 0.252
0.839HisPro: 0.839 ± 0.325
0.629HisGln: 0.629 ± 0.281
1.469HisArg: 1.469 ± 0.499
0.629HisSer: 0.629 ± 0.304
1.049HisThr: 1.049 ± 0.26
0.42HisVal: 0.42 ± 0.286
0.315HisTrp: 0.315 ± 0.199
0.42HisTyr: 0.42 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
5.141IleAla: 5.141 ± 0.792
0.105IleCys: 0.105 ± 0.089
3.567IleAsp: 3.567 ± 0.594
3.672IleGlu: 3.672 ± 0.721
2.203IlePhe: 2.203 ± 0.565
3.357IleGly: 3.357 ± 0.775
0.42IleHis: 0.42 ± 0.199
2.937IleIle: 2.937 ± 0.607
3.462IleLys: 3.462 ± 0.819
4.301IleLeu: 4.301 ± 0.874
1.364IleMet: 1.364 ± 0.422
2.728IleAsn: 2.728 ± 0.453
2.098IlePro: 2.098 ± 0.424
3.252IleGln: 3.252 ± 0.628
2.728IleArg: 2.728 ± 0.492
5.036IleSer: 5.036 ± 0.742
3.777IleThr: 3.777 ± 0.668
2.728IleVal: 2.728 ± 0.653
0.629IleTrp: 0.629 ± 0.274
1.993IleTyr: 1.993 ± 0.516
0.0IleXaa: 0.0 ± 0.0
Lys
3.987LysAla: 3.987 ± 0.699
0.105LysCys: 0.105 ± 0.109
3.147LysAsp: 3.147 ± 0.657
2.623LysGlu: 2.623 ± 0.616
0.839LysPhe: 0.839 ± 0.279
3.357LysGly: 3.357 ± 0.544
0.629LysHis: 0.629 ± 0.238
2.623LysIle: 2.623 ± 0.628
2.833LysLys: 2.833 ± 0.593
3.777LysLeu: 3.777 ± 0.841
1.259LysMet: 1.259 ± 0.322
2.833LysAsn: 2.833 ± 0.596
1.469LysPro: 1.469 ± 0.347
2.728LysGln: 2.728 ± 0.657
2.098LysArg: 2.098 ± 0.64
4.406LysSer: 4.406 ± 0.776
3.987LysThr: 3.987 ± 0.573
3.357LysVal: 3.357 ± 0.545
0.629LysTrp: 0.629 ± 0.262
1.469LysTyr: 1.469 ± 0.417
0.0LysXaa: 0.0 ± 0.0
Leu
8.498LeuAla: 8.498 ± 1.031
1.993LeuCys: 1.993 ± 0.565
4.406LeuAsp: 4.406 ± 0.685
5.56LeuGlu: 5.56 ± 0.871
2.413LeuPhe: 2.413 ± 0.507
4.826LeuGly: 4.826 ± 0.805
0.42LeuHis: 0.42 ± 0.215
4.931LeuIle: 4.931 ± 0.687
4.406LeuLys: 4.406 ± 0.65
7.239LeuLeu: 7.239 ± 1.174
1.993LeuMet: 1.993 ± 0.488
4.091LeuAsn: 4.091 ± 0.647
3.357LeuPro: 3.357 ± 0.636
3.357LeuGln: 3.357 ± 0.634
6.924LeuArg: 6.924 ± 0.964
7.554LeuSer: 7.554 ± 0.798
4.721LeuThr: 4.721 ± 0.576
3.777LeuVal: 3.777 ± 0.795
0.525LeuTrp: 0.525 ± 0.207
1.783LeuTyr: 1.783 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
3.462MetAla: 3.462 ± 0.584
0.42MetCys: 0.42 ± 0.204
1.469MetAsp: 1.469 ± 0.521
0.944MetGlu: 0.944 ± 0.368
1.049MetPhe: 1.049 ± 0.495
1.469MetGly: 1.469 ± 0.393
0.105MetHis: 0.105 ± 0.107
0.734MetIle: 0.734 ± 0.27
1.469MetLys: 1.469 ± 0.414
1.679MetLeu: 1.679 ± 0.436
0.839MetMet: 0.839 ± 0.319
1.049MetAsn: 1.049 ± 0.324
1.364MetPro: 1.364 ± 0.344
1.049MetGln: 1.049 ± 0.261
1.574MetArg: 1.574 ± 0.302
1.469MetSer: 1.469 ± 0.37
1.679MetThr: 1.679 ± 0.363
0.734MetVal: 0.734 ± 0.309
0.21MetTrp: 0.21 ± 0.201
0.629MetTyr: 0.629 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
4.826AsnAla: 4.826 ± 0.674
0.105AsnCys: 0.105 ± 0.111
2.728AsnAsp: 2.728 ± 0.472
3.042AsnGlu: 3.042 ± 0.524
1.993AsnPhe: 1.993 ± 0.495
4.826AsnGly: 4.826 ± 0.87
0.629AsnHis: 0.629 ± 0.254
3.042AsnIle: 3.042 ± 0.457
1.993AsnLys: 1.993 ± 0.323
3.462AsnLeu: 3.462 ± 0.612
1.259AsnMet: 1.259 ± 0.381
2.623AsnAsn: 2.623 ± 0.609
3.147AsnPro: 3.147 ± 0.64
2.203AsnGln: 2.203 ± 0.417
2.098AsnArg: 2.098 ± 0.497
1.993AsnSer: 1.993 ± 0.546
3.567AsnThr: 3.567 ± 0.669
2.413AsnVal: 2.413 ± 0.44
1.259AsnTrp: 1.259 ± 0.322
1.364AsnTyr: 1.364 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
4.616ProAla: 4.616 ± 0.857
0.315ProCys: 0.315 ± 0.168
1.364ProAsp: 1.364 ± 0.495
2.098ProGlu: 2.098 ± 0.451
1.049ProPhe: 1.049 ± 0.349
3.882ProGly: 3.882 ± 0.612
0.525ProHis: 0.525 ± 0.178
1.049ProIle: 1.049 ± 0.397
2.098ProLys: 2.098 ± 0.432
2.937ProLeu: 2.937 ± 0.548
1.049ProMet: 1.049 ± 0.47
1.364ProAsn: 1.364 ± 0.397
1.574ProPro: 1.574 ± 0.49
2.413ProGln: 2.413 ± 0.553
1.993ProArg: 1.993 ± 0.587
2.623ProSer: 2.623 ± 0.532
1.574ProThr: 1.574 ± 0.505
3.672ProVal: 3.672 ± 0.874
0.315ProTrp: 0.315 ± 0.173
1.469ProTyr: 1.469 ± 0.373
0.0ProXaa: 0.0 ± 0.0
Gln
6.399GlnAla: 6.399 ± 1.526
0.105GlnCys: 0.105 ± 0.115
2.203GlnAsp: 2.203 ± 0.471
2.937GlnGlu: 2.937 ± 0.734
1.993GlnPhe: 1.993 ± 0.422
3.672GlnGly: 3.672 ± 0.729
0.525GlnHis: 0.525 ± 0.293
3.357GlnIle: 3.357 ± 0.512
2.518GlnLys: 2.518 ± 0.558
4.616GlnLeu: 4.616 ± 0.728
0.629GlnMet: 0.629 ± 0.33
3.042GlnAsn: 3.042 ± 0.689
1.993GlnPro: 1.993 ± 0.674
1.888GlnGln: 1.888 ± 0.557
3.042GlnArg: 3.042 ± 0.62
2.623GlnSer: 2.623 ± 0.522
2.833GlnThr: 2.833 ± 0.713
3.357GlnVal: 3.357 ± 0.643
0.315GlnTrp: 0.315 ± 0.182
1.154GlnTyr: 1.154 ± 0.424
0.0GlnXaa: 0.0 ± 0.0
Arg
5.455ArgAla: 5.455 ± 1.03
0.42ArgCys: 0.42 ± 0.181
4.091ArgAsp: 4.091 ± 0.608
3.777ArgGlu: 3.777 ± 0.779
2.623ArgPhe: 2.623 ± 0.46
3.252ArgGly: 3.252 ± 0.493
0.944ArgHis: 0.944 ± 0.402
2.308ArgIle: 2.308 ± 0.611
3.042ArgLys: 3.042 ± 0.662
6.819ArgLeu: 6.819 ± 1.085
1.574ArgMet: 1.574 ± 0.395
2.833ArgAsn: 2.833 ± 0.499
1.259ArgPro: 1.259 ± 0.404
3.567ArgGln: 3.567 ± 0.613
3.252ArgArg: 3.252 ± 0.717
2.833ArgSer: 2.833 ± 0.528
3.567ArgThr: 3.567 ± 0.432
4.406ArgVal: 4.406 ± 1.019
0.944ArgTrp: 0.944 ± 0.286
2.413ArgTyr: 2.413 ± 0.551
0.0ArgXaa: 0.0 ± 0.0
Ser
7.029SerAla: 7.029 ± 1.15
0.105SerCys: 0.105 ± 0.131
4.721SerAsp: 4.721 ± 0.992
4.826SerGlu: 4.826 ± 0.715
1.783SerPhe: 1.783 ± 0.57
8.183SerGly: 8.183 ± 1.073
0.525SerHis: 0.525 ± 0.276
3.147SerIle: 3.147 ± 0.461
1.993SerLys: 1.993 ± 0.572
6.504SerLeu: 6.504 ± 0.831
1.679SerMet: 1.679 ± 0.486
2.728SerAsn: 2.728 ± 0.565
3.042SerPro: 3.042 ± 0.585
3.672SerGln: 3.672 ± 0.796
4.616SerArg: 4.616 ± 0.817
4.826SerSer: 4.826 ± 0.817
3.672SerThr: 3.672 ± 0.634
4.931SerVal: 4.931 ± 0.93
1.574SerTrp: 1.574 ± 0.396
1.888SerTyr: 1.888 ± 0.445
0.0SerXaa: 0.0 ± 0.0
Thr
6.714ThrAla: 6.714 ± 1.123
0.21ThrCys: 0.21 ± 0.165
3.672ThrAsp: 3.672 ± 0.662
3.882ThrGlu: 3.882 ± 0.642
3.462ThrPhe: 3.462 ± 0.651
5.98ThrGly: 5.98 ± 0.659
0.839ThrHis: 0.839 ± 0.293
4.511ThrIle: 4.511 ± 0.804
1.679ThrLys: 1.679 ± 0.544
4.721ThrLeu: 4.721 ± 0.789
1.154ThrMet: 1.154 ± 0.345
1.679ThrAsn: 1.679 ± 0.437
2.413ThrPro: 2.413 ± 0.632
2.413ThrGln: 2.413 ± 0.358
3.357ThrArg: 3.357 ± 0.578
4.826ThrSer: 4.826 ± 0.622
4.196ThrThr: 4.196 ± 0.907
4.616ThrVal: 4.616 ± 0.702
1.259ThrTrp: 1.259 ± 0.331
1.364ThrTyr: 1.364 ± 0.362
0.0ThrXaa: 0.0 ± 0.0
Val
4.931ValAla: 4.931 ± 0.587
0.629ValCys: 0.629 ± 0.256
4.091ValAsp: 4.091 ± 0.694
3.567ValGlu: 3.567 ± 0.461
2.833ValPhe: 2.833 ± 0.616
4.511ValGly: 4.511 ± 0.931
0.839ValHis: 0.839 ± 0.334
3.987ValIle: 3.987 ± 0.532
3.147ValLys: 3.147 ± 0.492
5.245ValLeu: 5.245 ± 0.772
1.154ValMet: 1.154 ± 0.296
2.833ValAsn: 2.833 ± 0.868
3.147ValPro: 3.147 ± 0.475
2.413ValGln: 2.413 ± 0.447
3.462ValArg: 3.462 ± 0.748
5.141ValSer: 5.141 ± 0.613
5.141ValThr: 5.141 ± 0.906
4.616ValVal: 4.616 ± 0.866
1.364ValTrp: 1.364 ± 0.435
2.203ValTyr: 2.203 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
0.944TrpAla: 0.944 ± 0.311
0.42TrpCys: 0.42 ± 0.249
0.734TrpAsp: 0.734 ± 0.334
0.944TrpGlu: 0.944 ± 0.296
0.315TrpPhe: 0.315 ± 0.173
1.049TrpGly: 1.049 ± 0.288
0.734TrpHis: 0.734 ± 0.274
0.944TrpIle: 0.944 ± 0.274
0.734TrpLys: 0.734 ± 0.388
0.525TrpLeu: 0.525 ± 0.247
0.525TrpMet: 0.525 ± 0.216
0.315TrpAsn: 0.315 ± 0.155
0.629TrpPro: 0.629 ± 0.234
0.734TrpGln: 0.734 ± 0.314
1.259TrpArg: 1.259 ± 0.3
0.525TrpSer: 0.525 ± 0.236
0.734TrpThr: 0.734 ± 0.251
1.154TrpVal: 1.154 ± 0.301
0.315TrpTrp: 0.315 ± 0.154
0.734TrpTyr: 0.734 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.937TyrAla: 2.937 ± 0.514
0.315TyrCys: 0.315 ± 0.157
1.679TyrAsp: 1.679 ± 0.357
1.888TyrGlu: 1.888 ± 0.466
0.839TyrPhe: 0.839 ± 0.346
2.623TyrGly: 2.623 ± 0.512
0.105TyrHis: 0.105 ± 0.135
1.679TyrIle: 1.679 ± 0.448
1.364TyrLys: 1.364 ± 0.491
2.308TyrLeu: 2.308 ± 0.665
0.42TyrMet: 0.42 ± 0.274
1.049TyrAsn: 1.049 ± 0.412
0.734TyrPro: 0.734 ± 0.229
1.364TyrGln: 1.364 ± 0.364
2.308TyrArg: 2.308 ± 0.375
2.728TyrSer: 2.728 ± 0.47
1.574TyrThr: 1.574 ± 0.473
2.203TyrVal: 2.203 ± 0.545
0.315TyrTrp: 0.315 ± 0.192
0.734TyrTyr: 0.734 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (9533 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski