Amino acid dipepetide frequency for Streptococcus phage 31B4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.501AlaAla: 2.501 ± 0.775
0.089AlaCys: 0.089 ± 0.099
3.752AlaAsp: 3.752 ± 0.549
3.126AlaGlu: 3.126 ± 0.403
2.144AlaPhe: 2.144 ± 0.655
4.288AlaGly: 4.288 ± 0.654
0.715AlaHis: 0.715 ± 0.251
4.913AlaIle: 4.913 ± 0.701
6.789AlaLys: 6.789 ± 0.961
6.163AlaLeu: 6.163 ± 0.661
1.519AlaMet: 1.519 ± 0.375
4.556AlaAsn: 4.556 ± 0.867
2.322AlaPro: 2.322 ± 0.427
2.59AlaGln: 2.59 ± 0.479
2.68AlaArg: 2.68 ± 0.478
4.556AlaSer: 4.556 ± 0.691
4.824AlaThr: 4.824 ± 0.731
3.93AlaVal: 3.93 ± 0.58
1.072AlaTrp: 1.072 ± 0.253
1.787AlaTyr: 1.787 ± 0.324
0.0AlaXaa: 0.0 ± 0.0
Cys
0.179CysAla: 0.179 ± 0.108
0.0CysCys: 0.0 ± 0.0
0.715CysAsp: 0.715 ± 0.305
0.268CysGlu: 0.268 ± 0.163
0.179CysPhe: 0.179 ± 0.126
0.089CysGly: 0.089 ± 0.09
0.089CysHis: 0.089 ± 0.088
0.179CysIle: 0.179 ± 0.131
0.357CysLys: 0.357 ± 0.229
0.268CysLeu: 0.268 ± 0.194
0.0CysMet: 0.0 ± 0.0
0.268CysAsn: 0.268 ± 0.145
0.179CysPro: 0.179 ± 0.132
0.179CysGln: 0.179 ± 0.135
0.357CysArg: 0.357 ± 0.226
0.179CysSer: 0.179 ± 0.132
0.268CysThr: 0.268 ± 0.161
0.357CysVal: 0.357 ± 0.134
0.179CysTrp: 0.179 ± 0.124
0.179CysTyr: 0.179 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
3.841AspAla: 3.841 ± 0.517
0.268AspCys: 0.268 ± 0.159
4.109AspAsp: 4.109 ± 0.5
4.824AspGlu: 4.824 ± 0.622
4.466AspPhe: 4.466 ± 0.782
5.985AspGly: 5.985 ± 0.808
0.893AspHis: 0.893 ± 0.341
4.824AspIle: 4.824 ± 0.618
5.538AspLys: 5.538 ± 0.644
4.198AspLeu: 4.198 ± 0.727
2.59AspMet: 2.59 ± 0.491
4.466AspAsn: 4.466 ± 0.605
2.501AspPro: 2.501 ± 0.406
1.161AspGln: 1.161 ± 0.272
2.769AspArg: 2.769 ± 0.416
3.037AspSer: 3.037 ± 0.407
3.573AspThr: 3.573 ± 0.57
3.662AspVal: 3.662 ± 0.709
0.893AspTrp: 0.893 ± 0.241
2.948AspTyr: 2.948 ± 0.448
0.0AspXaa: 0.0 ± 0.0
Glu
4.645GluAla: 4.645 ± 0.486
0.447GluCys: 0.447 ± 0.159
3.573GluAsp: 3.573 ± 0.5
4.02GluGlu: 4.02 ± 0.772
2.59GluPhe: 2.59 ± 0.56
3.394GluGly: 3.394 ± 0.456
1.072GluHis: 1.072 ± 0.331
6.074GluIle: 6.074 ± 0.882
4.02GluLys: 4.02 ± 1.092
6.163GluLeu: 6.163 ± 0.816
2.144GluMet: 2.144 ± 0.501
4.109GluAsn: 4.109 ± 0.667
2.054GluPro: 2.054 ± 0.559
3.126GluGln: 3.126 ± 0.518
3.126GluArg: 3.126 ± 0.49
3.305GluSer: 3.305 ± 0.392
3.573GluThr: 3.573 ± 0.635
4.198GluVal: 4.198 ± 0.622
1.072GluTrp: 1.072 ± 0.224
3.305GluTyr: 3.305 ± 0.556
0.089GluXaa: 0.089 ± 0.072
Phe
3.573PheAla: 3.573 ± 0.507
0.179PheCys: 0.179 ± 0.123
3.93PheAsp: 3.93 ± 0.566
2.054PheGlu: 2.054 ± 0.567
2.054PhePhe: 2.054 ± 0.357
3.394PheGly: 3.394 ± 0.654
0.625PheHis: 0.625 ± 0.227
2.501PheIle: 2.501 ± 0.527
4.466PheLys: 4.466 ± 0.578
2.858PheLeu: 2.858 ± 0.512
0.804PheMet: 0.804 ± 0.264
3.752PheAsn: 3.752 ± 0.758
0.447PhePro: 0.447 ± 0.16
1.34PheGln: 1.34 ± 0.266
1.519PheArg: 1.519 ± 0.312
2.858PheSer: 2.858 ± 0.43
3.037PheThr: 3.037 ± 0.515
2.769PheVal: 2.769 ± 0.384
0.804PheTrp: 0.804 ± 0.25
1.697PheTyr: 1.697 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
3.216GlyAla: 3.216 ± 0.797
0.447GlyCys: 0.447 ± 0.211
4.824GlyAsp: 4.824 ± 0.722
4.466GlyGlu: 4.466 ± 0.691
3.216GlyPhe: 3.216 ± 0.471
5.181GlyGly: 5.181 ± 0.839
0.893GlyHis: 0.893 ± 0.246
5.36GlyIle: 5.36 ± 0.633
5.895GlyLys: 5.895 ± 0.651
5.806GlyLeu: 5.806 ± 0.74
1.429GlyMet: 1.429 ± 0.309
3.662GlyAsn: 3.662 ± 0.587
1.072GlyPro: 1.072 ± 0.28
2.948GlyGln: 2.948 ± 0.727
2.769GlyArg: 2.769 ± 0.512
4.556GlySer: 4.556 ± 0.713
4.645GlyThr: 4.645 ± 0.762
3.93GlyVal: 3.93 ± 0.765
1.251GlyTrp: 1.251 ± 0.315
2.68GlyTyr: 2.68 ± 0.397
0.0GlyXaa: 0.0 ± 0.0
His
0.268HisAla: 0.268 ± 0.136
0.0HisCys: 0.0 ± 0.0
0.983HisAsp: 0.983 ± 0.287
0.447HisGlu: 0.447 ± 0.222
0.536HisPhe: 0.536 ± 0.229
1.072HisGly: 1.072 ± 0.28
0.447HisHis: 0.447 ± 0.164
0.804HisIle: 0.804 ± 0.316
0.893HisLys: 0.893 ± 0.216
1.608HisLeu: 1.608 ± 0.328
0.447HisMet: 0.447 ± 0.227
0.893HisAsn: 0.893 ± 0.328
0.893HisPro: 0.893 ± 0.32
0.715HisGln: 0.715 ± 0.244
0.804HisArg: 0.804 ± 0.257
0.625HisSer: 0.625 ± 0.165
0.715HisThr: 0.715 ± 0.227
1.608HisVal: 1.608 ± 0.273
0.089HisTrp: 0.089 ± 0.095
0.804HisTyr: 0.804 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
5.36IleAla: 5.36 ± 0.99
0.536IleCys: 0.536 ± 0.209
5.181IleAsp: 5.181 ± 0.608
5.002IleGlu: 5.002 ± 0.85
1.965IlePhe: 1.965 ± 0.438
4.377IleGly: 4.377 ± 0.528
0.715IleHis: 0.715 ± 0.241
4.02IleIle: 4.02 ± 0.71
6.699IleLys: 6.699 ± 0.747
3.394IleLeu: 3.394 ± 0.541
1.965IleMet: 1.965 ± 0.453
4.109IleAsn: 4.109 ± 0.437
3.573IlePro: 3.573 ± 0.526
2.68IleGln: 2.68 ± 0.423
2.948IleArg: 2.948 ± 0.512
4.377IleSer: 4.377 ± 0.453
3.305IleThr: 3.305 ± 0.497
2.858IleVal: 2.858 ± 0.628
1.072IleTrp: 1.072 ± 0.234
2.412IleTyr: 2.412 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
5.181LysAla: 5.181 ± 0.614
0.268LysCys: 0.268 ± 0.164
5.002LysAsp: 5.002 ± 0.702
6.699LysGlu: 6.699 ± 0.915
3.93LysPhe: 3.93 ± 0.69
4.913LysGly: 4.913 ± 0.578
1.161LysHis: 1.161 ± 0.418
4.824LysIle: 4.824 ± 0.635
7.503LysLys: 7.503 ± 1.281
7.146LysLeu: 7.146 ± 0.795
2.322LysMet: 2.322 ± 0.477
5.538LysAsn: 5.538 ± 0.692
3.216LysPro: 3.216 ± 0.444
3.394LysGln: 3.394 ± 0.626
3.662LysArg: 3.662 ± 0.486
4.198LysSer: 4.198 ± 0.535
4.913LysThr: 4.913 ± 0.635
4.288LysVal: 4.288 ± 0.551
0.893LysTrp: 0.893 ± 0.264
3.394LysTyr: 3.394 ± 0.866
0.0LysXaa: 0.0 ± 0.0
Leu
7.146LeuAla: 7.146 ± 0.765
0.447LeuCys: 0.447 ± 0.219
5.628LeuAsp: 5.628 ± 0.515
6.253LeuGlu: 6.253 ± 0.885
2.948LeuPhe: 2.948 ± 0.404
5.538LeuGly: 5.538 ± 1.017
0.715LeuHis: 0.715 ± 0.303
4.288LeuIle: 4.288 ± 0.595
6.431LeuLys: 6.431 ± 0.648
5.092LeuLeu: 5.092 ± 0.684
2.322LeuMet: 2.322 ± 0.336
5.538LeuAsn: 5.538 ± 0.815
3.126LeuPro: 3.126 ± 0.506
2.233LeuGln: 2.233 ± 0.45
3.394LeuArg: 3.394 ± 0.766
4.734LeuSer: 4.734 ± 0.554
5.36LeuThr: 5.36 ± 0.844
4.109LeuVal: 4.109 ± 0.567
0.715LeuTrp: 0.715 ± 0.275
1.965LeuTyr: 1.965 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
1.876MetAla: 1.876 ± 0.277
0.0MetCys: 0.0 ± 0.0
0.983MetAsp: 0.983 ± 0.214
1.608MetGlu: 1.608 ± 0.456
1.787MetPhe: 1.787 ± 0.29
1.251MetGly: 1.251 ± 0.317
0.268MetHis: 0.268 ± 0.148
1.519MetIle: 1.519 ± 0.328
2.68MetLys: 2.68 ± 0.49
1.697MetLeu: 1.697 ± 0.277
0.357MetMet: 0.357 ± 0.183
1.072MetAsn: 1.072 ± 0.278
0.893MetPro: 0.893 ± 0.197
1.072MetGln: 1.072 ± 0.359
0.804MetArg: 0.804 ± 0.191
1.965MetSer: 1.965 ± 0.45
1.519MetThr: 1.519 ± 0.266
2.054MetVal: 2.054 ± 0.434
0.447MetTrp: 0.447 ± 0.185
0.893MetTyr: 0.893 ± 0.297
0.0MetXaa: 0.0 ± 0.0
Asn
5.092AsnAla: 5.092 ± 1.106
0.179AsnCys: 0.179 ± 0.126
3.484AsnAsp: 3.484 ± 0.458
3.841AsnGlu: 3.841 ± 0.635
2.948AsnPhe: 2.948 ± 0.515
8.039AsnGly: 8.039 ± 1.149
1.429AsnHis: 1.429 ± 0.367
3.662AsnIle: 3.662 ± 0.484
3.752AsnLys: 3.752 ± 0.48
5.628AsnLeu: 5.628 ± 0.597
1.161AsnMet: 1.161 ± 0.31
4.824AsnAsn: 4.824 ± 0.836
2.858AsnPro: 2.858 ± 0.442
2.948AsnGln: 2.948 ± 0.436
2.054AsnArg: 2.054 ± 0.53
3.841AsnSer: 3.841 ± 0.638
3.305AsnThr: 3.305 ± 0.594
3.394AsnVal: 3.394 ± 0.552
1.251AsnTrp: 1.251 ± 0.317
1.965AsnTyr: 1.965 ± 0.425
0.0AsnXaa: 0.0 ± 0.0
Pro
1.787ProAla: 1.787 ± 0.305
0.179ProCys: 0.179 ± 0.129
1.876ProAsp: 1.876 ± 0.417
2.769ProGlu: 2.769 ± 0.446
1.251ProPhe: 1.251 ± 0.305
1.161ProGly: 1.161 ± 0.336
0.625ProHis: 0.625 ± 0.233
1.697ProIle: 1.697 ± 0.323
3.484ProLys: 3.484 ± 0.568
2.858ProLeu: 2.858 ± 0.352
0.268ProMet: 0.268 ± 0.15
2.68ProAsn: 2.68 ± 0.544
0.804ProPro: 0.804 ± 0.316
1.34ProGln: 1.34 ± 0.342
1.072ProArg: 1.072 ± 0.366
3.126ProSer: 3.126 ± 0.593
2.412ProThr: 2.412 ± 0.392
1.965ProVal: 1.965 ± 0.431
0.447ProTrp: 0.447 ± 0.164
0.983ProTyr: 0.983 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
3.484GlnAla: 3.484 ± 0.573
0.179GlnCys: 0.179 ± 0.118
1.697GlnAsp: 1.697 ± 0.348
2.858GlnGlu: 2.858 ± 0.476
1.429GlnPhe: 1.429 ± 0.361
3.305GlnGly: 3.305 ± 0.669
0.447GlnHis: 0.447 ± 0.21
2.769GlnIle: 2.769 ± 0.675
3.037GlnLys: 3.037 ± 0.569
3.037GlnLeu: 3.037 ± 0.394
1.251GlnMet: 1.251 ± 0.319
2.68GlnAsn: 2.68 ± 0.468
0.625GlnPro: 0.625 ± 0.251
2.322GlnGln: 2.322 ± 0.505
1.519GlnArg: 1.519 ± 0.373
2.501GlnSer: 2.501 ± 0.442
2.858GlnThr: 2.858 ± 0.505
1.965GlnVal: 1.965 ± 0.513
0.536GlnTrp: 0.536 ± 0.239
1.519GlnTyr: 1.519 ± 0.37
0.0GlnXaa: 0.0 ± 0.0
Arg
1.876ArgAla: 1.876 ± 0.348
0.089ArgCys: 0.089 ± 0.102
2.501ArgAsp: 2.501 ± 0.386
2.59ArgGlu: 2.59 ± 0.539
2.322ArgPhe: 2.322 ± 0.351
2.501ArgGly: 2.501 ± 0.443
0.804ArgHis: 0.804 ± 0.28
3.305ArgIle: 3.305 ± 0.729
2.59ArgLys: 2.59 ± 0.514
3.752ArgLeu: 3.752 ± 0.647
1.34ArgMet: 1.34 ± 0.323
2.769ArgAsn: 2.769 ± 0.415
1.161ArgPro: 1.161 ± 0.243
2.054ArgGln: 2.054 ± 0.339
1.34ArgArg: 1.34 ± 0.301
2.144ArgSer: 2.144 ± 0.413
2.59ArgThr: 2.59 ± 0.659
2.769ArgVal: 2.769 ± 0.462
1.251ArgTrp: 1.251 ± 0.306
1.965ArgTyr: 1.965 ± 0.505
0.0ArgXaa: 0.0 ± 0.0
Ser
3.573SerAla: 3.573 ± 0.537
0.357SerCys: 0.357 ± 0.249
4.377SerAsp: 4.377 ± 0.393
3.662SerGlu: 3.662 ± 0.42
2.68SerPhe: 2.68 ± 0.393
3.93SerGly: 3.93 ± 0.554
0.447SerHis: 0.447 ± 0.187
4.913SerIle: 4.913 ± 0.652
4.913SerLys: 4.913 ± 0.733
4.377SerLeu: 4.377 ± 0.585
2.054SerMet: 2.054 ± 0.363
4.288SerAsn: 4.288 ± 0.703
2.054SerPro: 2.054 ± 0.309
2.769SerGln: 2.769 ± 0.588
3.037SerArg: 3.037 ± 0.616
3.662SerSer: 3.662 ± 0.561
4.109SerThr: 4.109 ± 0.767
4.734SerVal: 4.734 ± 0.766
0.893SerTrp: 0.893 ± 0.34
1.876SerTyr: 1.876 ± 0.421
0.0SerXaa: 0.0 ± 0.0
Thr
4.109ThrAla: 4.109 ± 0.69
0.268ThrCys: 0.268 ± 0.142
4.466ThrAsp: 4.466 ± 0.642
3.573ThrGlu: 3.573 ± 0.475
3.662ThrPhe: 3.662 ± 0.53
3.662ThrGly: 3.662 ± 0.469
1.429ThrHis: 1.429 ± 0.328
3.841ThrIle: 3.841 ± 0.571
5.181ThrLys: 5.181 ± 0.615
6.074ThrLeu: 6.074 ± 0.727
0.983ThrMet: 0.983 ± 0.273
3.573ThrAsn: 3.573 ± 0.488
1.697ThrPro: 1.697 ± 0.372
2.322ThrGln: 2.322 ± 0.478
1.429ThrArg: 1.429 ± 0.252
3.93ThrSer: 3.93 ± 0.555
3.394ThrThr: 3.394 ± 0.637
4.466ThrVal: 4.466 ± 0.543
0.893ThrTrp: 0.893 ± 0.265
3.037ThrTyr: 3.037 ± 0.592
0.0ThrXaa: 0.0 ± 0.0
Val
3.841ValAla: 3.841 ± 0.442
0.179ValCys: 0.179 ± 0.11
5.538ValAsp: 5.538 ± 0.596
4.198ValGlu: 4.198 ± 0.63
1.965ValPhe: 1.965 ± 0.431
3.662ValGly: 3.662 ± 0.594
0.625ValHis: 0.625 ± 0.211
4.109ValIle: 4.109 ± 0.451
4.913ValLys: 4.913 ± 0.651
3.484ValLeu: 3.484 ± 0.643
1.072ValMet: 1.072 ± 0.239
4.109ValAsn: 4.109 ± 0.677
1.787ValPro: 1.787 ± 0.346
1.787ValGln: 1.787 ± 0.42
3.037ValArg: 3.037 ± 0.567
4.824ValSer: 4.824 ± 0.6
5.002ValThr: 5.002 ± 0.827
3.662ValVal: 3.662 ± 0.522
1.072ValTrp: 1.072 ± 0.309
1.965ValTyr: 1.965 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
0.447TrpAla: 0.447 ± 0.196
0.089TrpCys: 0.089 ± 0.088
1.608TrpAsp: 1.608 ± 0.423
0.804TrpGlu: 0.804 ± 0.228
0.804TrpPhe: 0.804 ± 0.285
0.715TrpGly: 0.715 ± 0.237
0.536TrpHis: 0.536 ± 0.193
0.804TrpIle: 0.804 ± 0.224
0.804TrpLys: 0.804 ± 0.25
1.34TrpLeu: 1.34 ± 0.345
0.0TrpMet: 0.0 ± 0.0
1.072TrpAsn: 1.072 ± 0.303
0.089TrpPro: 0.089 ± 0.094
0.804TrpGln: 0.804 ± 0.267
0.804TrpArg: 0.804 ± 0.218
1.787TrpSer: 1.787 ± 0.609
0.625TrpThr: 0.625 ± 0.175
1.608TrpVal: 1.608 ± 0.262
0.447TrpTrp: 0.447 ± 0.202
0.268TrpTyr: 0.268 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.322TyrAla: 2.322 ± 0.367
0.268TyrCys: 0.268 ± 0.226
2.412TyrAsp: 2.412 ± 0.349
3.037TyrGlu: 3.037 ± 0.527
1.787TyrPhe: 1.787 ± 0.423
1.697TyrGly: 1.697 ± 0.453
0.715TyrHis: 0.715 ± 0.191
2.144TyrIle: 2.144 ± 0.422
2.501TyrLys: 2.501 ± 0.404
3.037TyrLeu: 3.037 ± 0.502
0.625TyrMet: 0.625 ± 0.217
1.697TyrAsn: 1.697 ± 0.352
1.429TyrPro: 1.429 ± 0.416
2.233TyrGln: 2.233 ± 0.363
2.59TyrArg: 2.59 ± 0.618
2.501TyrSer: 2.501 ± 0.584
1.965TyrThr: 1.965 ± 0.435
2.501TyrVal: 2.501 ± 0.447
0.179TyrTrp: 0.179 ± 0.119
2.412TyrTyr: 2.412 ± 0.59
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.089XaaGly: 0.089 ± 0.072
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (11196 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski