Amino acid dipepetide frequency for Flavobacterium phage vB_FspM_pippi8-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.543AlaAla: 4.543 ± 0.742
0.624AlaCys: 0.624 ± 0.293
2.227AlaAsp: 2.227 ± 0.379
6.414AlaGlu: 6.414 ± 0.786
4.187AlaPhe: 4.187 ± 0.723
6.682AlaGly: 6.682 ± 1.167
0.713AlaHis: 0.713 ± 0.243
4.276AlaIle: 4.276 ± 0.723
6.236AlaLys: 6.236 ± 0.877
7.038AlaLeu: 7.038 ± 0.948
1.158AlaMet: 1.158 ± 0.376
3.385AlaAsn: 3.385 ± 0.595
2.405AlaPro: 2.405 ± 0.397
2.316AlaGln: 2.316 ± 0.498
2.049AlaArg: 2.049 ± 0.479
4.365AlaSer: 4.365 ± 0.637
4.365AlaThr: 4.365 ± 0.864
5.434AlaVal: 5.434 ± 0.853
0.98AlaTrp: 0.98 ± 0.377
2.762AlaTyr: 2.762 ± 0.497
0.0AlaXaa: 0.0 ± 0.0
Cys
0.624CysAla: 0.624 ± 0.21
0.089CysCys: 0.089 ± 0.082
0.267CysAsp: 0.267 ± 0.138
0.891CysGlu: 0.891 ± 0.378
0.891CysPhe: 0.891 ± 0.251
0.356CysGly: 0.356 ± 0.221
0.089CysHis: 0.089 ± 0.09
0.891CysIle: 0.891 ± 0.296
1.069CysLys: 1.069 ± 0.298
0.713CysLeu: 0.713 ± 0.267
0.089CysMet: 0.089 ± 0.076
0.535CysAsn: 0.535 ± 0.201
0.178CysPro: 0.178 ± 0.125
0.356CysGln: 0.356 ± 0.177
0.178CysArg: 0.178 ± 0.126
0.445CysSer: 0.445 ± 0.259
0.445CysThr: 0.445 ± 0.169
0.535CysVal: 0.535 ± 0.249
0.089CysTrp: 0.089 ± 0.098
0.356CysTyr: 0.356 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
4.989AspAla: 4.989 ± 0.645
0.178AspCys: 0.178 ± 0.15
1.693AspAsp: 1.693 ± 0.36
4.098AspGlu: 4.098 ± 0.624
3.474AspPhe: 3.474 ± 0.483
4.276AspGly: 4.276 ± 0.687
0.535AspHis: 0.535 ± 0.23
4.276AspIle: 4.276 ± 0.561
4.276AspLys: 4.276 ± 0.477
4.811AspLeu: 4.811 ± 0.689
0.98AspMet: 0.98 ± 0.359
2.851AspAsn: 2.851 ± 0.464
2.316AspPro: 2.316 ± 0.422
1.514AspGln: 1.514 ± 0.314
1.336AspArg: 1.336 ± 0.396
1.96AspSer: 1.96 ± 0.36
3.92AspThr: 3.92 ± 0.532
3.296AspVal: 3.296 ± 0.502
0.445AspTrp: 0.445 ± 0.181
2.227AspTyr: 2.227 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
7.394GluAla: 7.394 ± 0.865
0.713GluCys: 0.713 ± 0.337
4.365GluAsp: 4.365 ± 0.736
6.592GluGlu: 6.592 ± 0.894
5.523GluPhe: 5.523 ± 0.573
4.633GluGly: 4.633 ± 0.602
0.98GluHis: 0.98 ± 0.3
8.998GluIle: 8.998 ± 1.058
7.572GluLys: 7.572 ± 1.109
6.236GluLeu: 6.236 ± 0.698
2.138GluMet: 2.138 ± 0.447
5.88GluAsn: 5.88 ± 0.907
1.247GluPro: 1.247 ± 0.391
3.831GluGln: 3.831 ± 0.491
2.673GluArg: 2.673 ± 0.451
1.514GluSer: 1.514 ± 0.308
3.118GluThr: 3.118 ± 0.474
5.88GluVal: 5.88 ± 0.775
0.356GluTrp: 0.356 ± 0.215
1.514GluTyr: 1.514 ± 0.349
0.0GluXaa: 0.0 ± 0.0
Phe
3.742PheAla: 3.742 ± 0.649
0.624PheCys: 0.624 ± 0.188
3.92PheAsp: 3.92 ± 0.55
4.009PheGlu: 4.009 ± 0.616
2.584PhePhe: 2.584 ± 0.578
3.563PheGly: 3.563 ± 0.527
0.713PheHis: 0.713 ± 0.262
4.276PheIle: 4.276 ± 0.489
4.276PheLys: 4.276 ± 0.658
4.722PheLeu: 4.722 ± 0.835
1.158PheMet: 1.158 ± 0.327
3.029PheAsn: 3.029 ± 0.477
1.514PhePro: 1.514 ± 0.372
1.871PheGln: 1.871 ± 0.368
2.494PheArg: 2.494 ± 0.388
3.831PheSer: 3.831 ± 0.686
3.118PheThr: 3.118 ± 0.513
3.563PheVal: 3.563 ± 0.551
0.713PheTrp: 0.713 ± 0.228
2.94PheTyr: 2.94 ± 0.536
0.0PheXaa: 0.0 ± 0.0
Gly
4.454GlyAla: 4.454 ± 0.887
0.445GlyCys: 0.445 ± 0.246
3.92GlyAsp: 3.92 ± 0.735
4.009GlyGlu: 4.009 ± 0.644
4.989GlyPhe: 4.989 ± 0.784
4.633GlyGly: 4.633 ± 0.768
0.891GlyHis: 0.891 ± 0.279
3.92GlyIle: 3.92 ± 0.688
5.523GlyLys: 5.523 ± 0.542
5.167GlyLeu: 5.167 ± 0.734
0.624GlyMet: 0.624 ± 0.224
2.405GlyAsn: 2.405 ± 0.541
0.0GlyPro: 0.0 ± 0.0
1.604GlyGln: 1.604 ± 0.344
1.514GlyArg: 1.514 ± 0.388
3.385GlySer: 3.385 ± 0.527
4.811GlyThr: 4.811 ± 0.71
4.9GlyVal: 4.9 ± 0.643
0.802GlyTrp: 0.802 ± 0.266
2.673GlyTyr: 2.673 ± 0.511
0.0GlyXaa: 0.0 ± 0.0
His
0.713HisAla: 0.713 ± 0.208
0.178HisCys: 0.178 ± 0.116
0.356HisAsp: 0.356 ± 0.171
0.624HisGlu: 0.624 ± 0.22
0.891HisPhe: 0.891 ± 0.27
0.802HisGly: 0.802 ± 0.274
0.178HisHis: 0.178 ± 0.125
0.98HisIle: 0.98 ± 0.289
1.247HisLys: 1.247 ± 0.275
1.247HisLeu: 1.247 ± 0.366
0.089HisMet: 0.089 ± 0.078
0.802HisAsn: 0.802 ± 0.269
0.445HisPro: 0.445 ± 0.202
0.178HisGln: 0.178 ± 0.134
0.535HisArg: 0.535 ± 0.19
0.624HisSer: 0.624 ± 0.222
0.713HisThr: 0.713 ± 0.285
0.713HisVal: 0.713 ± 0.27
0.267HisTrp: 0.267 ± 0.134
0.445HisTyr: 0.445 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
4.365IleAla: 4.365 ± 0.7
0.891IleCys: 0.891 ± 0.305
4.811IleAsp: 4.811 ± 0.627
7.84IleGlu: 7.84 ± 0.965
3.563IlePhe: 3.563 ± 0.629
3.653IleGly: 3.653 ± 0.68
0.624IleHis: 0.624 ± 0.264
6.414IleIle: 6.414 ± 0.752
8.285IleLys: 8.285 ± 0.931
5.078IleLeu: 5.078 ± 0.59
0.98IleMet: 0.98 ± 0.295
4.633IleAsn: 4.633 ± 0.658
2.316IlePro: 2.316 ± 0.447
3.118IleGln: 3.118 ± 0.577
2.584IleArg: 2.584 ± 0.448
6.771IleSer: 6.771 ± 0.878
4.989IleThr: 4.989 ± 0.532
3.742IleVal: 3.742 ± 0.531
0.445IleTrp: 0.445 ± 0.233
2.673IleTyr: 2.673 ± 0.618
0.0IleXaa: 0.0 ± 0.0
Lys
7.038LysAla: 7.038 ± 0.864
0.356LysCys: 0.356 ± 0.174
6.147LysAsp: 6.147 ± 0.802
7.394LysGlu: 7.394 ± 0.998
3.296LysPhe: 3.296 ± 0.537
4.633LysGly: 4.633 ± 0.664
1.693LysHis: 1.693 ± 0.331
7.661LysIle: 7.661 ± 0.938
8.463LysLys: 8.463 ± 1.274
6.414LysLeu: 6.414 ± 0.908
2.227LysMet: 2.227 ± 0.359
6.236LysAsn: 6.236 ± 0.737
2.227LysPro: 2.227 ± 0.474
3.118LysGln: 3.118 ± 0.576
3.742LysArg: 3.742 ± 0.578
3.92LysSer: 3.92 ± 0.633
7.305LysThr: 7.305 ± 0.733
5.345LysVal: 5.345 ± 0.758
0.445LysTrp: 0.445 ± 0.192
2.94LysTyr: 2.94 ± 0.642
0.0LysXaa: 0.0 ± 0.0
Leu
4.989LeuAla: 4.989 ± 0.799
0.802LeuCys: 0.802 ± 0.297
4.543LeuAsp: 4.543 ± 0.792
6.771LeuGlu: 6.771 ± 0.806
4.633LeuPhe: 4.633 ± 0.808
4.098LeuGly: 4.098 ± 0.681
1.425LeuHis: 1.425 ± 0.329
6.592LeuIle: 6.592 ± 0.734
9.176LeuLys: 9.176 ± 0.961
5.612LeuLeu: 5.612 ± 0.768
1.604LeuMet: 1.604 ± 0.357
5.612LeuAsn: 5.612 ± 0.804
2.673LeuPro: 2.673 ± 0.458
2.94LeuGln: 2.94 ± 0.596
3.207LeuArg: 3.207 ± 0.53
4.811LeuSer: 4.811 ± 0.497
5.702LeuThr: 5.702 ± 0.781
4.9LeuVal: 4.9 ± 0.54
0.624LeuTrp: 0.624 ± 0.21
2.227LeuTyr: 2.227 ± 0.308
0.0LeuXaa: 0.0 ± 0.0
Met
2.227MetAla: 2.227 ± 0.523
0.089MetCys: 0.089 ± 0.103
0.535MetAsp: 0.535 ± 0.208
0.891MetGlu: 0.891 ± 0.227
0.891MetPhe: 0.891 ± 0.274
0.713MetGly: 0.713 ± 0.268
0.089MetHis: 0.089 ± 0.084
1.247MetIle: 1.247 ± 0.301
2.584MetLys: 2.584 ± 0.388
1.336MetLeu: 1.336 ± 0.307
0.0MetMet: 0.0 ± 0.0
1.514MetAsn: 1.514 ± 0.34
0.178MetPro: 0.178 ± 0.106
0.445MetGln: 0.445 ± 0.187
0.624MetArg: 0.624 ± 0.226
1.247MetSer: 1.247 ± 0.301
1.336MetThr: 1.336 ± 0.3
0.713MetVal: 0.713 ± 0.246
0.089MetTrp: 0.089 ± 0.076
0.713MetTyr: 0.713 ± 0.245
0.0MetXaa: 0.0 ± 0.0
Asn
3.831AsnAla: 3.831 ± 0.605
0.891AsnCys: 0.891 ± 0.23
3.653AsnAsp: 3.653 ± 0.651
5.167AsnGlu: 5.167 ± 0.649
3.653AsnPhe: 3.653 ± 0.558
3.831AsnGly: 3.831 ± 0.542
0.535AsnHis: 0.535 ± 0.194
4.811AsnIle: 4.811 ± 0.821
4.276AsnLys: 4.276 ± 0.648
5.523AsnLeu: 5.523 ± 0.835
1.336AsnMet: 1.336 ± 0.324
2.94AsnAsn: 2.94 ± 0.423
2.494AsnPro: 2.494 ± 0.525
1.425AsnGln: 1.425 ± 0.381
2.227AsnArg: 2.227 ± 0.48
3.742AsnSer: 3.742 ± 0.65
3.385AsnThr: 3.385 ± 0.522
3.653AsnVal: 3.653 ± 0.672
1.158AsnTrp: 1.158 ± 0.271
2.405AsnTyr: 2.405 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
3.296ProAla: 3.296 ± 0.623
0.178ProCys: 0.178 ± 0.118
0.98ProAsp: 0.98 ± 0.259
2.673ProGlu: 2.673 ± 0.559
1.871ProPhe: 1.871 ± 0.359
0.0ProGly: 0.0 ± 0.0
0.089ProHis: 0.089 ± 0.101
2.316ProIle: 2.316 ± 0.445
3.118ProLys: 3.118 ± 0.506
2.673ProLeu: 2.673 ± 0.546
0.267ProMet: 0.267 ± 0.171
2.316ProAsn: 2.316 ± 0.413
0.713ProPro: 0.713 ± 0.206
0.624ProGln: 0.624 ± 0.226
1.158ProArg: 1.158 ± 0.361
1.871ProSer: 1.871 ± 0.502
1.158ProThr: 1.158 ± 0.313
2.227ProVal: 2.227 ± 0.485
0.445ProTrp: 0.445 ± 0.188
0.624ProTyr: 0.624 ± 0.238
0.0ProXaa: 0.0 ± 0.0
Gln
2.762GlnAla: 2.762 ± 0.429
0.178GlnCys: 0.178 ± 0.146
1.336GlnAsp: 1.336 ± 0.374
2.316GlnGlu: 2.316 ± 0.469
1.336GlnPhe: 1.336 ± 0.34
1.693GlnGly: 1.693 ± 0.481
0.356GlnHis: 0.356 ± 0.171
3.385GlnIle: 3.385 ± 0.578
3.207GlnLys: 3.207 ± 0.454
2.405GlnLeu: 2.405 ± 0.524
0.98GlnMet: 0.98 ± 0.277
2.049GlnAsn: 2.049 ± 0.459
0.713GlnPro: 0.713 ± 0.275
1.158GlnGln: 1.158 ± 0.269
1.782GlnArg: 1.782 ± 0.351
1.871GlnSer: 1.871 ± 0.359
1.871GlnThr: 1.871 ± 0.442
1.871GlnVal: 1.871 ± 0.376
0.267GlnTrp: 0.267 ± 0.137
0.891GlnTyr: 0.891 ± 0.316
0.0GlnXaa: 0.0 ± 0.0
Arg
2.049ArgAla: 2.049 ± 0.542
0.356ArgCys: 0.356 ± 0.159
1.247ArgAsp: 1.247 ± 0.324
2.494ArgGlu: 2.494 ± 0.539
1.693ArgPhe: 1.693 ± 0.4
2.494ArgGly: 2.494 ± 0.465
0.713ArgHis: 0.713 ± 0.227
2.762ArgIle: 2.762 ± 0.392
2.94ArgLys: 2.94 ± 0.613
3.563ArgLeu: 3.563 ± 0.868
0.713ArgMet: 0.713 ± 0.265
2.138ArgAsn: 2.138 ± 0.387
1.069ArgPro: 1.069 ± 0.25
0.98ArgGln: 0.98 ± 0.247
2.494ArgArg: 2.494 ± 0.446
2.227ArgSer: 2.227 ± 0.32
2.316ArgThr: 2.316 ± 0.488
2.316ArgVal: 2.316 ± 0.44
0.267ArgTrp: 0.267 ± 0.153
1.069ArgTyr: 1.069 ± 0.326
0.0ArgXaa: 0.0 ± 0.0
Ser
3.92SerAla: 3.92 ± 0.773
0.802SerCys: 0.802 ± 0.303
4.187SerAsp: 4.187 ± 0.518
5.702SerGlu: 5.702 ± 0.65
3.831SerPhe: 3.831 ± 0.532
3.653SerGly: 3.653 ± 0.592
0.356SerHis: 0.356 ± 0.175
3.563SerIle: 3.563 ± 0.49
4.365SerLys: 4.365 ± 0.453
5.256SerLeu: 5.256 ± 0.675
0.891SerMet: 0.891 ± 0.405
2.94SerAsn: 2.94 ± 0.632
2.138SerPro: 2.138 ± 0.393
1.604SerGln: 1.604 ± 0.389
1.782SerArg: 1.782 ± 0.378
4.187SerSer: 4.187 ± 0.843
2.494SerThr: 2.494 ± 0.396
4.098SerVal: 4.098 ± 0.601
0.624SerTrp: 0.624 ± 0.235
1.425SerTyr: 1.425 ± 0.412
0.0SerXaa: 0.0 ± 0.0
Thr
5.167ThrAla: 5.167 ± 0.705
0.445ThrCys: 0.445 ± 0.184
4.365ThrAsp: 4.365 ± 0.756
6.147ThrGlu: 6.147 ± 0.684
3.118ThrPhe: 3.118 ± 0.507
4.276ThrGly: 4.276 ± 0.75
0.802ThrHis: 0.802 ± 0.333
4.365ThrIle: 4.365 ± 0.561
3.92ThrLys: 3.92 ± 0.487
4.989ThrLeu: 4.989 ± 0.71
0.802ThrMet: 0.802 ± 0.269
3.563ThrAsn: 3.563 ± 0.615
2.851ThrPro: 2.851 ± 0.484
2.138ThrGln: 2.138 ± 0.4
1.247ThrArg: 1.247 ± 0.346
3.92ThrSer: 3.92 ± 0.528
4.187ThrThr: 4.187 ± 0.538
3.474ThrVal: 3.474 ± 0.615
0.445ThrTrp: 0.445 ± 0.202
1.514ThrTyr: 1.514 ± 0.274
0.0ThrXaa: 0.0 ± 0.0
Val
3.474ValAla: 3.474 ± 0.534
0.802ValCys: 0.802 ± 0.298
2.405ValAsp: 2.405 ± 0.59
4.811ValGlu: 4.811 ± 0.676
2.673ValPhe: 2.673 ± 0.559
3.385ValGly: 3.385 ± 0.754
0.891ValHis: 0.891 ± 0.243
4.009ValIle: 4.009 ± 0.601
5.969ValLys: 5.969 ± 0.817
7.127ValLeu: 7.127 ± 0.741
0.624ValMet: 0.624 ± 0.216
4.722ValAsn: 4.722 ± 0.598
2.316ValPro: 2.316 ± 0.365
2.316ValGln: 2.316 ± 0.434
2.673ValArg: 2.673 ± 0.42
4.098ValSer: 4.098 ± 0.618
3.742ValThr: 3.742 ± 0.626
5.167ValVal: 5.167 ± 0.714
0.802ValTrp: 0.802 ± 0.252
2.851ValTyr: 2.851 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.535TrpAla: 0.535 ± 0.2
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.535TrpGlu: 0.535 ± 0.176
0.624TrpPhe: 0.624 ± 0.217
0.713TrpGly: 0.713 ± 0.279
0.089TrpHis: 0.089 ± 0.076
0.802TrpIle: 0.802 ± 0.304
0.98TrpLys: 0.98 ± 0.262
1.425TrpLeu: 1.425 ± 0.295
0.0TrpMet: 0.0 ± 0.0
0.535TrpAsn: 0.535 ± 0.217
0.0TrpPro: 0.0 ± 0.0
0.089TrpGln: 0.089 ± 0.075
0.535TrpArg: 0.535 ± 0.212
0.802TrpSer: 0.802 ± 0.211
0.535TrpThr: 0.535 ± 0.204
0.802TrpVal: 0.802 ± 0.221
0.089TrpTrp: 0.089 ± 0.098
0.267TrpTyr: 0.267 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.049TyrAla: 2.049 ± 0.417
0.445TyrCys: 0.445 ± 0.178
2.494TyrAsp: 2.494 ± 0.554
1.782TyrGlu: 1.782 ± 0.543
2.94TyrPhe: 2.94 ± 0.468
2.584TyrGly: 2.584 ± 0.431
0.267TyrHis: 0.267 ± 0.142
2.227TyrIle: 2.227 ± 0.384
3.118TyrLys: 3.118 ± 0.565
1.604TyrLeu: 1.604 ± 0.357
0.713TyrMet: 0.713 ± 0.231
2.851TyrAsn: 2.851 ± 0.572
0.802TyrPro: 0.802 ± 0.284
0.802TyrGln: 0.802 ± 0.28
1.069TyrArg: 1.069 ± 0.265
2.227TyrSer: 2.227 ± 0.446
2.316TyrThr: 2.316 ± 0.493
2.227TyrVal: 2.227 ± 0.39
0.0TyrTrp: 0.0 ± 0.0
1.871TyrTyr: 1.871 ± 0.549
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11226 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski