Amino acid dipepetide frequency for Chrysochromulina parva virophage Moe

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.107AlaAla: 5.107 ± 1.769
0.0AlaCys: 0.0 ± 0.0
3.004AlaAsp: 3.004 ± 0.608
4.055AlaGlu: 4.055 ± 0.972
2.253AlaPhe: 2.253 ± 0.452
5.858AlaGly: 5.858 ± 1.538
1.051AlaHis: 1.051 ± 0.249
4.055AlaIle: 4.055 ± 0.614
6.458AlaLys: 6.458 ± 1.406
5.858AlaLeu: 5.858 ± 0.876
1.802AlaMet: 1.802 ± 0.609
4.356AlaAsn: 4.356 ± 0.534
4.055AlaPro: 4.055 ± 1.816
2.553AlaGln: 2.553 ± 0.816
4.055AlaArg: 4.055 ± 1.246
6.008AlaSer: 6.008 ± 0.998
2.854AlaThr: 2.854 ± 0.94
2.403AlaVal: 2.403 ± 0.722
0.451AlaTrp: 0.451 ± 0.229
2.103AlaTyr: 2.103 ± 0.348
0.0AlaXaa: 0.0 ± 0.0
Cys
1.051CysAla: 1.051 ± 0.516
0.0CysCys: 0.0 ± 0.0
0.601CysAsp: 0.601 ± 0.312
1.051CysGlu: 1.051 ± 0.404
0.451CysPhe: 0.451 ± 0.231
0.601CysGly: 0.601 ± 0.314
0.0CysHis: 0.0 ± 0.0
0.451CysIle: 0.451 ± 0.271
0.601CysLys: 0.601 ± 0.354
0.3CysLeu: 0.3 ± 0.218
0.15CysMet: 0.15 ± 0.156
0.451CysAsn: 0.451 ± 0.422
0.15CysPro: 0.15 ± 0.142
0.601CysGln: 0.601 ± 0.326
0.451CysArg: 0.451 ± 0.244
1.051CysSer: 1.051 ± 0.496
0.3CysThr: 0.3 ± 0.247
0.601CysVal: 0.601 ± 0.302
0.15CysTrp: 0.15 ± 0.142
0.3CysTyr: 0.3 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
4.055AspAla: 4.055 ± 1.285
0.601AspCys: 0.601 ± 0.362
4.356AspAsp: 4.356 ± 1.144
6.008AspGlu: 6.008 ± 0.898
2.704AspPhe: 2.704 ± 0.511
2.403AspGly: 2.403 ± 0.584
1.051AspHis: 1.051 ± 0.451
6.008AspIle: 6.008 ± 0.963
4.205AspLys: 4.205 ± 0.669
4.956AspLeu: 4.956 ± 0.845
1.652AspMet: 1.652 ± 0.489
3.304AspAsn: 3.304 ± 0.522
4.205AspPro: 4.205 ± 1.103
1.502AspGln: 1.502 ± 0.47
2.403AspArg: 2.403 ± 0.781
2.704AspSer: 2.704 ± 0.844
4.205AspThr: 4.205 ± 1.197
3.154AspVal: 3.154 ± 0.583
0.601AspTrp: 0.601 ± 0.327
4.205AspTyr: 4.205 ± 0.958
0.0AspXaa: 0.0 ± 0.0
Glu
2.854GluAla: 2.854 ± 0.852
1.202GluCys: 1.202 ± 0.48
4.506GluAsp: 4.506 ± 0.85
7.66GluGlu: 7.66 ± 2.049
2.854GluPhe: 2.854 ± 0.703
3.905GluGly: 3.905 ± 0.683
0.901GluHis: 0.901 ± 0.369
5.407GluIle: 5.407 ± 0.798
5.107GluLys: 5.107 ± 0.911
7.81GluLeu: 7.81 ± 0.758
1.953GluMet: 1.953 ± 0.553
4.205GluAsn: 4.205 ± 0.804
1.953GluPro: 1.953 ± 0.469
3.905GluGln: 3.905 ± 1.021
4.205GluArg: 4.205 ± 0.877
3.304GluSer: 3.304 ± 0.533
4.205GluThr: 4.205 ± 0.706
2.854GluVal: 2.854 ± 0.775
0.3GluTrp: 0.3 ± 0.221
3.905GluTyr: 3.905 ± 0.865
0.0GluXaa: 0.0 ± 0.0
Phe
1.652PheAla: 1.652 ± 0.574
0.601PheCys: 0.601 ± 0.261
4.205PheAsp: 4.205 ± 0.913
2.854PheGlu: 2.854 ± 0.696
1.051PhePhe: 1.051 ± 0.286
2.403PheGly: 2.403 ± 0.626
0.3PheHis: 0.3 ± 0.17
3.304PheIle: 3.304 ± 0.83
2.854PheLys: 2.854 ± 0.53
2.553PheLeu: 2.553 ± 0.772
0.3PheMet: 0.3 ± 0.152
3.605PheAsn: 3.605 ± 0.773
1.051PhePro: 1.051 ± 0.362
1.202PheGln: 1.202 ± 0.495
1.502PheArg: 1.502 ± 0.553
1.953PheSer: 1.953 ± 0.514
1.953PheThr: 1.953 ± 0.672
2.253PheVal: 2.253 ± 0.553
0.3PheTrp: 0.3 ± 0.229
1.352PheTyr: 1.352 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
5.257GlyAla: 5.257 ± 1.135
0.0GlyCys: 0.0 ± 0.0
3.004GlyAsp: 3.004 ± 0.586
2.704GlyGlu: 2.704 ± 0.576
2.403GlyPhe: 2.403 ± 0.462
6.308GlyGly: 6.308 ± 1.62
0.901GlyHis: 0.901 ± 0.407
3.004GlyIle: 3.004 ± 0.511
3.605GlyLys: 3.605 ± 0.51
4.356GlyLeu: 4.356 ± 0.896
0.901GlyMet: 0.901 ± 0.359
2.854GlyAsn: 2.854 ± 0.808
0.601GlyPro: 0.601 ± 0.441
2.103GlyGln: 2.103 ± 0.591
1.652GlyArg: 1.652 ± 0.524
3.905GlySer: 3.905 ± 0.644
3.304GlyThr: 3.304 ± 0.526
3.004GlyVal: 3.004 ± 0.772
0.451GlyTrp: 0.451 ± 0.305
1.802GlyTyr: 1.802 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
0.751HisAla: 0.751 ± 0.325
0.15HisCys: 0.15 ± 0.133
1.051HisAsp: 1.051 ± 0.436
0.451HisGlu: 0.451 ± 0.268
1.352HisPhe: 1.352 ± 0.477
0.15HisGly: 0.15 ± 0.175
0.3HisHis: 0.3 ± 0.241
1.352HisIle: 1.352 ± 0.533
1.352HisLys: 1.352 ± 0.373
1.051HisLeu: 1.051 ± 0.392
0.15HisMet: 0.15 ± 0.11
0.901HisAsn: 0.901 ± 0.348
0.901HisPro: 0.901 ± 0.343
0.3HisGln: 0.3 ± 0.255
0.451HisArg: 0.451 ± 0.234
0.601HisSer: 0.601 ± 0.299
0.751HisThr: 0.751 ± 0.226
1.051HisVal: 1.051 ± 0.468
0.15HisTrp: 0.15 ± 0.173
1.502HisTyr: 1.502 ± 0.362
0.0HisXaa: 0.0 ± 0.0
Ile
4.806IleAla: 4.806 ± 0.629
0.601IleCys: 0.601 ± 0.303
5.257IleAsp: 5.257 ± 0.626
6.158IleGlu: 6.158 ± 0.828
3.154IlePhe: 3.154 ± 0.767
2.704IleGly: 2.704 ± 0.387
0.15IleHis: 0.15 ± 0.143
4.506IleIle: 4.506 ± 0.994
6.308IleLys: 6.308 ± 1.373
4.055IleLeu: 4.055 ± 0.737
1.652IleMet: 1.652 ± 0.521
6.759IleAsn: 6.759 ± 0.829
3.454IlePro: 3.454 ± 0.74
1.953IleGln: 1.953 ± 0.452
3.605IleArg: 3.605 ± 0.641
4.205IleSer: 4.205 ± 0.779
4.806IleThr: 4.806 ± 0.924
4.205IleVal: 4.205 ± 0.8
0.451IleTrp: 0.451 ± 0.357
2.704IleTyr: 2.704 ± 0.541
0.0IleXaa: 0.0 ± 0.0
Lys
5.407LysAla: 5.407 ± 1.537
0.901LysCys: 0.901 ± 0.415
4.205LysAsp: 4.205 ± 1.014
6.008LysGlu: 6.008 ± 1.443
2.403LysPhe: 2.403 ± 0.745
3.154LysGly: 3.154 ± 0.62
1.502LysHis: 1.502 ± 0.743
6.458LysIle: 6.458 ± 1.002
9.612LysLys: 9.612 ± 2.523
7.96LysLeu: 7.96 ± 1.012
2.403LysMet: 2.403 ± 0.53
6.308LysAsn: 6.308 ± 1.152
4.055LysPro: 4.055 ± 0.759
3.154LysGln: 3.154 ± 0.656
3.154LysArg: 3.154 ± 1.058
4.205LysSer: 4.205 ± 0.741
5.707LysThr: 5.707 ± 0.862
2.704LysVal: 2.704 ± 0.805
0.601LysTrp: 0.601 ± 0.36
3.454LysTyr: 3.454 ± 1.079
0.0LysXaa: 0.0 ± 0.0
Leu
4.806LeuAla: 4.806 ± 0.823
0.451LeuCys: 0.451 ± 0.275
5.557LeuAsp: 5.557 ± 0.585
7.51LeuGlu: 7.51 ± 1.127
1.652LeuPhe: 1.652 ± 0.655
4.956LeuGly: 4.956 ± 0.734
1.352LeuHis: 1.352 ± 0.467
6.158LeuIle: 6.158 ± 1.3
7.96LeuLys: 7.96 ± 0.992
5.257LeuLeu: 5.257 ± 1.034
2.253LeuMet: 2.253 ± 0.509
6.308LeuAsn: 6.308 ± 1.191
3.755LeuPro: 3.755 ± 0.684
3.004LeuGln: 3.004 ± 1.17
2.704LeuArg: 2.704 ± 0.538
4.506LeuSer: 4.506 ± 1.128
5.858LeuThr: 5.858 ± 0.855
4.356LeuVal: 4.356 ± 0.722
0.601LeuTrp: 0.601 ± 0.247
3.605LeuTyr: 3.605 ± 0.676
0.0LeuXaa: 0.0 ± 0.0
Met
2.103MetAla: 2.103 ± 0.535
0.3MetCys: 0.3 ± 0.217
1.202MetAsp: 1.202 ± 0.531
0.901MetGlu: 0.901 ± 0.316
0.451MetPhe: 0.451 ± 0.294
1.502MetGly: 1.502 ± 0.674
0.15MetHis: 0.15 ± 0.152
1.953MetIle: 1.953 ± 0.567
2.403MetLys: 2.403 ± 0.545
2.704MetLeu: 2.704 ± 0.696
0.3MetMet: 0.3 ± 0.152
1.652MetAsn: 1.652 ± 0.432
1.202MetPro: 1.202 ± 0.271
1.051MetGln: 1.051 ± 0.362
1.352MetArg: 1.352 ± 0.4
1.802MetSer: 1.802 ± 0.641
1.802MetThr: 1.802 ± 0.477
0.751MetVal: 0.751 ± 0.309
0.15MetTrp: 0.15 ± 0.141
0.901MetTyr: 0.901 ± 0.395
0.0MetXaa: 0.0 ± 0.0
Asn
4.055AsnAla: 4.055 ± 0.608
1.051AsnCys: 1.051 ± 0.462
4.356AsnAsp: 4.356 ± 0.707
5.707AsnGlu: 5.707 ± 1.317
2.403AsnPhe: 2.403 ± 0.582
2.103AsnGly: 2.103 ± 0.549
0.901AsnHis: 0.901 ± 0.342
4.956AsnIle: 4.956 ± 0.717
5.707AsnLys: 5.707 ± 1.236
6.008AsnLeu: 6.008 ± 1.105
2.253AsnMet: 2.253 ± 0.46
5.858AsnAsn: 5.858 ± 1.363
4.205AsnPro: 4.205 ± 1.051
1.652AsnGln: 1.652 ± 0.566
2.854AsnArg: 2.854 ± 0.643
4.806AsnSer: 4.806 ± 1.446
4.356AsnThr: 4.356 ± 0.672
3.154AsnVal: 3.154 ± 0.832
0.901AsnTrp: 0.901 ± 0.373
4.356AsnTyr: 4.356 ± 1.061
0.0AsnXaa: 0.0 ± 0.0
Pro
4.506ProAla: 4.506 ± 1.866
0.3ProCys: 0.3 ± 0.221
2.854ProAsp: 2.854 ± 0.985
2.553ProGlu: 2.553 ± 0.547
1.502ProPhe: 1.502 ± 0.277
0.451ProGly: 0.451 ± 0.274
0.601ProHis: 0.601 ± 0.3
3.304ProIle: 3.304 ± 0.93
3.004ProLys: 3.004 ± 0.735
2.253ProLeu: 2.253 ± 0.512
1.352ProMet: 1.352 ± 0.539
4.205ProAsn: 4.205 ± 0.921
5.407ProPro: 5.407 ± 2.707
2.403ProGln: 2.403 ± 0.686
2.403ProArg: 2.403 ± 0.642
4.356ProSer: 4.356 ± 0.923
3.154ProThr: 3.154 ± 0.59
1.953ProVal: 1.953 ± 0.606
0.0ProTrp: 0.0 ± 0.0
1.652ProTyr: 1.652 ± 0.487
0.0ProXaa: 0.0 ± 0.0
Gln
2.854GlnAla: 2.854 ± 1.081
0.751GlnCys: 0.751 ± 0.385
1.202GlnAsp: 1.202 ± 0.434
2.253GlnGlu: 2.253 ± 0.792
1.051GlnPhe: 1.051 ± 0.413
1.502GlnGly: 1.502 ± 0.386
0.901GlnHis: 0.901 ± 0.355
2.704GlnIle: 2.704 ± 0.465
3.154GlnLys: 3.154 ± 0.583
4.956GlnLeu: 4.956 ± 0.843
0.451GlnMet: 0.451 ± 0.214
3.004GlnAsn: 3.004 ± 0.527
2.253GlnPro: 2.253 ± 0.882
1.352GlnGln: 1.352 ± 0.472
2.854GlnArg: 2.854 ± 0.902
1.502GlnSer: 1.502 ± 0.534
2.253GlnThr: 2.253 ± 0.568
1.802GlnVal: 1.802 ± 0.379
0.3GlnTrp: 0.3 ± 0.155
1.652GlnTyr: 1.652 ± 0.463
0.0GlnXaa: 0.0 ± 0.0
Arg
1.652ArgAla: 1.652 ± 0.504
0.451ArgCys: 0.451 ± 0.237
5.257ArgAsp: 5.257 ± 1.263
3.304ArgGlu: 3.304 ± 0.643
1.953ArgPhe: 1.953 ± 0.491
2.253ArgGly: 2.253 ± 0.571
0.901ArgHis: 0.901 ± 0.365
2.704ArgIle: 2.704 ± 0.627
3.304ArgLys: 3.304 ± 0.634
3.755ArgLeu: 3.755 ± 0.66
1.652ArgMet: 1.652 ± 0.558
2.553ArgAsn: 2.553 ± 0.71
2.704ArgPro: 2.704 ± 0.67
1.652ArgGln: 1.652 ± 0.841
0.751ArgArg: 0.751 ± 0.345
1.352ArgSer: 1.352 ± 0.328
2.854ArgThr: 2.854 ± 0.793
2.103ArgVal: 2.103 ± 0.496
0.3ArgTrp: 0.3 ± 0.252
2.253ArgTyr: 2.253 ± 0.417
0.0ArgXaa: 0.0 ± 0.0
Ser
6.909SerAla: 6.909 ± 1.738
0.601SerCys: 0.601 ± 0.28
3.454SerAsp: 3.454 ± 0.823
2.854SerGlu: 2.854 ± 0.817
2.103SerPhe: 2.103 ± 0.725
3.154SerGly: 3.154 ± 0.591
1.051SerHis: 1.051 ± 0.301
3.905SerIle: 3.905 ± 0.839
5.257SerLys: 5.257 ± 1.401
4.356SerLeu: 4.356 ± 0.853
1.502SerMet: 1.502 ± 0.605
3.154SerAsn: 3.154 ± 0.922
1.953SerPro: 1.953 ± 0.417
3.004SerGln: 3.004 ± 0.975
3.004SerArg: 3.004 ± 0.916
4.356SerSer: 4.356 ± 1.035
4.055SerThr: 4.055 ± 1.441
2.553SerVal: 2.553 ± 0.499
0.751SerTrp: 0.751 ± 0.233
2.553SerTyr: 2.553 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
4.506ThrAla: 4.506 ± 0.659
0.601ThrCys: 0.601 ± 0.353
3.605ThrAsp: 3.605 ± 0.79
3.905ThrGlu: 3.905 ± 0.589
2.553ThrPhe: 2.553 ± 0.674
3.905ThrGly: 3.905 ± 0.552
1.051ThrHis: 1.051 ± 0.294
4.356ThrIle: 4.356 ± 0.514
4.356ThrLys: 4.356 ± 1.316
7.36ThrLeu: 7.36 ± 0.905
0.901ThrMet: 0.901 ± 0.273
4.506ThrAsn: 4.506 ± 1.279
3.004ThrPro: 3.004 ± 0.937
3.605ThrGln: 3.605 ± 0.481
1.802ThrArg: 1.802 ± 0.588
5.107ThrSer: 5.107 ± 1.233
4.506ThrThr: 4.506 ± 0.971
0.0ThrVal: 0.0 ± 0.0
0.3ThrTrp: 0.3 ± 0.239
1.953ThrTyr: 1.953 ± 0.434
0.0ThrXaa: 0.0 ± 0.0
Val
3.454ValAla: 3.454 ± 0.811
0.15ValCys: 0.15 ± 0.142
3.004ValAsp: 3.004 ± 0.663
3.304ValGlu: 3.304 ± 0.711
1.352ValPhe: 1.352 ± 0.55
1.953ValGly: 1.953 ± 0.426
0.451ValHis: 0.451 ± 0.28
4.055ValIle: 4.055 ± 0.921
4.055ValLys: 4.055 ± 0.765
1.953ValLeu: 1.953 ± 0.471
1.352ValMet: 1.352 ± 0.358
3.154ValAsn: 3.154 ± 0.696
2.103ValPro: 2.103 ± 0.751
1.953ValGln: 1.953 ± 0.392
2.253ValArg: 2.253 ± 0.503
1.953ValSer: 1.953 ± 0.54
1.953ValThr: 1.953 ± 0.453
2.253ValVal: 2.253 ± 0.457
0.3ValTrp: 0.3 ± 0.185
1.652ValTyr: 1.652 ± 0.408
0.0ValXaa: 0.0 ± 0.0
Trp
0.3TrpAla: 0.3 ± 0.221
0.15TrpCys: 0.15 ± 0.158
0.3TrpAsp: 0.3 ± 0.221
0.751TrpGlu: 0.751 ± 0.334
0.751TrpPhe: 0.751 ± 0.415
0.451TrpGly: 0.451 ± 0.271
0.451TrpHis: 0.451 ± 0.229
0.15TrpIle: 0.15 ± 0.15
0.601TrpLys: 0.601 ± 0.28
0.451TrpLeu: 0.451 ± 0.338
0.0TrpMet: 0.0 ± 0.0
0.601TrpAsn: 0.601 ± 0.295
0.15TrpPro: 0.15 ± 0.143
0.15TrpGln: 0.15 ± 0.173
0.451TrpArg: 0.451 ± 0.225
0.451TrpSer: 0.451 ± 0.231
0.451TrpThr: 0.451 ± 0.307
0.601TrpVal: 0.601 ± 0.204
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.403TyrAla: 2.403 ± 0.512
0.451TyrCys: 0.451 ± 0.335
3.304TyrAsp: 3.304 ± 0.797
3.004TyrGlu: 3.004 ± 0.915
2.854TyrPhe: 2.854 ± 0.574
2.403TyrGly: 2.403 ± 0.509
0.751TyrHis: 0.751 ± 0.37
2.704TyrIle: 2.704 ± 0.663
3.454TyrLys: 3.454 ± 1.076
4.656TyrLeu: 4.656 ± 0.831
1.502TyrMet: 1.502 ± 0.425
3.905TyrAsn: 3.905 ± 0.589
1.051TyrPro: 1.051 ± 0.524
1.652TyrGln: 1.652 ± 0.384
1.802TyrArg: 1.802 ± 0.637
2.403TyrSer: 2.403 ± 0.821
2.553TyrThr: 2.553 ± 0.662
0.901TyrVal: 0.901 ± 0.237
0.15TyrTrp: 0.15 ± 0.141
2.403TyrTyr: 2.403 ± 0.674
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (6659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski