Amino acid dipepetide frequency for Chrysochromulina parva virophage Larry

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.343AlaAla: 9.343 ± 2.777
0.0AlaCys: 0.0 ± 0.0
3.358AlaAsp: 3.358 ± 0.896
4.526AlaGlu: 4.526 ± 1.038
2.19AlaPhe: 2.19 ± 0.367
5.401AlaGly: 5.401 ± 1.152
1.46AlaHis: 1.46 ± 0.347
4.964AlaIle: 4.964 ± 0.925
3.796AlaLys: 3.796 ± 1.026
5.255AlaLeu: 5.255 ± 1.054
1.752AlaMet: 1.752 ± 0.425
2.774AlaAsn: 2.774 ± 0.905
5.985AlaPro: 5.985 ± 2.917
3.942AlaGln: 3.942 ± 0.952
4.964AlaArg: 4.964 ± 1.094
4.964AlaSer: 4.964 ± 1.316
7.007AlaThr: 7.007 ± 2.453
3.504AlaVal: 3.504 ± 0.914
0.292AlaTrp: 0.292 ± 0.153
1.898AlaTyr: 1.898 ± 0.59
0.0AlaXaa: 0.0 ± 0.0
Cys
0.146CysAla: 0.146 ± 0.178
0.146CysCys: 0.146 ± 0.178
0.876CysAsp: 0.876 ± 0.478
0.584CysGlu: 0.584 ± 0.473
0.438CysPhe: 0.438 ± 0.332
0.73CysGly: 0.73 ± 0.429
0.0CysHis: 0.0 ± 0.0
0.876CysIle: 0.876 ± 0.49
0.876CysLys: 0.876 ± 0.482
0.438CysLeu: 0.438 ± 0.392
0.292CysMet: 0.292 ± 0.216
0.146CysAsn: 0.146 ± 0.128
0.146CysPro: 0.146 ± 0.128
0.146CysGln: 0.146 ± 0.128
0.438CysArg: 0.438 ± 0.291
0.292CysSer: 0.292 ± 0.239
0.584CysThr: 0.584 ± 0.3
0.438CysVal: 0.438 ± 0.305
0.0CysTrp: 0.0 ± 0.0
0.146CysTyr: 0.146 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
5.401AspAla: 5.401 ± 0.823
0.438AspCys: 0.438 ± 0.305
4.964AspAsp: 4.964 ± 0.946
2.628AspGlu: 2.628 ± 0.542
2.92AspPhe: 2.92 ± 0.605
2.92AspGly: 2.92 ± 0.76
0.876AspHis: 0.876 ± 0.356
5.547AspIle: 5.547 ± 1.89
3.212AspLys: 3.212 ± 0.832
6.131AspLeu: 6.131 ± 0.803
1.898AspMet: 1.898 ± 0.516
3.066AspAsn: 3.066 ± 0.636
3.212AspPro: 3.212 ± 0.454
1.46AspGln: 1.46 ± 0.458
3.212AspArg: 3.212 ± 0.614
4.526AspSer: 4.526 ± 0.871
5.401AspThr: 5.401 ± 0.7
3.212AspVal: 3.212 ± 0.474
0.584AspTrp: 0.584 ± 0.322
4.964AspTyr: 4.964 ± 0.987
0.0AspXaa: 0.0 ± 0.0
Glu
4.818GluAla: 4.818 ± 0.93
0.292GluCys: 0.292 ± 0.356
3.942GluAsp: 3.942 ± 0.816
5.401GluGlu: 5.401 ± 1.76
1.606GluPhe: 1.606 ± 0.436
2.628GluGly: 2.628 ± 0.459
1.46GluHis: 1.46 ± 0.449
2.774GluIle: 2.774 ± 0.86
3.066GluLys: 3.066 ± 0.817
5.985GluLeu: 5.985 ± 1.115
3.066GluMet: 3.066 ± 0.59
3.504GluAsn: 3.504 ± 0.9
1.606GluPro: 1.606 ± 0.473
1.752GluGln: 1.752 ± 0.999
3.942GluArg: 3.942 ± 0.97
4.234GluSer: 4.234 ± 0.862
4.088GluThr: 4.088 ± 0.767
2.336GluVal: 2.336 ± 0.449
0.584GluTrp: 0.584 ± 0.321
3.066GluTyr: 3.066 ± 0.868
0.0GluXaa: 0.0 ± 0.0
Phe
2.628PheAla: 2.628 ± 0.594
1.022PheCys: 1.022 ± 0.628
2.92PheAsp: 2.92 ± 0.621
2.19PheGlu: 2.19 ± 0.745
1.46PhePhe: 1.46 ± 0.696
2.19PheGly: 2.19 ± 0.668
0.584PheHis: 0.584 ± 0.284
1.898PheIle: 1.898 ± 0.462
2.92PheLys: 2.92 ± 1.124
3.358PheLeu: 3.358 ± 0.661
1.314PheMet: 1.314 ± 0.382
3.066PheAsn: 3.066 ± 0.712
1.606PhePro: 1.606 ± 0.645
1.314PheGln: 1.314 ± 0.604
2.336PheArg: 2.336 ± 0.528
1.022PheSer: 1.022 ± 0.384
2.628PheThr: 2.628 ± 0.428
2.19PheVal: 2.19 ± 0.63
0.146PheTrp: 0.146 ± 0.128
1.606PheTyr: 1.606 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
7.007GlyAla: 7.007 ± 1.996
0.292GlyCys: 0.292 ± 0.202
2.482GlyAsp: 2.482 ± 0.606
1.752GlyGlu: 1.752 ± 0.634
2.774GlyPhe: 2.774 ± 0.767
4.672GlyGly: 4.672 ± 0.996
0.73GlyHis: 0.73 ± 0.319
2.482GlyIle: 2.482 ± 0.611
1.46GlyLys: 1.46 ± 0.767
4.818GlyLeu: 4.818 ± 1.0
2.336GlyMet: 2.336 ± 0.498
3.358GlyAsn: 3.358 ± 0.674
2.044GlyPro: 2.044 ± 0.748
2.336GlyGln: 2.336 ± 0.573
3.504GlyArg: 3.504 ± 0.761
3.358GlySer: 3.358 ± 0.704
7.591GlyThr: 7.591 ± 2.76
4.38GlyVal: 4.38 ± 0.878
0.292GlyTrp: 0.292 ± 0.245
2.628GlyTyr: 2.628 ± 0.717
0.0GlyXaa: 0.0 ± 0.0
His
1.168HisAla: 1.168 ± 0.453
0.0HisCys: 0.0 ± 0.0
1.46HisAsp: 1.46 ± 0.372
0.876HisGlu: 0.876 ± 0.288
1.752HisPhe: 1.752 ± 0.752
1.606HisGly: 1.606 ± 0.505
0.292HisHis: 0.292 ± 0.278
1.606HisIle: 1.606 ± 0.431
1.606HisLys: 1.606 ± 0.669
1.022HisLeu: 1.022 ± 0.328
0.0HisMet: 0.0 ± 0.0
1.752HisAsn: 1.752 ± 0.511
0.73HisPro: 0.73 ± 0.257
0.584HisGln: 0.584 ± 0.284
0.584HisArg: 0.584 ± 0.284
0.73HisSer: 0.73 ± 0.264
1.314HisThr: 1.314 ± 0.339
0.876HisVal: 0.876 ± 0.409
0.0HisTrp: 0.0 ± 0.0
0.292HisTyr: 0.292 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
3.65IleAla: 3.65 ± 0.921
0.584IleCys: 0.584 ± 0.42
4.672IleAsp: 4.672 ± 1.058
5.693IleGlu: 5.693 ± 0.759
2.044IlePhe: 2.044 ± 0.626
3.942IleGly: 3.942 ± 0.61
1.022IleHis: 1.022 ± 0.365
4.088IleIle: 4.088 ± 0.91
4.38IleLys: 4.38 ± 1.494
3.504IleLeu: 3.504 ± 0.682
1.752IleMet: 1.752 ± 0.689
4.234IleAsn: 4.234 ± 1.069
3.796IlePro: 3.796 ± 0.781
2.774IleGln: 2.774 ± 0.725
5.693IleArg: 5.693 ± 1.181
3.65IleSer: 3.65 ± 0.711
3.65IleThr: 3.65 ± 0.671
3.212IleVal: 3.212 ± 1.073
0.146IleTrp: 0.146 ± 0.156
1.752IleTyr: 1.752 ± 0.401
0.0IleXaa: 0.0 ± 0.0
Lys
2.044LysAla: 2.044 ± 0.722
0.584LysCys: 0.584 ± 0.309
3.066LysAsp: 3.066 ± 0.978
3.066LysGlu: 3.066 ± 1.162
1.022LysPhe: 1.022 ± 0.332
1.898LysGly: 1.898 ± 0.887
1.022LysHis: 1.022 ± 0.422
6.131LysIle: 6.131 ± 2.029
5.255LysLys: 5.255 ± 1.986
3.65LysLeu: 3.65 ± 1.123
1.898LysMet: 1.898 ± 0.823
4.234LysAsn: 4.234 ± 1.524
2.482LysPro: 2.482 ± 1.143
2.482LysGln: 2.482 ± 1.061
2.482LysArg: 2.482 ± 0.734
2.336LysSer: 2.336 ± 0.959
3.504LysThr: 3.504 ± 0.808
2.044LysVal: 2.044 ± 0.741
0.584LysTrp: 0.584 ± 0.463
2.628LysTyr: 2.628 ± 0.921
0.0LysXaa: 0.0 ± 0.0
Leu
5.109LeuAla: 5.109 ± 0.853
0.584LeuCys: 0.584 ± 0.338
4.818LeuAsp: 4.818 ± 1.203
5.547LeuGlu: 5.547 ± 0.835
3.066LeuPhe: 3.066 ± 0.922
5.693LeuGly: 5.693 ± 0.628
1.752LeuHis: 1.752 ± 0.67
4.234LeuIle: 4.234 ± 0.649
3.65LeuLys: 3.65 ± 1.48
5.839LeuLeu: 5.839 ± 1.397
1.752LeuMet: 1.752 ± 0.519
3.504LeuAsn: 3.504 ± 0.732
3.942LeuPro: 3.942 ± 0.769
3.65LeuGln: 3.65 ± 1.092
4.672LeuArg: 4.672 ± 0.944
4.088LeuSer: 4.088 ± 0.719
5.693LeuThr: 5.693 ± 0.969
3.212LeuVal: 3.212 ± 0.871
0.73LeuTrp: 0.73 ± 0.327
2.19LeuTyr: 2.19 ± 0.417
0.0LeuXaa: 0.0 ± 0.0
Met
2.628MetAla: 2.628 ± 0.726
0.584MetCys: 0.584 ± 0.343
1.606MetAsp: 1.606 ± 0.435
2.336MetGlu: 2.336 ± 0.759
0.876MetPhe: 0.876 ± 0.414
0.73MetGly: 0.73 ± 0.428
0.584MetHis: 0.584 ± 0.375
1.752MetIle: 1.752 ± 0.534
2.19MetLys: 2.19 ± 0.851
1.314MetLeu: 1.314 ± 0.51
1.314MetMet: 1.314 ± 0.748
1.46MetAsn: 1.46 ± 0.503
1.314MetPro: 1.314 ± 0.258
0.73MetGln: 0.73 ± 0.306
2.336MetArg: 2.336 ± 0.665
2.044MetSer: 2.044 ± 0.48
1.606MetThr: 1.606 ± 0.446
1.022MetVal: 1.022 ± 0.296
0.146MetTrp: 0.146 ± 0.107
0.584MetTyr: 0.584 ± 0.288
0.0MetXaa: 0.0 ± 0.0
Asn
3.066AsnAla: 3.066 ± 0.778
0.584AsnCys: 0.584 ± 0.328
3.212AsnAsp: 3.212 ± 0.696
3.066AsnGlu: 3.066 ± 0.762
3.066AsnPhe: 3.066 ± 0.531
2.336AsnGly: 2.336 ± 0.429
1.168AsnHis: 1.168 ± 0.362
4.234AsnIle: 4.234 ± 0.665
2.92AsnLys: 2.92 ± 1.094
6.277AsnLeu: 6.277 ± 0.624
1.752AsnMet: 1.752 ± 0.577
2.482AsnAsn: 2.482 ± 0.575
3.942AsnPro: 3.942 ± 0.803
1.314AsnGln: 1.314 ± 0.435
3.65AsnArg: 3.65 ± 0.776
3.942AsnSer: 3.942 ± 0.923
4.964AsnThr: 4.964 ± 1.245
3.212AsnVal: 3.212 ± 0.789
0.584AsnTrp: 0.584 ± 0.247
4.234AsnTyr: 4.234 ± 0.911
0.0AsnXaa: 0.0 ± 0.0
Pro
7.007ProAla: 7.007 ± 3.255
0.0ProCys: 0.0 ± 0.0
3.358ProAsp: 3.358 ± 0.658
3.942ProGlu: 3.942 ± 0.636
2.044ProPhe: 2.044 ± 0.329
2.482ProGly: 2.482 ± 0.725
0.438ProHis: 0.438 ± 0.23
2.482ProIle: 2.482 ± 0.639
2.336ProLys: 2.336 ± 0.852
1.898ProLeu: 1.898 ± 0.503
0.876ProMet: 0.876 ± 0.266
2.482ProAsn: 2.482 ± 0.525
5.693ProPro: 5.693 ± 1.913
1.606ProGln: 1.606 ± 0.437
3.066ProArg: 3.066 ± 0.961
4.672ProSer: 4.672 ± 0.717
5.839ProThr: 5.839 ± 2.16
5.255ProVal: 5.255 ± 2.061
0.0ProTrp: 0.0 ± 0.0
1.46ProTyr: 1.46 ± 0.479
0.0ProXaa: 0.0 ± 0.0
Gln
1.752GlnAla: 1.752 ± 0.67
0.438GlnCys: 0.438 ± 0.267
2.92GlnAsp: 2.92 ± 0.498
2.044GlnGlu: 2.044 ± 0.495
1.022GlnPhe: 1.022 ± 0.296
1.898GlnGly: 1.898 ± 0.375
0.438GlnHis: 0.438 ± 0.211
1.898GlnIle: 1.898 ± 0.454
1.606GlnLys: 1.606 ± 0.452
3.796GlnLeu: 3.796 ± 0.79
1.168GlnMet: 1.168 ± 0.329
1.898GlnAsn: 1.898 ± 0.691
1.314GlnPro: 1.314 ± 0.433
2.482GlnGln: 2.482 ± 0.742
3.066GlnArg: 3.066 ± 1.047
2.482GlnSer: 2.482 ± 0.659
2.482GlnThr: 2.482 ± 0.507
2.044GlnVal: 2.044 ± 0.589
0.146GlnTrp: 0.146 ± 0.107
2.19GlnTyr: 2.19 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
4.964ArgAla: 4.964 ± 1.563
0.584ArgCys: 0.584 ± 0.42
4.526ArgAsp: 4.526 ± 0.712
4.526ArgGlu: 4.526 ± 1.386
2.92ArgPhe: 2.92 ± 0.554
3.65ArgGly: 3.65 ± 0.567
1.168ArgHis: 1.168 ± 0.286
3.066ArgIle: 3.066 ± 0.612
2.774ArgLys: 2.774 ± 1.139
3.504ArgLeu: 3.504 ± 0.571
1.606ArgMet: 1.606 ± 0.517
4.672ArgAsn: 4.672 ± 1.162
3.942ArgPro: 3.942 ± 1.084
2.774ArgGln: 2.774 ± 0.636
2.774ArgArg: 2.774 ± 0.844
3.066ArgSer: 3.066 ± 0.817
3.504ArgThr: 3.504 ± 1.021
3.212ArgVal: 3.212 ± 0.781
0.292ArgTrp: 0.292 ± 0.164
1.752ArgTyr: 1.752 ± 0.557
0.0ArgXaa: 0.0 ± 0.0
Ser
3.796SerAla: 3.796 ± 1.213
0.146SerCys: 0.146 ± 0.178
3.796SerAsp: 3.796 ± 0.858
4.672SerGlu: 4.672 ± 0.781
3.796SerPhe: 3.796 ± 1.065
4.964SerGly: 4.964 ± 1.199
1.46SerHis: 1.46 ± 0.597
5.401SerIle: 5.401 ± 0.987
2.336SerLys: 2.336 ± 0.865
4.526SerLeu: 4.526 ± 1.145
0.876SerMet: 0.876 ± 0.241
3.796SerAsn: 3.796 ± 0.862
1.752SerPro: 1.752 ± 0.361
1.898SerGln: 1.898 ± 0.512
3.65SerArg: 3.65 ± 0.942
4.818SerSer: 4.818 ± 1.16
5.693SerThr: 5.693 ± 1.191
3.504SerVal: 3.504 ± 0.744
0.292SerTrp: 0.292 ± 0.18
1.46SerTyr: 1.46 ± 0.456
0.0SerXaa: 0.0 ± 0.0
Thr
8.029ThrAla: 8.029 ± 1.445
0.584ThrCys: 0.584 ± 0.331
3.942ThrAsp: 3.942 ± 0.539
2.19ThrGlu: 2.19 ± 0.52
2.336ThrPhe: 2.336 ± 0.494
7.737ThrGly: 7.737 ± 2.199
1.606ThrHis: 1.606 ± 0.399
4.088ThrIle: 4.088 ± 0.932
3.066ThrLys: 3.066 ± 1.165
7.153ThrLeu: 7.153 ± 1.24
1.022ThrMet: 1.022 ± 0.286
3.942ThrAsn: 3.942 ± 0.765
6.277ThrPro: 6.277 ± 2.666
3.066ThrGln: 3.066 ± 0.565
3.942ThrArg: 3.942 ± 0.933
6.277ThrSer: 6.277 ± 0.785
6.131ThrThr: 6.131 ± 1.951
3.942ThrVal: 3.942 ± 0.935
0.876ThrTrp: 0.876 ± 0.351
3.358ThrTyr: 3.358 ± 0.685
0.0ThrXaa: 0.0 ± 0.0
Val
3.212ValAla: 3.212 ± 0.578
0.584ValCys: 0.584 ± 0.334
5.255ValAsp: 5.255 ± 0.858
2.628ValGlu: 2.628 ± 0.771
0.876ValPhe: 0.876 ± 0.415
2.482ValGly: 2.482 ± 0.704
0.876ValHis: 0.876 ± 0.293
2.774ValIle: 2.774 ± 0.746
2.19ValLys: 2.19 ± 0.744
2.628ValLeu: 2.628 ± 0.557
0.876ValMet: 0.876 ± 0.277
5.109ValAsn: 5.109 ± 0.841
5.255ValPro: 5.255 ± 1.671
1.606ValGln: 1.606 ± 0.445
2.336ValArg: 2.336 ± 0.592
4.088ValSer: 4.088 ± 0.711
4.234ValThr: 4.234 ± 0.977
2.92ValVal: 2.92 ± 0.759
0.146ValTrp: 0.146 ± 0.2
2.044ValTyr: 2.044 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.146TrpAla: 0.146 ± 0.107
0.146TrpCys: 0.146 ± 0.189
0.438TrpAsp: 0.438 ± 0.237
0.584TrpGlu: 0.584 ± 0.229
0.292TrpPhe: 0.292 ± 0.148
0.146TrpGly: 0.146 ± 0.189
0.438TrpHis: 0.438 ± 0.229
0.584TrpIle: 0.584 ± 0.335
0.438TrpLys: 0.438 ± 0.442
0.584TrpLeu: 0.584 ± 0.289
0.292TrpMet: 0.292 ± 0.215
1.022TrpAsn: 1.022 ± 0.416
0.146TrpPro: 0.146 ± 0.187
0.146TrpGln: 0.146 ± 0.164
0.292TrpArg: 0.292 ± 0.191
0.146TrpSer: 0.146 ± 0.107
0.146TrpThr: 0.146 ± 0.17
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.146TrpTyr: 0.146 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.46TyrAla: 1.46 ± 0.419
0.146TyrCys: 0.146 ± 0.14
4.964TyrAsp: 4.964 ± 0.731
1.898TyrGlu: 1.898 ± 0.745
1.898TyrPhe: 1.898 ± 0.546
2.19TyrGly: 2.19 ± 0.369
0.876TyrHis: 0.876 ± 0.29
3.358TyrIle: 3.358 ± 1.016
2.336TyrLys: 2.336 ± 0.892
2.044TyrLeu: 2.044 ± 0.579
1.168TyrMet: 1.168 ± 0.479
3.504TyrAsn: 3.504 ± 0.448
1.752TyrPro: 1.752 ± 0.641
1.022TyrGln: 1.022 ± 0.305
2.336TyrArg: 2.336 ± 0.532
1.898TyrSer: 1.898 ± 0.548
3.65TyrThr: 3.65 ± 0.644
1.606TyrVal: 1.606 ± 0.516
0.292TyrTrp: 0.292 ± 0.215
1.606TyrTyr: 1.606 ± 0.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (6851 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski