Amino acid dipeptide frequency for Persimmon virus B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
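The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names, the use of one-letter codes, the mapping of every non-standard letter to `X` (Xaa), and the choice not to count pairs across protein boundaries are all assumptions, since the page does not state how these cases are handled.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over 400 dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With this convention, a table value such as 4.195‰ would be labelled "over" and 1.174‰ "under", which mirrors the red and blue marking described above.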

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.195 ± 0.97
AlaCys: 1.174 ± 0.509
AlaAsp: 3.188 ± 0.91
AlaGlu: 4.866 ± 0.903
AlaPhe: 2.517 ± 0.533
AlaGly: 4.027 ± 0.626
AlaHis: 1.342 ± 0.466
AlaIle: 4.195 ± 0.87
AlaLys: 3.523 ± 0.733
AlaLeu: 5.537 ± 1.359
AlaMet: 1.174 ± 0.501
AlaAsn: 1.846 ± 0.511
AlaPro: 3.859 ± 1.275
AlaGln: 1.342 ± 0.466
AlaArg: 2.852 ± 0.671
AlaSer: 4.53 ± 0.822
AlaThr: 3.356 ± 0.462
AlaVal: 5.201 ± 0.655
AlaTrp: 1.007 ± 0.436
AlaTyr: 2.349 ± 0.82
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.846 ± 0.482
CysCys: 0.839 ± 0.272
CysAsp: 1.51 ± 0.344
CysGlu: 1.007 ± 0.58
CysPhe: 1.174 ± 0.364
CysGly: 1.678 ± 0.687
CysHis: 0.0 ± 0.0
CysIle: 1.51 ± 0.739
CysLys: 0.839 ± 0.394
CysLeu: 1.846 ± 0.845
CysMet: 0.839 ± 0.59
CysAsn: 1.342 ± 0.503
CysPro: 0.503 ± 0.29
CysGln: 0.503 ± 0.283
CysArg: 1.846 ± 0.56
CysSer: 1.174 ± 0.478
CysThr: 2.013 ± 0.265
CysVal: 1.846 ± 0.509
CysTrp: 0.336 ± 0.193
CysTyr: 1.342 ± 0.302
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.698 ± 1.276
AspCys: 1.342 ± 0.4
AspAsp: 3.859 ± 1.468
AspGlu: 4.027 ± 0.694
AspPhe: 3.02 ± 0.745
AspGly: 3.02 ± 1.128
AspHis: 0.839 ± 0.276
AspIle: 4.53 ± 0.514
AspLys: 4.195 ± 0.545
AspLeu: 3.859 ± 0.954
AspMet: 1.51 ± 0.309
AspAsn: 2.517 ± 0.609
AspPro: 1.007 ± 0.275
AspGln: 0.671 ± 0.322
AspArg: 1.678 ± 1.23
AspSer: 5.705 ± 0.376
AspThr: 2.852 ± 0.823
AspVal: 5.537 ± 1.361
AspTrp: 0.503 ± 0.291
AspTyr: 3.02 ± 1.284
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.691 ± 0.482
GluCys: 1.007 ± 0.5
GluAsp: 3.02 ± 0.726
GluGlu: 4.195 ± 1.111
GluPhe: 2.517 ± 0.508
GluGly: 4.53 ± 0.815
GluHis: 0.336 ± 0.2
GluIle: 3.859 ± 0.602
GluLys: 3.523 ± 0.898
GluLeu: 6.208 ± 0.654
GluMet: 1.51 ± 0.664
GluAsn: 1.51 ± 0.69
GluPro: 3.02 ± 0.857
GluGln: 1.846 ± 0.6
GluArg: 3.523 ± 0.823
GluSer: 4.866 ± 0.829
GluThr: 2.013 ± 0.682
GluVal: 5.537 ± 0.978
GluTrp: 0.336 ± 0.215
GluTyr: 2.517 ± 0.384
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.846 ± 0.444
PheCys: 1.174 ± 0.51
PheAsp: 2.181 ± 0.531
PheGlu: 1.846 ± 0.731
PhePhe: 2.852 ± 0.882
PheGly: 2.685 ± 0.727
PheHis: 0.336 ± 0.339
PheIle: 3.02 ± 0.675
PheLys: 3.859 ± 0.62
PheLeu: 4.362 ± 0.792
PheMet: 1.51 ± 0.314
PheAsn: 1.846 ± 0.689
PhePro: 1.846 ± 0.428
PheGln: 0.168 ± 0.097
PheArg: 2.349 ± 0.383
PheSer: 4.53 ± 1.363
PheThr: 2.013 ± 0.732
PheVal: 4.362 ± 0.604
PheTrp: 0.336 ± 0.215
PheTyr: 2.349 ± 0.489
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.356 ± 0.803
GlyCys: 1.846 ± 0.443
GlyAsp: 3.859 ± 0.852
GlyGlu: 3.859 ± 0.829
GlyPhe: 2.013 ± 0.444
GlyGly: 5.872 ± 1.29
GlyHis: 0.671 ± 0.189
GlyIle: 2.349 ± 0.907
GlyLys: 3.188 ± 0.593
GlyLeu: 4.866 ± 1.36
GlyMet: 1.678 ± 0.502
GlyAsn: 3.02 ± 0.676
GlyPro: 1.51 ± 0.7
GlyGln: 1.174 ± 0.362
GlyArg: 3.356 ± 0.878
GlySer: 4.53 ± 1.011
GlyThr: 2.685 ± 0.941
GlyVal: 5.369 ± 1.239
GlyTrp: 0.839 ± 0.239
GlyTyr: 2.685 ± 0.579
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.503 ± 0.17
HisCys: 0.168 ± 0.202
HisAsp: 1.007 ± 0.33
HisGlu: 1.174 ± 0.471
HisPhe: 0.503 ± 0.245
HisGly: 1.007 ± 0.647
HisHis: 0.336 ± 0.16
HisIle: 1.51 ± 0.355
HisLys: 1.342 ± 0.31
HisLeu: 1.007 ± 0.48
HisMet: 0.336 ± 0.225
HisAsn: 1.007 ± 0.454
HisPro: 0.503 ± 0.207
HisGln: 0.168 ± 0.244
HisArg: 0.336 ± 0.193
HisSer: 1.51 ± 0.401
HisThr: 1.007 ± 0.279
HisVal: 2.013 ± 0.503
HisTrp: 0.168 ± 0.097
HisTyr: 0.671 ± 0.399
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.517 ± 0.721
IleCys: 1.342 ± 0.565
IleAsp: 2.685 ± 0.558
IleGlu: 3.523 ± 0.463
IlePhe: 2.013 ± 0.443
IleGly: 2.517 ± 0.444
IleHis: 1.678 ± 0.416
IleIle: 3.188 ± 0.619
IleLys: 4.195 ± 0.499
IleLeu: 5.201 ± 1.631
IleMet: 1.342 ± 0.371
IleAsn: 1.51 ± 0.434
IlePro: 3.188 ± 0.885
IleGln: 0.671 ± 0.267
IleArg: 3.859 ± 0.62
IleSer: 7.886 ± 0.935
IleThr: 3.859 ± 0.525
IleVal: 4.195 ± 0.985
IleTrp: 0.336 ± 0.337
IleTyr: 2.181 ± 0.749
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.195 ± 0.528
LysCys: 1.007 ± 0.27
LysAsp: 3.691 ± 0.507
LysGlu: 3.691 ± 0.812
LysPhe: 3.691 ± 0.776
LysGly: 3.523 ± 1.019
LysHis: 1.342 ± 0.574
LysIle: 3.356 ± 0.575
LysLys: 3.02 ± 0.795
LysLeu: 6.544 ± 1.22
LysMet: 1.007 ± 0.301
LysAsn: 2.349 ± 0.293
LysPro: 2.013 ± 0.485
LysGln: 1.342 ± 0.425
LysArg: 4.53 ± 0.61
LysSer: 4.698 ± 0.936
LysThr: 3.188 ± 0.698
LysVal: 4.53 ± 0.756
LysTrp: 0.839 ± 0.356
LysTyr: 2.685 ± 0.905
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.195 ± 0.765
LeuCys: 1.342 ± 0.335
LeuAsp: 4.698 ± 0.693
LeuGlu: 6.544 ± 1.101
LeuPhe: 4.698 ± 0.372
LeuGly: 4.53 ± 0.808
LeuHis: 1.007 ± 0.31
LeuIle: 4.698 ± 0.964
LeuLys: 5.369 ± 0.959
LeuLeu: 6.544 ± 1.132
LeuMet: 2.517 ± 0.654
LeuAsn: 5.034 ± 1.108
LeuPro: 4.195 ± 0.813
LeuGln: 1.846 ± 0.525
LeuArg: 5.034 ± 0.598
LeuSer: 8.221 ± 1.513
LeuThr: 4.195 ± 0.769
LeuVal: 7.047 ± 0.765
LeuTrp: 1.342 ± 0.496
LeuTyr: 4.53 ± 0.603
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.349 ± 0.945
MetCys: 0.336 ± 0.235
MetAsp: 1.007 ± 0.591
MetGlu: 1.51 ± 0.513
MetPhe: 1.342 ± 0.52
MetGly: 0.839 ± 0.349
MetHis: 0.336 ± 0.16
MetIle: 1.174 ± 0.37
MetLys: 1.342 ± 0.371
MetLeu: 1.678 ± 0.814
MetMet: 0.503 ± 0.207
MetAsn: 0.671 ± 0.376
MetPro: 0.671 ± 0.258
MetGln: 1.174 ± 0.501
MetArg: 1.678 ± 0.562
MetSer: 2.013 ± 0.485
MetThr: 0.839 ± 0.388
MetVal: 2.349 ± 0.456
MetTrp: 0.0 ± 0.0
MetTyr: 1.174 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.02 ± 0.556
AsnCys: 1.007 ± 0.27
AsnAsp: 3.691 ± 1.651
AsnGlu: 2.685 ± 0.564
AsnPhe: 1.678 ± 0.368
AsnGly: 2.517 ± 0.439
AsnHis: 0.503 ± 0.538
AsnIle: 2.349 ± 0.457
AsnLys: 2.852 ± 0.542
AsnLeu: 4.195 ± 0.744
AsnMet: 1.007 ± 0.632
AsnAsn: 2.349 ± 0.917
AsnPro: 2.181 ± 0.732
AsnGln: 1.174 ± 0.398
AsnArg: 2.181 ± 0.569
AsnSer: 2.685 ± 0.835
AsnThr: 2.013 ± 1.092
AsnVal: 4.027 ± 0.674
AsnTrp: 0.503 ± 0.484
AsnTyr: 2.013 ± 0.878
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.852 ± 0.9
ProCys: 0.168 ± 0.097
ProAsp: 3.02 ± 0.707
ProGlu: 1.51 ± 0.869
ProPhe: 1.007 ± 0.332
ProGly: 2.852 ± 0.589
ProHis: 0.671 ± 0.287
ProIle: 1.678 ± 0.552
ProLys: 3.02 ± 1.174
ProLeu: 4.53 ± 0.852
ProMet: 0.503 ± 0.218
ProAsn: 1.007 ± 0.388
ProPro: 2.181 ± 0.783
ProGln: 1.007 ± 0.324
ProArg: 2.181 ± 0.451
ProSer: 2.685 ± 1.371
ProThr: 1.007 ± 0.308
ProVal: 5.034 ± 1.63
ProTrp: 0.839 ± 0.356
ProTyr: 1.678 ± 0.559
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.342 ± 0.414
GlnCys: 0.671 ± 0.274
GlnAsp: 0.839 ± 0.393
GlnGlu: 1.342 ± 0.301
GlnPhe: 1.007 ± 0.392
GlnGly: 1.174 ± 0.509
GlnHis: 0.168 ± 0.097
GlnIle: 1.007 ± 0.491
GlnLys: 1.174 ± 0.363
GlnLeu: 1.846 ± 0.385
GlnMet: 0.503 ± 0.381
GlnAsn: 1.342 ± 0.28
GlnPro: 0.839 ± 0.418
GlnGln: 0.336 ± 0.186
GlnArg: 2.852 ± 0.991
GlnSer: 1.51 ± 0.685
GlnThr: 1.174 ± 0.204
GlnVal: 1.678 ± 0.55
GlnTrp: 0.503 ± 0.449
GlnTyr: 0.839 ± 0.613
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.362 ± 1.319
ArgCys: 2.349 ± 0.784
ArgAsp: 3.691 ± 0.656
ArgGlu: 4.698 ± 0.874
ArgPhe: 3.188 ± 0.726
ArgGly: 4.027 ± 0.812
ArgHis: 0.336 ± 0.193
ArgIle: 3.523 ± 0.557
ArgLys: 3.188 ± 0.775
ArgLeu: 4.362 ± 0.775
ArgMet: 1.342 ± 0.423
ArgAsn: 2.013 ± 0.55
ArgPro: 1.342 ± 0.592
ArgGln: 2.013 ± 0.461
ArgArg: 5.201 ± 0.882
ArgSer: 3.691 ± 0.39
ArgThr: 2.517 ± 0.651
ArgVal: 5.201 ± 1.102
ArgTrp: 0.671 ± 0.577
ArgTyr: 3.356 ± 0.97
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.544 ± 1.134
SerCys: 1.846 ± 0.605
SerAsp: 5.537 ± 0.409
SerGlu: 4.866 ± 0.537
SerPhe: 2.685 ± 0.71
SerGly: 4.866 ± 0.857
SerHis: 1.678 ± 0.617
SerIle: 3.859 ± 0.757
SerLys: 6.04 ± 0.714
SerLeu: 6.544 ± 1.021
SerMet: 1.846 ± 0.535
SerAsn: 4.53 ± 0.767
SerPro: 2.852 ± 0.931
SerGln: 1.678 ± 0.581
SerArg: 4.698 ± 0.942
SerSer: 8.221 ± 1.58
SerThr: 4.698 ± 0.883
SerVal: 7.55 ± 0.643
SerTrp: 1.678 ± 0.397
SerTyr: 2.685 ± 0.858
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.02 ± 0.542
ThrCys: 1.007 ± 0.374
ThrAsp: 1.846 ± 0.408
ThrGlu: 1.51 ± 0.7
ThrPhe: 2.013 ± 0.943
ThrGly: 2.181 ± 0.282
ThrHis: 1.174 ± 0.515
ThrIle: 2.852 ± 0.856
ThrLys: 2.685 ± 0.97
ThrLeu: 4.362 ± 0.717
ThrMet: 0.839 ± 0.331
ThrAsn: 2.517 ± 1.105
ThrPro: 3.02 ± 0.634
ThrGln: 1.342 ± 0.366
ThrArg: 3.523 ± 0.416
ThrSer: 4.027 ± 0.895
ThrThr: 2.181 ± 0.687
ThrVal: 5.369 ± 0.751
ThrTrp: 0.503 ± 0.29
ThrTyr: 2.181 ± 1.275
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.362 ± 0.426
ValCys: 3.523 ± 0.42
ValAsp: 5.201 ± 0.499
ValGlu: 5.034 ± 0.747
ValPhe: 4.027 ± 0.619
ValGly: 3.859 ± 0.464
ValHis: 2.181 ± 0.676
ValIle: 6.04 ± 0.668
ValLys: 4.866 ± 0.577
ValLeu: 7.215 ± 1.004
ValMet: 1.51 ± 0.399
ValAsn: 4.698 ± 1.086
ValPro: 2.852 ± 0.855
ValGln: 2.349 ± 0.701
ValArg: 6.711 ± 1.854
ValSer: 7.383 ± 1.555
ValThr: 3.859 ± 0.745
ValVal: 7.55 ± 1.075
ValTrp: 0.839 ± 0.279
ValTyr: 4.195 ± 1.276
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.671 ± 0.256
TrpCys: 0.168 ± 0.097
TrpAsp: 0.671 ± 0.283
TrpGlu: 0.671 ± 0.304
TrpPhe: 0.503 ± 0.245
TrpGly: 0.336 ± 0.193
TrpHis: 0.168 ± 0.202
TrpIle: 0.839 ± 0.632
TrpLys: 0.336 ± 0.355
TrpLeu: 1.342 ± 0.312
TrpMet: 0.839 ± 0.476
TrpAsn: 0.336 ± 0.186
TrpPro: 0.336 ± 0.193
TrpGln: 0.503 ± 0.207
TrpArg: 0.671 ± 0.256
TrpSer: 0.839 ± 0.487
TrpThr: 0.839 ± 0.273
TrpVal: 1.51 ± 0.456
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.168 ± 0.097
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.517 ± 0.551
TyrCys: 1.678 ± 0.361
TyrAsp: 3.188 ± 0.822
TyrGlu: 1.007 ± 0.275
TyrPhe: 3.02 ± 0.518
TyrGly: 2.685 ± 0.674
TyrHis: 1.174 ± 0.702
TyrIle: 2.349 ± 1.136
TyrLys: 2.685 ± 1.075
TyrLeu: 5.537 ± 0.812
TyrMet: 0.503 ± 0.269
TyrAsn: 3.188 ± 1.321
TyrPro: 1.51 ± 0.461
TyrGln: 0.839 ± 0.356
TyrArg: 2.181 ± 0.677
TyrSer: 4.195 ± 1.462
TyrThr: 1.846 ± 0.561
TyrVal: 2.349 ± 1.866
TyrTrp: 0.168 ± 0.224
TyrTyr: 2.852 ± 0.436
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (5961 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
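A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions rather than the procedure actually used for this table: the function names are hypothetical, the ± value is taken here to be the standard deviation across replicates, and pairs are counted within each protein only. The page itself specifies only that 100 replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Dipeptide frequencies in per mille over adjacent pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_sd(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's frequency under protein-level resampling.

    Whole proteins are resampled with replacement (not single residues),
    and the frequency is recomputed for every replicate.
    Assumption: the reported error is the standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)
```

Resampling whole proteins keeps the dependence between neighbouring residues intact, which is why a small proteome such as this one (12 proteins) can show errors that are large relative to the values themselves.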

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski