Amino acid dipeptide frequency for Iris yellow spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
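The expected value and the over/under-representation split can be checked with a few lines of Python. This is only a minimal sketch: the function name `classify` is hypothetical, and a strict comparison against the uniform 2.5‰ baseline is an assumption, since the page does not state the exact threshold used for the colouring. The three example values are copied from the table on this page.

```python
# Uniform expectation: each of the 20 x 20 standard dipeptides is equally likely.
N_STANDARD = 20
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform baseline (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Values in per mille, taken from the table on this page.
examples = {"LeuSer": 12.65, "AlaTrp": 0.154, "AlaAla": 2.16}
labels = {dipeptide: classify(value) for dipeptide, value in examples.items()}
print(labels)
```

With this assumed threshold, LeuSer falls in the "over" group, while AlaTrp and AlaAla fall in the "under" group.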

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
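As an illustration of how such per mille values can be obtained, the sketch below counts overlapping dipeptides within each protein and normalises by the total number of dipeptides. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the pooling of all proteins before normalising, and the toy sequences (one-letter codes) are assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Dipeptides are counted as overlapping pairs inside each sequence;
    pairs spanning two different proteins are never counted.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Made-up toy sequences, not taken from the proteome on this page.
toy = ["MKLSLS", "MKK"]
freqs = dipeptide_permille(toy)

# Multiplying a per mille value by 1e-3 gives the plain fraction.
fraction_mk = freqs["MK"] * 1e-3
print(round(freqs["MK"], 3), round(fraction_mk, 6))
```

The per mille values of all observed dipeptides sum to 1000, which is a quick sanity check on any such table.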

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.16 ± 1.764
AlaCys: 2.006 ± 0.348
AlaAsp: 1.543 ± 0.3
AlaGlu: 2.623 ± 0.275
AlaPhe: 1.851 ± 0.353
AlaGly: 0.771 ± 0.286
AlaHis: 0.771 ± 0.349
AlaIle: 3.857 ± 1.428
AlaLys: 2.623 ± 0.765
AlaLeu: 3.857 ± 0.696
AlaMet: 1.08 ± 0.355
AlaAsn: 2.314 ± 0.49
AlaPro: 1.234 ± 0.506
AlaGln: 0.309 ± 0.197
AlaArg: 0.617 ± 1.005
AlaSer: 4.011 ± 1.425
AlaThr: 2.006 ± 0.59
AlaVal: 3.394 ± 0.479
AlaTrp: 0.154 ± 0.099
AlaTyr: 0.771 ± 0.615
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.771 ± 0.286
CysCys: 0.309 ± 0.197
CysAsp: 1.234 ± 0.769
CysGlu: 0.771 ± 0.249
CysPhe: 1.851 ± 0.297
CysGly: 1.388 ± 0.842
CysHis: 0.154 ± 0.099
CysIle: 3.24 ± 0.702
CysLys: 1.543 ± 0.348
CysLeu: 2.006 ± 0.579
CysMet: 0.463 ± 0.313
CysAsn: 1.08 ± 0.534
CysPro: 0.463 ± 0.296
CysGln: 0.617 ± 0.28
CysArg: 1.388 ± 0.472
CysSer: 2.623 ± 1.133
CysThr: 1.697 ± 0.597
CysVal: 2.006 ± 1.019
CysTrp: 0.309 ± 0.278
CysTyr: 0.617 ± 0.264
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.543 ± 0.377
AspCys: 2.623 ± 0.592
AspAsp: 4.011 ± 0.718
AspGlu: 4.32 ± 0.845
AspPhe: 4.011 ± 0.918
AspGly: 2.931 ± 0.622
AspHis: 0.926 ± 0.481
AspIle: 4.782 ± 0.423
AspLys: 4.628 ± 0.828
AspLeu: 5.554 ± 1.14
AspMet: 2.623 ± 0.967
AspAsn: 2.16 ± 0.811
AspPro: 2.468 ± 0.693
AspGln: 2.468 ± 0.574
AspArg: 2.931 ± 0.425
AspSer: 5.862 ± 1.491
AspThr: 2.777 ± 0.757
AspVal: 4.474 ± 0.852
AspTrp: 0.617 ± 0.314
AspTyr: 1.851 ± 0.528
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.006 ± 0.795
GluCys: 2.16 ± 0.646
GluAsp: 4.165 ± 0.591
GluGlu: 5.245 ± 0.65
GluPhe: 4.011 ± 1.234
GluGly: 2.006 ± 1.162
GluHis: 0.771 ± 0.361
GluIle: 4.32 ± 1.173
GluLys: 5.862 ± 1.795
GluLeu: 5.862 ± 0.322
GluMet: 2.314 ± 0.648
GluAsn: 5.554 ± 0.832
GluPro: 1.08 ± 0.308
GluGln: 1.08 ± 0.577
GluArg: 1.388 ± 0.243
GluSer: 4.937 ± 0.599
GluThr: 3.548 ± 0.773
GluVal: 2.314 ± 0.611
GluTrp: 0.309 ± 0.278
GluTyr: 2.931 ± 0.721
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.006 ± 0.504
PheCys: 1.543 ± 0.334
PheAsp: 4.165 ± 0.609
PheGlu: 1.851 ± 0.733
PhePhe: 2.777 ± 1.017
PheGly: 2.468 ± 0.3
PheHis: 0.926 ± 0.445
PheIle: 2.314 ± 0.439
PheLys: 4.32 ± 0.757
PheLeu: 4.628 ± 0.828
PheMet: 1.697 ± 0.321
PheAsn: 2.931 ± 1.326
PhePro: 2.314 ± 0.497
PheGln: 2.006 ± 0.561
PheArg: 1.543 ± 0.642
PheSer: 5.4 ± 0.822
PheThr: 2.314 ± 0.583
PheVal: 2.777 ± 0.503
PheTrp: 0.0 ± 0.0
PheTyr: 1.388 ± 0.385
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.697 ± 0.654
GlyCys: 2.006 ± 1.019
GlyAsp: 2.468 ± 0.685
GlyGlu: 3.085 ± 0.568
GlyPhe: 2.314 ± 0.699
GlyGly: 1.543 ± 0.615
GlyHis: 1.234 ± 0.399
GlyIle: 3.085 ± 0.591
GlyLys: 3.548 ± 1.009
GlyLeu: 3.24 ± 0.467
GlyMet: 0.926 ± 0.503
GlyAsn: 3.548 ± 0.742
GlyPro: 0.771 ± 0.286
GlyGln: 0.771 ± 0.255
GlyArg: 0.771 ± 0.79
GlySer: 4.165 ± 1.015
GlyThr: 2.16 ± 0.821
GlyVal: 2.16 ± 0.411
GlyTrp: 0.309 ± 0.197
GlyTyr: 2.777 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.771 ± 0.401
HisCys: 0.309 ± 0.2
HisAsp: 1.08 ± 0.405
HisGlu: 0.771 ± 0.349
HisPhe: 1.543 ± 0.431
HisGly: 0.617 ± 0.177
HisHis: 0.309 ± 0.2
HisIle: 1.08 ± 0.323
HisLys: 0.771 ± 0.361
HisLeu: 1.234 ± 0.966
HisMet: 0.463 ± 0.296
HisAsn: 1.543 ± 0.461
HisPro: 0.771 ± 0.237
HisGln: 0.154 ± 0.099
HisArg: 0.309 ± 0.197
HisSer: 1.543 ± 0.461
HisThr: 0.771 ± 0.371
HisVal: 0.463 ± 0.296
HisTrp: 0.0 ± 0.0
HisTyr: 0.617 ± 0.261
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.548 ± 1.328
IleCys: 1.697 ± 0.292
IleAsp: 5.554 ± 0.844
IleGlu: 4.628 ± 0.9
IlePhe: 2.314 ± 0.47
IleGly: 3.085 ± 0.731
IleHis: 1.234 ± 0.497
IleIle: 4.32 ± 1.046
IleLys: 10.182 ± 1.166
IleLeu: 7.251 ± 0.455
IleMet: 1.388 ± 0.598
IleAsn: 4.32 ± 0.967
IlePro: 4.165 ± 1.563
IleGln: 1.851 ± 0.633
IleArg: 1.851 ± 0.626
IleSer: 7.251 ± 0.983
IleThr: 4.937 ± 1.085
IleVal: 2.777 ± 1.2
IleTrp: 0.463 ± 0.161
IleTyr: 3.24 ± 0.624
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.32 ± 1.039
LysCys: 2.16 ± 0.823
LysAsp: 7.097 ± 0.62
LysGlu: 7.097 ± 1.064
LysPhe: 2.931 ± 1.126
LysGly: 5.091 ± 0.606
LysHis: 1.234 ± 0.497
LysIle: 7.559 ± 1.543
LysLys: 5.708 ± 0.681
LysLeu: 7.097 ± 1.281
LysMet: 3.24 ± 0.165
LysAsn: 6.171 ± 0.881
LysPro: 2.006 ± 1.095
LysGln: 1.543 ± 0.722
LysArg: 3.703 ± 0.437
LysSer: 8.022 ± 1.367
LysThr: 8.022 ± 2.001
LysVal: 6.171 ± 0.542
LysTrp: 0.617 ± 0.261
LysTyr: 2.931 ± 0.573
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.32 ± 0.479
LeuCys: 1.543 ± 0.386
LeuAsp: 3.857 ± 0.526
LeuGlu: 5.554 ± 0.768
LeuPhe: 2.777 ± 0.811
LeuGly: 3.857 ± 0.672
LeuHis: 1.388 ± 0.482
LeuIle: 7.868 ± 0.991
LeuLys: 8.485 ± 1.47
LeuLeu: 5.862 ± 1.302
LeuMet: 5.245 ± 1.23
LeuAsn: 5.554 ± 1.489
LeuPro: 1.543 ± 0.916
LeuGln: 2.006 ± 0.638
LeuArg: 2.623 ± 0.937
LeuSer: 12.65 ± 1.108
LeuThr: 5.091 ± 0.908
LeuVal: 3.548 ± 1.272
LeuTrp: 1.08 ± 0.546
LeuTyr: 3.085 ± 0.883
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.926 ± 0.582
MetCys: 0.154 ± 0.099
MetAsp: 2.468 ± 0.638
MetGlu: 2.468 ± 0.5
MetPhe: 0.771 ± 0.349
MetGly: 1.234 ± 0.492
MetHis: 0.617 ± 0.817
MetIle: 3.24 ± 0.399
MetLys: 2.314 ± 1.102
MetLeu: 2.931 ± 1.267
MetMet: 1.234 ± 0.789
MetAsn: 2.468 ± 0.74
MetPro: 1.234 ± 0.353
MetGln: 0.771 ± 0.333
MetArg: 1.08 ± 0.691
MetSer: 3.703 ± 0.785
MetThr: 1.851 ± 0.33
MetVal: 1.08 ± 0.55
MetTrp: 0.154 ± 0.099
MetTyr: 1.08 ± 0.425
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.006 ± 0.575
AsnCys: 0.771 ± 0.237
AsnAsp: 4.165 ± 0.445
AsnGlu: 4.474 ± 1.69
AsnPhe: 3.857 ± 0.951
AsnGly: 2.16 ± 0.864
AsnHis: 0.617 ± 0.613
AsnIle: 3.857 ± 0.752
AsnLys: 6.017 ± 1.279
AsnLeu: 6.634 ± 1.442
AsnMet: 1.697 ± 0.389
AsnAsn: 2.16 ± 0.512
AsnPro: 2.006 ± 0.569
AsnGln: 3.085 ± 0.93
AsnArg: 2.16 ± 0.857
AsnSer: 3.857 ± 1.574
AsnThr: 2.931 ± 1.061
AsnVal: 3.548 ± 0.75
AsnTrp: 1.234 ± 0.57
AsnTyr: 3.24 ± 1.379
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.771 ± 0.464
ProCys: 0.0 ± 0.0
ProAsp: 0.926 ± 0.382
ProGlu: 2.006 ± 0.726
ProPhe: 1.388 ± 0.406
ProGly: 1.543 ± 0.434
ProHis: 0.0 ± 0.0
ProIle: 2.777 ± 0.912
ProLys: 4.32 ± 1.271
ProLeu: 2.468 ± 0.857
ProMet: 0.617 ± 0.177
ProAsn: 1.543 ± 0.434
ProPro: 0.617 ± 0.4
ProGln: 0.926 ± 0.322
ProArg: 0.926 ± 0.6
ProSer: 4.32 ± 1.396
ProThr: 1.697 ± 0.7
ProVal: 2.314 ± 0.481
ProTrp: 0.0 ± 0.0
ProTyr: 1.234 ± 0.497
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.08 ± 0.323
GlnCys: 0.771 ± 0.237
GlnAsp: 0.926 ± 0.302
GlnGlu: 1.08 ± 0.242
GlnPhe: 1.234 ± 0.542
GlnGly: 1.08 ± 0.798
GlnHis: 0.154 ± 0.099
GlnIle: 2.006 ± 0.285
GlnLys: 2.314 ± 0.56
GlnLeu: 1.697 ± 0.378
GlnMet: 1.08 ± 0.526
GlnAsn: 1.388 ± 0.534
GlnPro: 0.617 ± 0.177
GlnGln: 0.0 ± 0.0
GlnArg: 0.926 ± 0.592
GlnSer: 3.24 ± 0.981
GlnThr: 1.08 ± 0.323
GlnVal: 2.314 ± 1.048
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.926 ± 0.414
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.851 ± 0.617
ArgCys: 0.309 ± 0.348
ArgAsp: 1.388 ± 0.591
ArgGlu: 1.234 ± 0.353
ArgPhe: 0.926 ± 0.458
ArgGly: 0.771 ± 0.249
ArgHis: 0.926 ± 0.317
ArgIle: 2.006 ± 0.302
ArgLys: 3.857 ± 0.516
ArgLeu: 3.857 ± 0.462
ArgMet: 0.617 ± 0.353
ArgAsn: 3.085 ± 0.322
ArgPro: 0.463 ± 0.279
ArgGln: 1.543 ± 0.366
ArgArg: 1.388 ± 0.538
ArgSer: 1.697 ± 0.353
ArgThr: 2.931 ± 0.925
ArgVal: 2.623 ± 0.536
ArgTrp: 0.463 ± 0.296
ArgTyr: 1.851 ± 0.581
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.931 ± 0.714
SerCys: 1.851 ± 0.72
SerAsp: 6.942 ± 0.825
SerGlu: 5.708 ± 1.701
SerPhe: 4.628 ± 1.367
SerGly: 5.245 ± 0.332
SerHis: 1.851 ± 0.53
SerIle: 5.245 ± 1.643
SerLys: 9.102 ± 1.049
SerLeu: 9.411 ± 0.983
SerMet: 1.851 ± 0.594
SerAsn: 5.091 ± 1.393
SerPro: 1.697 ± 0.285
SerGln: 1.234 ± 0.647
SerArg: 5.091 ± 0.465
SerSer: 10.336 ± 1.395
SerThr: 7.405 ± 1.965
SerVal: 7.714 ± 1.659
SerTrp: 0.617 ± 0.362
SerTyr: 5.245 ± 0.56
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.851 ± 0.647
ThrCys: 1.543 ± 0.234
ThrAsp: 3.394 ± 0.673
ThrGlu: 3.703 ± 0.726
ThrPhe: 4.011 ± 0.666
ThrGly: 2.777 ± 0.443
ThrHis: 0.771 ± 0.333
ThrIle: 5.862 ± 1.381
ThrLys: 4.474 ± 0.566
ThrLeu: 5.554 ± 0.881
ThrMet: 0.617 ± 0.395
ThrAsn: 4.628 ± 1.067
ThrPro: 2.468 ± 0.914
ThrGln: 1.234 ± 0.769
ThrArg: 0.771 ± 0.355
ThrSer: 4.782 ± 0.516
ThrThr: 4.32 ± 0.778
ThrVal: 4.011 ± 1.189
ThrTrp: 1.08 ± 0.387
ThrTyr: 1.697 ± 0.439
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.543 ± 0.371
ValCys: 1.234 ± 0.353
ValAsp: 4.32 ± 0.973
ValGlu: 3.548 ± 1.168
ValPhe: 3.085 ± 0.372
ValGly: 2.16 ± 0.763
ValHis: 0.771 ± 0.371
ValIle: 4.937 ± 0.9
ValLys: 6.479 ± 0.372
ValLeu: 4.782 ± 0.788
ValMet: 2.16 ± 0.701
ValAsn: 2.314 ± 0.661
ValPro: 3.703 ± 0.703
ValGln: 0.926 ± 0.371
ValArg: 2.931 ± 0.654
ValSer: 5.862 ± 0.987
ValThr: 2.623 ± 0.333
ValVal: 3.394 ± 0.291
ValTrp: 0.463 ± 0.313
ValTyr: 2.931 ± 0.602
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.154 ± 0.099
TrpAsp: 0.771 ± 0.29
TrpGlu: 0.309 ± 0.16
TrpPhe: 0.463 ± 0.295
TrpGly: 0.309 ± 0.2
TrpHis: 0.0 ± 0.0
TrpIle: 0.617 ± 0.261
TrpLys: 1.388 ± 0.614
TrpLeu: 0.617 ± 0.177
TrpMet: 0.309 ± 0.197
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.154 ± 0.145
TrpSer: 1.543 ± 0.377
TrpThr: 0.154 ± 0.322
TrpVal: 1.08 ± 0.703
TrpTrp: 0.309 ± 0.2
TrpTyr: 0.309 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.851 ± 0.32
TyrCys: 1.388 ± 0.406
TyrAsp: 3.085 ± 0.598
TyrGlu: 1.388 ± 0.747
TyrPhe: 2.623 ± 0.399
TyrGly: 1.543 ± 0.434
TyrHis: 0.463 ± 0.279
TyrIle: 3.085 ± 0.375
TyrLys: 4.782 ± 1.117
TyrLeu: 3.548 ± 0.777
TyrMet: 2.006 ± 0.335
TyrAsn: 2.623 ± 0.709
TyrPro: 0.771 ± 0.215
TyrGln: 1.388 ± 0.472
TyrArg: 1.234 ± 0.259
TyrSer: 3.394 ± 0.418
TyrThr: 1.234 ± 0.352
TyrVal: 2.006 ± 0.977
TyrTrp: 0.309 ± 0.278
TyrTyr: 1.851 ± 0.633
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (6483 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
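A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 replicates is reported. The sketch is not the Proteome-pI code; the function names `permille` and `bootstrap_error`, the use of the standard deviation as the error measure, the fixed seed, and the toy sequences (one-letter codes) are all assumptions.

```python
import random
import statistics


def permille(sequences, dipeptide):
    """Per mille frequency of one dipeptide, pooled over the given sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Made-up toy proteome, not the seven proteins behind the table.
toy = ["MKLSLS", "MKK", "LSLSKK", "MSSK"]
error_ls = bootstrap_error(toy, "LS")
print(permille(toy, "LS"), error_ls)
```

Because entire proteins are resampled, a dipeptide concentrated in a single protein gets a large error. With only 7 proteins in this proteome, that may explain why some errors in the table are comparable to, or larger than, the value itself.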

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski