Amino acid dipeptide frequency for Donkey orchid symptomless virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
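A minimal sketch of how such a frequency could be computed is shown below. It assumes one-letter sequence codes, overlapping windows of two residues, and that pairs are never counted across protein boundaries; the function name `dipeptide_per_mille` is hypothetical, and the exact counting rules used by the database are not stated on this page.

```python
from collections import Counter

# The 20 standard residues in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within one protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (not taken from the proteome); U is non-standard and becomes X.
freqs = dipeptide_per_mille(["MASLLA", "ALU"])
```

With these toy sequences there are seven pairs in total, so every observed dipeptide has a frequency of 1000/7 ‰, and no pair is formed between the last residue of the first sequence and the first residue of the second.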

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
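The expected value follows directly from the number of possible pairs. The sketch below works in per mille, so the uniform expectation is 2.5‰ for 20 residues; the function names and the simple above/below comparison are assumptions for illustration, since the exact rule behind the red and blue marking is not given on this page.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

# Values taken from the table on this page.
print(expected_per_mille())      # 20 residues, 400 pairs
print(expected_per_mille(21))    # 21 residues (with Xaa), 441 pairs
print(classify(9.618))           # AlaLeu
print(classify(0.0))             # CysCys
```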

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
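As a quick illustration of the unit conversion, the following sketch turns a tabulated per mille value into a fraction and a percentage. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0

# AlaAla from the table: 6.092 per mille.
ala_ala_fraction = per_mille_to_fraction(6.092)
ala_ala_percent = per_mille_to_percent(6.092)
```

Here 6.092‰ corresponds to a fraction of about 0.006092, or about 0.6092%.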

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.092 ± 2.407
AlaCys: 1.924 ± 0.702
AlaAsp: 4.809 ± 1.2
AlaGlu: 3.527 ± 1.225
AlaPhe: 2.886 ± 1.394
AlaGly: 5.45 ± 1.05
AlaHis: 2.886 ± 0.52
AlaIle: 6.412 ± 1.912
AlaLys: 4.168 ± 0.992
AlaLeu: 9.618 ± 1.776
AlaMet: 1.603 ± 0.807
AlaAsn: 3.527 ± 0.259
AlaPro: 3.847 ± 1.765
AlaGln: 2.886 ± 1.161
AlaArg: 3.847 ± 1.547
AlaSer: 6.733 ± 2.516
AlaThr: 6.733 ± 1.309
AlaVal: 2.244 ± 0.624
AlaTrp: 0.962 ± 0.444
AlaTyr: 3.527 ± 0.754
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.962 ± 0.476
CysCys: 0.0 ± 0.0
CysAsp: 1.603 ± 0.797
CysGlu: 0.641 ± 0.422
CysPhe: 1.924 ± 0.699
CysGly: 0.321 ± 0.211
CysHis: 0.962 ± 0.624
CysIle: 0.321 ± 0.378
CysLys: 0.641 ± 0.471
CysLeu: 1.282 ± 0.308
CysMet: 0.962 ± 0.673
CysAsn: 0.962 ± 0.633
CysPro: 2.565 ± 0.782
CysGln: 0.0 ± 0.0
CysArg: 1.924 ± 0.884
CysSer: 2.886 ± 1.313
CysThr: 1.603 ± 1.223
CysVal: 1.282 ± 0.511
CysTrp: 0.641 ± 0.817
CysTyr: 0.962 ± 0.597
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.809 ± 1.305
AspCys: 0.641 ± 0.471
AspAsp: 3.527 ± 1.662
AspGlu: 4.489 ± 1.719
AspPhe: 0.962 ± 0.514
AspGly: 2.565 ± 0.954
AspHis: 1.282 ± 0.511
AspIle: 1.924 ± 0.765
AspLys: 0.641 ± 0.422
AspLeu: 4.168 ± 1.358
AspMet: 0.321 ± 0.211
AspAsn: 2.886 ± 1.875
AspPro: 2.886 ± 1.236
AspGln: 1.282 ± 0.638
AspArg: 2.886 ± 0.841
AspSer: 2.244 ± 1.433
AspThr: 2.565 ± 1.426
AspVal: 4.489 ± 1.779
AspTrp: 0.321 ± 0.211
AspTyr: 1.603 ± 0.797
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.527 ± 1.019
GluCys: 0.321 ± 0.211
GluAsp: 1.603 ± 0.934
GluGlu: 2.244 ± 1.011
GluPhe: 1.924 ± 0.968
GluGly: 2.565 ± 1.371
GluHis: 2.886 ± 0.927
GluIle: 3.527 ± 0.947
GluLys: 2.244 ± 0.517
GluLeu: 6.412 ± 2.07
GluMet: 0.321 ± 0.497
GluAsn: 2.565 ± 0.585
GluPro: 3.527 ± 0.921
GluGln: 2.565 ± 0.678
GluArg: 2.886 ± 1.168
GluSer: 1.282 ± 0.646
GluThr: 6.412 ± 1.022
GluVal: 3.847 ± 1.268
GluTrp: 0.641 ± 0.671
GluTyr: 1.924 ± 1.266
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.924 ± 1.218
PheCys: 1.282 ± 0.526
PheAsp: 3.206 ± 1.166
PheGlu: 1.924 ± 0.774
PhePhe: 0.962 ± 0.565
PheGly: 1.282 ± 0.511
PheHis: 0.962 ± 0.431
PheIle: 1.603 ± 1.224
PheLys: 2.244 ± 0.73
PheLeu: 1.924 ± 0.686
PheMet: 0.0 ± 0.0
PheAsn: 1.603 ± 1.055
PhePro: 2.565 ± 0.779
PheGln: 1.924 ± 1.642
PheArg: 1.603 ± 0.723
PheSer: 3.527 ± 0.754
PheThr: 2.244 ± 0.697
PheVal: 1.924 ± 0.99
PheTrp: 0.321 ± 0.378
PheTyr: 0.641 ± 0.422
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.489 ± 1.127
GlyCys: 0.321 ± 0.211
GlyAsp: 2.244 ± 1.1
GlyGlu: 2.886 ± 1.031
GlyPhe: 1.603 ± 0.537
GlyGly: 0.962 ± 0.456
GlyHis: 1.282 ± 0.524
GlyIle: 2.565 ± 1.114
GlyLys: 1.924 ± 0.541
GlyLeu: 3.206 ± 1.115
GlyMet: 0.641 ± 0.525
GlyAsn: 1.603 ± 1.022
GlyPro: 3.527 ± 0.488
GlyGln: 1.282 ± 0.876
GlyArg: 2.565 ± 0.751
GlySer: 4.489 ± 1.848
GlyThr: 4.168 ± 1.131
GlyVal: 2.244 ± 0.666
GlyTrp: 0.641 ± 0.415
GlyTyr: 1.603 ± 0.778
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.527 ± 1.254
HisCys: 0.962 ± 0.5
HisAsp: 2.244 ± 1.302
HisGlu: 1.924 ± 1.009
HisPhe: 1.282 ± 0.559
HisGly: 1.603 ± 0.799
HisHis: 2.244 ± 0.94
HisIle: 1.924 ± 0.7
HisLys: 1.603 ± 0.699
HisLeu: 3.847 ± 1.268
HisMet: 0.321 ± 0.211
HisAsn: 0.641 ± 0.691
HisPro: 2.244 ± 0.756
HisGln: 1.924 ± 0.491
HisArg: 1.924 ± 0.782
HisSer: 2.886 ± 1.727
HisThr: 3.206 ± 0.426
HisVal: 2.244 ± 1.205
HisTrp: 0.0 ± 0.0
HisTyr: 1.603 ± 0.934
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.924 ± 0.942
IleCys: 0.962 ± 0.758
IleAsp: 3.206 ± 1.777
IleGlu: 4.489 ± 1.584
IlePhe: 1.282 ± 0.308
IleGly: 2.565 ± 0.933
IleHis: 1.924 ± 0.598
IleIle: 3.206 ± 1.086
IleLys: 2.886 ± 1.04
IleLeu: 3.206 ± 0.826
IleMet: 1.603 ± 0.588
IleAsn: 2.244 ± 0.871
IlePro: 4.809 ± 1.407
IleGln: 1.282 ± 0.602
IleArg: 2.886 ± 0.689
IleSer: 4.489 ± 1.028
IleThr: 3.206 ± 1.353
IleVal: 2.565 ± 0.812
IleTrp: 0.321 ± 0.211
IleTyr: 0.641 ± 0.378
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.244 ± 0.694
LysCys: 0.962 ± 0.571
LysAsp: 2.565 ± 0.835
LysGlu: 2.244 ± 0.598
LysPhe: 0.962 ± 0.35
LysGly: 0.962 ± 0.802
LysHis: 2.244 ± 0.799
LysIle: 2.886 ± 0.762
LysLys: 0.641 ± 0.27
LysLeu: 4.809 ± 1.507
LysMet: 0.0 ± 0.0
LysAsn: 3.206 ± 0.523
LysPro: 3.847 ± 0.797
LysGln: 0.321 ± 0.378
LysArg: 2.886 ± 0.565
LysSer: 2.565 ± 0.779
LysThr: 2.886 ± 0.718
LysVal: 2.565 ± 1.242
LysTrp: 0.321 ± 0.335
LysTyr: 1.282 ± 0.646
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.695 ± 1.534
LeuCys: 2.886 ± 0.335
LeuAsp: 1.603 ± 0.615
LeuGlu: 5.45 ± 0.716
LeuPhe: 1.603 ± 0.686
LeuGly: 6.412 ± 1.588
LeuHis: 4.168 ± 1.081
LeuIle: 3.847 ± 1.109
LeuLys: 3.527 ± 0.681
LeuLeu: 8.977 ± 3.403
LeuMet: 0.962 ± 0.711
LeuAsn: 4.168 ± 1.724
LeuPro: 7.054 ± 1.726
LeuGln: 4.168 ± 0.659
LeuArg: 5.771 ± 0.943
LeuSer: 9.618 ± 1.094
LeuThr: 6.733 ± 2.044
LeuVal: 2.244 ± 0.783
LeuTrp: 0.321 ± 0.211
LeuTyr: 4.168 ± 1.966
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.282 ± 0.44
MetCys: 0.0 ± 0.0
MetAsp: 0.321 ± 0.378
MetGlu: 1.603 ± 0.594
MetPhe: 0.641 ± 0.422
MetGly: 0.962 ± 0.571
MetHis: 0.641 ± 0.459
MetIle: 0.0 ± 0.0
MetLys: 0.641 ± 0.459
MetLeu: 2.244 ± 0.365
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 2.565 ± 2.229
MetSer: 2.565 ± 1.333
MetThr: 0.962 ± 0.571
MetVal: 0.321 ± 0.378
MetTrp: 0.0 ± 0.0
MetTyr: 0.641 ± 0.589
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.45 ± 1.041
AsnCys: 0.962 ± 0.35
AsnAsp: 2.244 ± 0.849
AsnGlu: 1.282 ± 0.698
AsnPhe: 1.282 ± 0.698
AsnGly: 1.282 ± 0.756
AsnHis: 1.924 ± 0.86
AsnIle: 2.244 ± 1.125
AsnLys: 0.962 ± 0.622
AsnLeu: 1.924 ± 0.95
AsnMet: 0.962 ± 0.455
AsnAsn: 1.282 ± 0.876
AsnPro: 3.206 ± 1.055
AsnGln: 2.244 ± 0.862
AsnArg: 1.924 ± 0.586
AsnSer: 4.809 ± 1.503
AsnThr: 3.847 ± 1.08
AsnVal: 0.321 ± 0.408
AsnTrp: 1.282 ± 0.443
AsnTyr: 1.603 ± 0.547
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.771 ± 1.048
ProCys: 0.962 ± 0.711
ProAsp: 2.565 ± 1.44
ProGlu: 4.809 ± 1.202
ProPhe: 2.244 ± 0.379
ProGly: 3.206 ± 1.072
ProHis: 1.924 ± 0.632
ProIle: 1.924 ± 0.771
ProLys: 4.168 ± 1.284
ProLeu: 5.771 ± 1.451
ProMet: 0.321 ± 0.211
ProAsn: 2.886 ± 1.313
ProPro: 8.015 ± 3.286
ProGln: 2.565 ± 0.66
ProArg: 3.206 ± 1.116
ProSer: 8.977 ± 3.55
ProThr: 7.054 ± 2.27
ProVal: 4.489 ± 0.575
ProTrp: 0.0 ± 0.0
ProTyr: 2.244 ± 0.497
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.092 ± 1.153
GlnCys: 1.282 ± 0.531
GlnAsp: 0.962 ± 0.633
GlnGlu: 2.244 ± 0.959
GlnPhe: 1.924 ± 1.08
GlnGly: 1.282 ± 0.566
GlnHis: 1.603 ± 1.055
GlnIle: 1.924 ± 0.655
GlnLys: 1.924 ± 0.41
GlnLeu: 3.847 ± 0.796
GlnMet: 1.603 ± 0.791
GlnAsn: 0.962 ± 0.456
GlnPro: 2.886 ± 1.052
GlnGln: 1.924 ± 1.028
GlnArg: 1.282 ± 0.308
GlnSer: 2.565 ± 1.546
GlnThr: 2.886 ± 0.852
GlnVal: 1.603 ± 0.811
GlnTrp: 0.321 ± 0.211
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.374 ± 1.524
ArgCys: 1.282 ± 0.382
ArgAsp: 1.603 ± 0.799
ArgGlu: 4.489 ± 1.061
ArgPhe: 1.924 ± 0.683
ArgGly: 1.924 ± 0.918
ArgHis: 2.565 ± 0.644
ArgIle: 3.206 ± 1.925
ArgLys: 1.924 ± 1.009
ArgLeu: 4.809 ± 0.669
ArgMet: 1.282 ± 0.897
ArgAsn: 0.962 ± 0.39
ArgPro: 3.847 ± 1.915
ArgGln: 3.847 ± 0.328
ArgArg: 5.45 ± 0.667
ArgSer: 4.168 ± 1.212
ArgThr: 6.092 ± 2.143
ArgVal: 3.527 ± 0.588
ArgTrp: 0.962 ± 0.456
ArgTyr: 2.565 ± 0.829
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.45 ± 1.097
SerCys: 3.527 ± 2.956
SerAsp: 4.168 ± 0.992
SerGlu: 1.924 ± 0.439
SerPhe: 4.809 ± 0.132
SerGly: 3.847 ± 1.658
SerHis: 1.924 ± 0.7
SerIle: 5.13 ± 1.501
SerLys: 4.489 ± 2.302
SerLeu: 7.695 ± 1.518
SerMet: 0.962 ± 0.819
SerAsn: 4.809 ± 1.11
SerPro: 6.092 ± 3.906
SerGln: 3.847 ± 1.652
SerArg: 6.092 ± 2.454
SerSer: 13.786 ± 10.202
SerThr: 8.657 ± 5.023
SerVal: 3.527 ± 1.151
SerTrp: 0.321 ± 0.211
SerTyr: 2.244 ± 0.892
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.336 ± 2.01
ThrCys: 1.603 ± 0.739
ThrAsp: 2.886 ± 1.371
ThrGlu: 1.924 ± 0.462
ThrPhe: 0.962 ± 0.633
ThrGly: 2.886 ± 0.7
ThrHis: 2.565 ± 0.468
ThrIle: 2.565 ± 0.8
ThrLys: 3.527 ± 0.808
ThrLeu: 10.58 ± 1.29
ThrMet: 2.565 ± 1.837
ThrAsn: 2.565 ± 0.851
ThrPro: 7.695 ± 3.826
ThrGln: 2.886 ± 0.61
ThrArg: 6.733 ± 0.58
ThrSer: 8.977 ± 6.549
ThrThr: 7.374 ± 4.102
ThrVal: 2.565 ± 0.886
ThrTrp: 0.321 ± 0.378
ThrTyr: 2.244 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.168 ± 1.895
ValCys: 0.962 ± 0.476
ValAsp: 3.206 ± 1.369
ValGlu: 2.565 ± 0.603
ValPhe: 2.886 ± 0.87
ValGly: 2.244 ± 0.799
ValHis: 1.282 ± 0.844
ValIle: 3.206 ± 1.103
ValLys: 1.924 ± 0.99
ValLeu: 3.527 ± 1.231
ValMet: 0.321 ± 0.335
ValAsn: 2.565 ± 0.786
ValPro: 3.527 ± 1.163
ValGln: 2.886 ± 1.111
ValArg: 3.527 ± 0.985
ValSer: 2.886 ± 1.132
ValThr: 1.924 ± 0.912
ValVal: 3.206 ± 1.094
ValTrp: 0.0 ± 0.0
ValTyr: 1.924 ± 0.979
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.962 ± 0.758
TrpCys: 0.0 ± 0.0
TrpAsp: 0.321 ± 0.378
TrpGlu: 0.641 ± 0.27
TrpPhe: 0.321 ± 0.408
TrpGly: 0.641 ± 0.27
TrpHis: 0.641 ± 0.57
TrpIle: 0.321 ± 0.211
TrpLys: 0.321 ± 0.211
TrpLeu: 0.641 ± 0.422
TrpMet: 0.0 ± 0.0
TrpAsn: 0.641 ± 0.364
TrpPro: 0.0 ± 0.0
TrpGln: 0.321 ± 0.211
TrpArg: 0.962 ± 0.35
TrpSer: 0.321 ± 0.335
TrpThr: 0.321 ± 0.335
TrpVal: 0.321 ± 0.497
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.603 ± 0.799
TyrCys: 1.603 ± 0.886
TyrAsp: 1.603 ± 0.615
TyrGlu: 1.924 ± 0.686
TyrPhe: 1.282 ± 0.646
TyrGly: 1.282 ± 0.524
TyrHis: 2.244 ± 1.27
TyrIle: 0.962 ± 0.456
TyrLys: 0.0 ± 0.0
TyrLeu: 2.886 ± 0.701
TyrMet: 0.321 ± 0.211
TyrAsn: 0.962 ± 0.5
TyrPro: 0.962 ± 0.633
TyrGln: 0.962 ± 0.571
TyrArg: 2.886 ± 0.817
TyrSer: 3.527 ± 1.133
TyrThr: 3.206 ± 0.577
TyrVal: 3.206 ± 1.122
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.282 ± 0.62
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3120 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
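One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not a description of the database's actual code; the function names, the use of the standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation (in per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy one-letter sequences, not taken from the proteome.
toy_proteins = ["MASLLA", "ALLSA", "MKLLT"]
error_ll = bootstrap_error(toy_proteins, "LL")
```

With only 6 proteins in this proteome, each resample contains few distinct proteins, which may help explain why some errors in the table are large relative to the values themselves (for example SerSer: 13.786 ± 10.202).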

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski