Amino acid dipeptide frequency for Sonchus yellow net virus (SYNV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
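As a minimal sketch of how such frequencies can be computed (not necessarily the exact procedure used by Proteome-pI), one can slide a window of two residues along each protein and count every adjacent pair. The function name `dipeptide_frequencies` and the toy sequences are hypothetical; one-letter codes are used, and any letter outside the 20 standard residues is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption of this sketch: pairs are counted within each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the SYNV proteome).
freqs = dipeptide_frequencies(["MSSLLK", "ISDU"])
```

Here the two toy sequences contain 8 overlapping pairs in total, so a pair that occurs once gets a frequency of 125‰.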

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
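The arithmetic above can be checked with a short sketch. The names `all_dipeptides`, `expected_uniform_per_mille` and `classify` are hypothetical, and whether the page uses exactly 2.5‰ as its colouring threshold is an assumption made here for illustration.

```python
from itertools import product

RESIDUES = ["Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
            "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr"]

def all_dipeptides(include_xaa=False):
    """Enumerate every ordered pair of residues (AlaCys differs from CysAla)."""
    residues = RESIDUES + (["Xaa"] if include_xaa else [])
    return [first + second for first, second in product(residues, repeat=2)]

def expected_uniform_per_mille(include_xaa=False):
    """Expected frequency (in per mille) if every dipeptide were equally likely."""
    return 1000.0 / len(all_dipeptides(include_xaa))

def classify(observed_per_mille, expected=2.5):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

n_standard = len(all_dipeptides())        # 20 * 20 ordered pairs
n_with_xaa = len(all_dipeptides(True))    # 21 * 21 ordered pairs
expected = expected_uniform_per_mille()   # 1000 / 400
```

With these definitions, a value such as 10.055‰ (IleSer in the table) would be labelled "over" and 0.239‰ (CysCys) "under".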

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
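For illustration, the unit conversion can be written out as follows. The helper names are hypothetical; the example value is the IleSer entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0

ile_ser_fraction = per_mille_to_fraction(10.055)  # IleSer value from the table
uniform_percent = per_mille_to_percent(2.5)       # uniform expectation for 400 dipeptides
```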

For more information, see the sequence space article on Wikipedia.

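Each group below is headed by the first residue of the dipeptide; within a group, entries follow this order of the second residue:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa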
Ala
AlaAla: 2.633 ± 0.758
AlaCys: 2.155 ± 0.559
AlaAsp: 2.394 ± 0.688
AlaGlu: 3.831 ± 1.366
AlaPhe: 1.676 ± 0.59
AlaGly: 2.155 ± 0.789
AlaHis: 0.479 ± 0.252
AlaIle: 4.07 ± 1.901
AlaLys: 2.394 ± 0.73
AlaLeu: 5.506 ± 0.928
AlaMet: 1.197 ± 1.096
AlaAsn: 2.155 ± 0.887
AlaPro: 1.676 ± 0.486
AlaGln: 1.676 ± 0.588
AlaArg: 2.155 ± 1.044
AlaSer: 4.549 ± 1.124
AlaThr: 4.07 ± 1.079
AlaVal: 3.352 ± 0.616
AlaTrp: 0.718 ± 0.433
AlaTyr: 1.676 ± 0.684
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.197 ± 0.647
CysCys: 0.239 ± 0.153
CysAsp: 1.436 ± 0.608
CysGlu: 0.718 ± 0.304
CysPhe: 0.718 ± 0.352
CysGly: 0.718 ± 0.555
CysHis: 0.479 ± 0.506
CysIle: 2.873 ± 1.013
CysLys: 0.958 ± 0.416
CysLeu: 1.676 ± 0.334
CysMet: 0.479 ± 0.266
CysAsn: 1.197 ± 0.814
CysPro: 0.958 ± 0.407
CysGln: 0.479 ± 0.305
CysArg: 0.239 ± 0.153
CysSer: 2.394 ± 0.636
CysThr: 1.436 ± 0.502
CysVal: 0.958 ± 0.45
CysTrp: 0.239 ± 0.153
CysTyr: 1.676 ± 1.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.506 ± 1.351
AspCys: 0.718 ± 0.304
AspAsp: 4.549 ± 0.956
AspGlu: 3.112 ± 0.724
AspPhe: 1.197 ± 0.321
AspGly: 3.591 ± 1.777
AspHis: 1.436 ± 0.608
AspIle: 6.703 ± 1.53
AspLys: 5.028 ± 0.729
AspLeu: 5.267 ± 1.22
AspMet: 3.352 ± 0.79
AspAsn: 5.028 ± 0.991
AspPro: 3.831 ± 0.876
AspGln: 1.915 ± 0.952
AspArg: 2.394 ± 0.835
AspSer: 3.591 ± 1.182
AspThr: 3.591 ± 1.619
AspVal: 3.831 ± 2.007
AspTrp: 0.239 ± 0.153
AspTyr: 2.394 ± 0.989
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.591 ± 0.423
GluCys: 0.479 ± 0.266
GluAsp: 4.788 ± 0.697
GluGlu: 3.591 ± 0.694
GluPhe: 1.197 ± 0.396
GluGly: 3.591 ± 0.979
GluHis: 1.197 ± 0.763
GluIle: 4.309 ± 1.934
GluLys: 2.633 ± 0.825
GluLeu: 3.831 ± 0.843
GluMet: 1.915 ± 0.242
GluAsn: 2.155 ± 0.615
GluPro: 1.197 ± 0.623
GluGln: 1.436 ± 0.454
GluArg: 2.873 ± 1.324
GluSer: 3.591 ± 0.581
GluThr: 2.394 ± 0.574
GluVal: 3.352 ± 1.729
GluTrp: 0.479 ± 0.414
GluTyr: 1.915 ± 0.454
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.436 ± 0.717
PheCys: 0.0 ± 0.0
PheAsp: 2.155 ± 0.807
PheGlu: 1.436 ± 0.36
PhePhe: 1.436 ± 0.467
PheGly: 1.197 ± 0.549
PheHis: 0.479 ± 0.255
PheIle: 1.915 ± 0.373
PheLys: 2.633 ± 0.656
PheLeu: 3.591 ± 1.154
PheMet: 0.958 ± 0.799
PheAsn: 1.915 ± 0.469
PhePro: 1.915 ± 0.537
PheGln: 1.676 ± 0.587
PheArg: 1.915 ± 0.557
PheSer: 3.112 ± 0.74
PheThr: 0.718 ± 0.304
PheVal: 1.197 ± 0.321
PheTrp: 0.718 ± 0.675
PheTyr: 2.155 ± 0.653
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.915 ± 0.443
GlyCys: 0.958 ± 0.735
GlyAsp: 4.07 ± 1.311
GlyGlu: 2.633 ± 0.396
GlyPhe: 2.633 ± 0.899
GlyGly: 3.831 ± 0.931
GlyHis: 2.155 ± 1.546
GlyIle: 4.788 ± 0.956
GlyLys: 3.591 ± 1.228
GlyLeu: 4.309 ± 0.773
GlyMet: 1.197 ± 0.326
GlyAsn: 2.394 ± 0.844
GlyPro: 1.676 ± 0.379
GlyGln: 0.958 ± 0.549
GlyArg: 2.633 ± 1.356
GlySer: 4.549 ± 1.494
GlyThr: 3.112 ± 0.696
GlyVal: 3.591 ± 1.079
GlyTrp: 0.958 ± 0.531
GlyTyr: 3.112 ± 1.219
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.915 ± 0.558
HisCys: 0.479 ± 0.305
HisAsp: 1.436 ± 0.673
HisGlu: 0.718 ± 0.41
HisPhe: 0.958 ± 0.61
HisGly: 2.394 ± 0.693
HisHis: 1.436 ± 0.826
HisIle: 1.676 ± 0.804
HisLys: 1.436 ± 0.607
HisLeu: 3.112 ± 1.05
HisMet: 0.479 ± 0.454
HisAsn: 0.958 ± 0.624
HisPro: 1.436 ± 0.532
HisGln: 0.239 ± 0.153
HisArg: 1.436 ± 0.567
HisSer: 2.633 ± 0.794
HisThr: 2.155 ± 0.606
HisVal: 0.958 ± 0.662
HisTrp: 0.479 ± 0.305
HisTyr: 1.676 ± 0.699
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.394 ± 0.933
IleCys: 2.633 ± 0.645
IleAsp: 4.07 ± 1.503
IleGlu: 3.591 ± 0.938
IlePhe: 3.352 ± 0.799
IleGly: 4.549 ± 0.749
IleHis: 0.958 ± 0.357
IleIle: 4.07 ± 1.269
IleLys: 5.746 ± 0.993
IleLeu: 8.14 ± 1.966
IleMet: 3.831 ± 0.883
IleAsn: 3.831 ± 1.023
IlePro: 2.633 ± 0.564
IleGln: 2.394 ± 0.605
IleArg: 3.591 ± 0.754
IleSer: 10.055 ± 2.145
IleThr: 6.225 ± 0.857
IleVal: 4.07 ± 1.283
IleTrp: 1.197 ± 0.535
IleTyr: 3.112 ± 0.556
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.831 ± 1.821
LysCys: 0.718 ± 0.346
LysAsp: 4.309 ± 0.454
LysGlu: 3.831 ± 0.791
LysPhe: 1.915 ± 0.671
LysGly: 3.831 ± 0.645
LysHis: 2.633 ± 1.029
LysIle: 5.985 ± 1.572
LysLys: 5.028 ± 1.09
LysLeu: 4.309 ± 1.303
LysMet: 2.155 ± 1.078
LysAsn: 2.873 ± 0.907
LysPro: 1.676 ± 0.35
LysGln: 1.436 ± 0.415
LysArg: 5.028 ± 1.045
LysSer: 3.831 ± 2.093
LysThr: 3.831 ± 0.835
LysVal: 3.352 ± 0.914
LysTrp: 0.718 ± 0.303
LysTyr: 3.352 ± 1.148
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.028 ± 1.135
LeuCys: 1.676 ± 0.805
LeuAsp: 5.028 ± 0.733
LeuGlu: 5.028 ± 0.89
LeuPhe: 3.591 ± 1.072
LeuGly: 4.788 ± 1.849
LeuHis: 2.633 ± 0.844
LeuIle: 6.225 ± 1.829
LeuLys: 5.746 ± 1.702
LeuLeu: 6.943 ± 1.186
LeuMet: 2.394 ± 0.928
LeuAsn: 4.07 ± 1.369
LeuPro: 3.352 ± 0.975
LeuGln: 1.915 ± 0.866
LeuArg: 4.07 ± 1.204
LeuSer: 9.097 ± 0.686
LeuThr: 5.985 ± 0.527
LeuVal: 3.831 ± 0.834
LeuTrp: 0.958 ± 0.284
LeuTyr: 4.309 ± 1.797
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.436 ± 0.731
MetCys: 0.479 ± 0.305
MetAsp: 0.958 ± 0.349
MetGlu: 1.436 ± 0.769
MetPhe: 0.718 ± 0.303
MetGly: 1.676 ± 0.686
MetHis: 0.0 ± 0.0
MetIle: 2.873 ± 0.672
MetLys: 2.633 ± 0.274
MetLeu: 2.155 ± 1.316
MetMet: 0.718 ± 0.872
MetAsn: 1.676 ± 0.643
MetPro: 0.479 ± 0.305
MetGln: 0.958 ± 0.406
MetArg: 2.155 ± 0.457
MetSer: 6.225 ± 1.458
MetThr: 2.394 ± 0.852
MetVal: 1.436 ± 0.459
MetTrp: 0.958 ± 0.284
MetTyr: 1.676 ± 0.695
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.915 ± 0.373
AsnCys: 0.0 ± 0.0
AsnAsp: 3.352 ± 1.069
AsnGlu: 2.873 ± 0.586
AsnPhe: 0.718 ± 0.303
AsnGly: 1.915 ± 0.772
AsnHis: 3.112 ± 0.669
AsnIle: 5.028 ± 1.099
AsnLys: 3.831 ± 0.948
AsnLeu: 4.788 ± 1.278
AsnMet: 1.915 ± 0.647
AsnAsn: 3.591 ± 0.756
AsnPro: 2.633 ± 0.617
AsnGln: 2.633 ± 1.016
AsnArg: 3.112 ± 0.688
AsnSer: 3.831 ± 0.839
AsnThr: 2.873 ± 1.477
AsnVal: 1.915 ± 0.597
AsnTrp: 0.718 ± 0.304
AsnTyr: 1.436 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.394 ± 0.651
ProCys: 1.197 ± 0.544
ProAsp: 2.155 ± 0.664
ProGlu: 1.197 ± 0.337
ProPhe: 1.436 ± 1.192
ProGly: 1.197 ± 0.321
ProHis: 1.197 ± 0.525
ProIle: 3.831 ± 0.714
ProLys: 2.155 ± 0.918
ProLeu: 2.394 ± 0.685
ProMet: 0.479 ± 0.255
ProAsn: 1.915 ± 0.729
ProPro: 1.676 ± 0.815
ProGln: 0.239 ± 0.309
ProArg: 1.676 ± 0.559
ProSer: 3.591 ± 1.383
ProThr: 3.591 ± 0.794
ProVal: 2.394 ± 0.282
ProTrp: 0.479 ± 0.255
ProTyr: 1.915 ± 0.747
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.197 ± 0.396
GlnCys: 1.197 ± 0.771
GlnAsp: 1.915 ± 0.391
GlnGlu: 2.394 ± 2.296
GlnPhe: 0.239 ± 0.153
GlnGly: 2.394 ± 0.754
GlnHis: 0.718 ± 0.303
GlnIle: 2.155 ± 0.505
GlnLys: 1.915 ± 0.572
GlnLeu: 2.155 ± 0.815
GlnMet: 0.479 ± 0.266
GlnAsn: 1.436 ± 0.647
GlnPro: 1.436 ± 0.415
GlnGln: 0.479 ± 0.419
GlnArg: 0.958 ± 0.451
GlnSer: 3.112 ± 1.05
GlnThr: 0.479 ± 0.582
GlnVal: 1.436 ± 0.459
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.155 ± 0.501
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.112 ± 0.501
ArgCys: 0.718 ± 0.296
ArgAsp: 3.352 ± 1.169
ArgGlu: 3.352 ± 0.408
ArgPhe: 1.915 ± 0.611
ArgGly: 3.112 ± 0.963
ArgHis: 0.479 ± 0.255
ArgIle: 3.831 ± 1.068
ArgLys: 2.633 ± 1.18
ArgLeu: 4.07 ± 1.205
ArgMet: 1.436 ± 0.415
ArgAsn: 2.394 ± 1.103
ArgPro: 2.394 ± 1.079
ArgGln: 2.633 ± 1.144
ArgArg: 2.394 ± 0.993
ArgSer: 3.352 ± 1.043
ArgThr: 2.633 ± 1.138
ArgVal: 3.112 ± 0.761
ArgTrp: 0.718 ± 0.304
ArgTyr: 0.958 ± 0.401
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.309 ± 1.297
SerCys: 3.352 ± 0.849
SerAsp: 8.619 ± 0.719
SerGlu: 4.549 ± 0.835
SerPhe: 2.394 ± 0.967
SerGly: 5.028 ± 1.317
SerHis: 3.112 ± 0.9
SerIle: 5.506 ± 0.937
SerLys: 5.746 ± 0.734
SerLeu: 6.703 ± 1.17
SerMet: 3.112 ± 0.539
SerAsn: 2.633 ± 0.678
SerPro: 2.155 ± 0.708
SerGln: 2.394 ± 0.562
SerArg: 5.028 ± 1.218
SerSer: 8.619 ± 1.265
SerThr: 7.182 ± 1.862
SerVal: 5.267 ± 0.426
SerTrp: 1.436 ± 1.171
SerTyr: 3.112 ± 0.446
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.915 ± 0.555
ThrCys: 1.676 ± 0.515
ThrAsp: 4.549 ± 1.931
ThrGlu: 3.112 ± 1.165
ThrPhe: 1.915 ± 1.009
ThrGly: 4.07 ± 1.244
ThrHis: 1.915 ± 0.678
ThrIle: 5.028 ± 0.769
ThrLys: 4.07 ± 1.141
ThrLeu: 4.788 ± 0.488
ThrMet: 2.873 ± 0.5
ThrAsn: 3.352 ± 0.605
ThrPro: 2.155 ± 0.595
ThrGln: 2.633 ± 0.633
ThrArg: 2.873 ± 0.474
ThrSer: 6.225 ± 0.913
ThrThr: 3.591 ± 0.463
ThrVal: 2.873 ± 1.022
ThrTrp: 1.676 ± 0.334
ThrTyr: 2.873 ± 0.905
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.633 ± 1.188
ValCys: 0.958 ± 0.531
ValAsp: 2.873 ± 0.496
ValGlu: 1.436 ± 0.661
ValPhe: 1.915 ± 0.537
ValGly: 2.155 ± 0.916
ValHis: 0.718 ± 0.303
ValIle: 5.267 ± 0.718
ValLys: 3.112 ± 1.496
ValLeu: 6.943 ± 1.156
ValMet: 3.591 ± 1.03
ValAsn: 3.352 ± 0.762
ValPro: 2.633 ± 0.955
ValGln: 0.718 ± 0.324
ValArg: 1.436 ± 0.527
ValSer: 4.309 ± 0.911
ValThr: 4.309 ± 1.033
ValVal: 3.591 ± 1.273
ValTrp: 0.479 ± 0.428
ValTyr: 1.436 ± 0.679
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.239 ± 0.153
TrpCys: 0.479 ± 0.582
TrpAsp: 1.436 ± 0.489
TrpGlu: 1.436 ± 0.459
TrpPhe: 0.479 ± 0.617
TrpGly: 1.197 ± 0.393
TrpHis: 0.479 ± 0.414
TrpIle: 0.479 ± 0.266
TrpLys: 1.197 ± 0.514
TrpLeu: 1.197 ± 0.771
TrpMet: 0.0 ± 0.0
TrpAsn: 0.718 ± 0.304
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.958 ± 0.401
TrpSer: 0.958 ± 0.401
TrpThr: 1.436 ± 0.489
TrpVal: 0.479 ± 0.255
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.915 ± 1.248
TyrCys: 0.958 ± 0.577
TyrAsp: 3.831 ± 0.676
TyrGlu: 0.239 ± 0.153
TyrPhe: 2.155 ± 0.799
TyrGly: 1.676 ± 0.394
TyrHis: 2.394 ± 1.259
TyrIle: 3.831 ± 1.141
TyrLys: 2.155 ± 1.28
TyrLeu: 4.788 ± 1.543
TyrMet: 0.239 ± 0.231
TyrAsn: 4.549 ± 0.746
TyrPro: 1.197 ± 0.474
TyrGln: 1.436 ± 1.111
TyrArg: 1.676 ± 0.805
TyrSer: 2.633 ± 0.376
TyrThr: 2.155 ± 0.723
TyrVal: 2.873 ± 0.382
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.915 ± 0.662
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4178 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
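A rough sketch of what a protein-level bootstrap could look like is given below. It is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The function name `bootstrap_error`, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the single residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(n_boot):
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(pair_frequency(sample, pair))
    return statistics.mean(values), statistics.stdev(values)

# Toy input in one-letter code (hypothetical, not the SYNV proteins).
mean_ss, sd_ss = bootstrap_error(["MSSLLK", "ISDSS", "AAAA"], "SS")
```

With only a handful of proteins, as in this proteome, a replicate can easily omit the one protein that carries most copies of a rare dipeptide, which is one reason the errors in the table are large relative to the values.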

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski