Amino acid dipepetide frequency for Rice hoja blanca tenuivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.84AlaAla: 2.84 ± 1.053
0.609AlaCys: 0.609 ± 0.462
2.028AlaAsp: 2.028 ± 0.974
4.057AlaGlu: 4.057 ± 0.7
1.217AlaPhe: 1.217 ± 0.277
1.826AlaGly: 1.826 ± 0.485
1.42AlaHis: 1.42 ± 0.483
3.854AlaIle: 3.854 ± 0.808
4.462AlaLys: 4.462 ± 1.316
4.26AlaLeu: 4.26 ± 0.778
1.826AlaMet: 1.826 ± 0.622
1.826AlaAsn: 1.826 ± 0.744
1.826AlaPro: 1.826 ± 0.383
1.217AlaGln: 1.217 ± 0.502
2.028AlaArg: 2.028 ± 0.642
3.651AlaSer: 3.651 ± 0.703
2.434AlaThr: 2.434 ± 0.62
3.448AlaVal: 3.448 ± 0.974
0.406AlaTrp: 0.406 ± 0.234
3.043AlaTyr: 3.043 ± 0.687
0.0AlaXaa: 0.0 ± 0.0
Cys
1.014CysAla: 1.014 ± 0.234
0.203CysCys: 0.203 ± 0.117
1.217CysAsp: 1.217 ± 0.471
0.811CysGlu: 0.811 ± 0.574
1.014CysPhe: 1.014 ± 0.531
0.609CysGly: 0.609 ± 0.395
0.203CysHis: 0.203 ± 0.266
1.42CysIle: 1.42 ± 0.334
2.028CysLys: 2.028 ± 1.594
2.028CysLeu: 2.028 ± 0.587
0.811CysMet: 0.811 ± 0.362
0.609CysAsn: 0.609 ± 0.689
1.014CysPro: 1.014 ± 0.531
0.609CysGln: 0.609 ± 0.462
0.406CysArg: 0.406 ± 0.234
1.217CysSer: 1.217 ± 1.256
1.826CysThr: 1.826 ± 1.463
1.217CysVal: 1.217 ± 0.813
0.406CysTrp: 0.406 ± 0.57
1.217CysTyr: 1.217 ± 0.504
0.0CysXaa: 0.0 ± 0.0
Asp
3.651AspAla: 3.651 ± 0.994
0.811AspCys: 0.811 ± 0.414
5.071AspAsp: 5.071 ± 0.756
4.462AspGlu: 4.462 ± 1.211
3.651AspPhe: 3.651 ± 0.937
2.637AspGly: 2.637 ± 1.309
1.623AspHis: 1.623 ± 0.548
4.057AspIle: 4.057 ± 0.966
4.26AspLys: 4.26 ± 0.677
6.897AspLeu: 6.897 ± 1.338
1.826AspMet: 1.826 ± 0.742
3.245AspAsn: 3.245 ± 0.436
4.462AspPro: 4.462 ± 1.129
2.637AspGln: 2.637 ± 0.557
2.434AspArg: 2.434 ± 0.76
3.651AspSer: 3.651 ± 1.235
2.637AspThr: 2.637 ± 0.547
4.665AspVal: 4.665 ± 0.597
0.609AspTrp: 0.609 ± 0.395
1.42AspTyr: 1.42 ± 0.365
0.0AspXaa: 0.0 ± 0.0
Glu
4.057GluAla: 4.057 ± 0.509
2.231GluCys: 2.231 ± 0.799
4.665GluAsp: 4.665 ± 0.939
2.637GluGlu: 2.637 ± 0.749
2.434GluPhe: 2.434 ± 0.76
3.043GluGly: 3.043 ± 1.037
2.434GluHis: 2.434 ± 0.712
4.868GluIle: 4.868 ± 0.518
3.245GluLys: 3.245 ± 0.93
5.882GluLeu: 5.882 ± 1.0
2.231GluMet: 2.231 ± 1.288
3.043GluAsn: 3.043 ± 0.592
1.217GluPro: 1.217 ± 0.475
1.42GluGln: 1.42 ± 0.82
4.462GluArg: 4.462 ± 0.825
5.274GluSer: 5.274 ± 0.888
4.057GluThr: 4.057 ± 0.845
4.868GluVal: 4.868 ± 0.945
0.406GluTrp: 0.406 ± 0.207
1.42GluTyr: 1.42 ± 0.667
0.0GluXaa: 0.0 ± 0.0
Phe
1.42PheAla: 1.42 ± 1.104
0.811PheCys: 0.811 ± 0.564
3.043PheAsp: 3.043 ± 0.701
3.448PheGlu: 3.448 ± 0.857
2.434PhePhe: 2.434 ± 0.655
1.623PheGly: 1.623 ± 0.294
1.014PheHis: 1.014 ± 0.352
3.043PheIle: 3.043 ± 0.699
3.854PheLys: 3.854 ± 1.001
3.245PheLeu: 3.245 ± 1.132
1.217PheMet: 1.217 ± 0.395
2.028PheAsn: 2.028 ± 0.657
2.434PhePro: 2.434 ± 0.787
0.811PheGln: 0.811 ± 0.26
2.231PheArg: 2.231 ± 0.925
3.448PheSer: 3.448 ± 0.442
3.043PheThr: 3.043 ± 0.863
3.043PheVal: 3.043 ± 1.367
0.811PheTrp: 0.811 ± 0.453
2.028PheTyr: 2.028 ± 0.657
0.0PheXaa: 0.0 ± 0.0
Gly
0.811GlyAla: 0.811 ± 1.099
0.203GlyCys: 0.203 ± 0.316
3.651GlyAsp: 3.651 ± 0.926
2.231GlyGlu: 2.231 ± 0.75
4.462GlyPhe: 4.462 ± 0.631
2.231GlyGly: 2.231 ± 0.559
0.811GlyHis: 0.811 ± 0.321
4.868GlyIle: 4.868 ± 0.625
4.26GlyLys: 4.26 ± 0.829
4.26GlyLeu: 4.26 ± 0.753
1.014GlyMet: 1.014 ± 0.408
0.811GlyAsn: 0.811 ± 0.38
0.811GlyPro: 0.811 ± 0.337
0.609GlyGln: 0.609 ± 0.28
1.217GlyArg: 1.217 ± 0.86
3.651GlySer: 3.651 ± 0.496
2.231GlyThr: 2.231 ± 0.797
2.84GlyVal: 2.84 ± 0.648
0.0GlyTrp: 0.0 ± 0.0
1.826GlyTyr: 1.826 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
1.014HisAla: 1.014 ± 0.303
0.406HisCys: 0.406 ± 0.354
1.217HisAsp: 1.217 ± 0.703
2.231HisGlu: 2.231 ± 0.711
1.014HisPhe: 1.014 ± 0.429
1.623HisGly: 1.623 ± 0.621
0.406HisHis: 0.406 ± 0.351
0.609HisIle: 0.609 ± 0.351
2.434HisLys: 2.434 ± 0.82
3.043HisLeu: 3.043 ± 0.763
0.609HisMet: 0.609 ± 0.537
1.217HisAsn: 1.217 ± 0.534
2.028HisPro: 2.028 ± 0.505
0.406HisGln: 0.406 ± 0.234
0.811HisArg: 0.811 ± 0.469
1.217HisSer: 1.217 ± 0.459
0.203HisThr: 0.203 ± 0.266
0.609HisVal: 0.609 ± 0.622
0.203HisTrp: 0.203 ± 0.117
2.434HisTyr: 2.434 ± 0.672
0.0HisXaa: 0.0 ± 0.0
Ile
3.448IleAla: 3.448 ± 0.704
1.826IleCys: 1.826 ± 0.936
3.651IleAsp: 3.651 ± 0.739
5.882IleGlu: 5.882 ± 0.893
1.42IlePhe: 1.42 ± 0.458
2.231IleGly: 2.231 ± 0.583
1.42IleHis: 1.42 ± 0.561
3.245IleIle: 3.245 ± 0.692
7.302IleLys: 7.302 ± 2.028
8.114IleLeu: 8.114 ± 2.008
1.42IleMet: 1.42 ± 0.561
2.84IleAsn: 2.84 ± 0.941
2.231IlePro: 2.231 ± 0.776
2.637IleGln: 2.637 ± 0.871
2.231IleArg: 2.231 ± 0.536
5.882IleSer: 5.882 ± 1.405
3.043IleThr: 3.043 ± 1.769
4.462IleVal: 4.462 ± 0.78
1.623IleTrp: 1.623 ± 0.382
3.245IleTyr: 3.245 ± 0.98
0.0IleXaa: 0.0 ± 0.0
Lys
4.26LysAla: 4.26 ± 0.758
2.028LysCys: 2.028 ± 0.907
6.085LysAsp: 6.085 ± 0.652
5.071LysGlu: 5.071 ± 1.027
3.245LysPhe: 3.245 ± 0.908
2.434LysGly: 2.434 ± 0.908
1.014LysHis: 1.014 ± 0.831
4.868LysIle: 4.868 ± 1.076
8.722LysLys: 8.722 ± 2.456
7.708LysLeu: 7.708 ± 1.274
1.623LysMet: 1.623 ± 0.744
6.288LysAsn: 6.288 ± 1.289
4.057LysPro: 4.057 ± 1.616
1.217LysGln: 1.217 ± 0.685
2.231LysArg: 2.231 ± 0.606
3.854LysSer: 3.854 ± 0.316
5.882LysThr: 5.882 ± 1.118
4.462LysVal: 4.462 ± 0.995
0.609LysTrp: 0.609 ± 0.351
3.043LysTyr: 3.043 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
6.491LeuAla: 6.491 ± 0.973
1.42LeuCys: 1.42 ± 1.145
4.26LeuAsp: 4.26 ± 1.329
4.665LeuGlu: 4.665 ± 1.111
3.651LeuPhe: 3.651 ± 0.671
5.274LeuGly: 5.274 ± 1.487
2.231LeuHis: 2.231 ± 0.705
5.477LeuIle: 5.477 ± 1.251
7.099LeuLys: 7.099 ± 1.316
9.533LeuLeu: 9.533 ± 1.212
2.231LeuMet: 2.231 ± 0.533
4.462LeuAsn: 4.462 ± 0.922
4.868LeuPro: 4.868 ± 1.697
2.637LeuGln: 2.637 ± 0.776
4.057LeuArg: 4.057 ± 1.574
10.142LeuSer: 10.142 ± 1.606
7.302LeuThr: 7.302 ± 1.658
6.288LeuVal: 6.288 ± 1.008
1.014LeuTrp: 1.014 ± 0.393
2.84LeuTyr: 2.84 ± 1.296
0.0LeuXaa: 0.0 ± 0.0
Met
0.811MetAla: 0.811 ± 0.534
0.609MetCys: 0.609 ± 0.462
2.637MetAsp: 2.637 ± 0.713
1.826MetGlu: 1.826 ± 0.555
1.217MetPhe: 1.217 ± 0.682
0.609MetGly: 0.609 ± 0.206
0.406MetHis: 0.406 ± 0.234
1.623MetIle: 1.623 ± 0.382
2.637MetLys: 2.637 ± 0.978
2.231MetLeu: 2.231 ± 0.776
1.42MetMet: 1.42 ± 0.458
1.217MetAsn: 1.217 ± 0.946
1.217MetPro: 1.217 ± 1.03
1.217MetGln: 1.217 ± 0.277
1.217MetArg: 1.217 ± 0.494
2.637MetSer: 2.637 ± 0.663
2.637MetThr: 2.637 ± 0.535
2.434MetVal: 2.434 ± 1.148
0.203MetTrp: 0.203 ± 0.117
1.42MetTyr: 1.42 ± 0.458
0.0MetXaa: 0.0 ± 0.0
Asn
1.826AsnAla: 1.826 ± 0.65
1.623AsnCys: 1.623 ± 0.624
3.448AsnAsp: 3.448 ± 0.899
1.014AsnGlu: 1.014 ± 0.446
3.245AsnPhe: 3.245 ± 0.833
3.245AsnGly: 3.245 ± 1.194
1.014AsnHis: 1.014 ± 0.455
3.448AsnIle: 3.448 ± 0.48
3.448AsnLys: 3.448 ± 0.803
3.245AsnLeu: 3.245 ± 0.796
1.623AsnMet: 1.623 ± 0.672
3.043AsnAsn: 3.043 ± 0.817
1.42AsnPro: 1.42 ± 0.335
2.028AsnGln: 2.028 ± 1.171
2.434AsnArg: 2.434 ± 0.577
3.245AsnSer: 3.245 ± 0.809
2.231AsnThr: 2.231 ± 0.393
3.651AsnVal: 3.651 ± 0.71
0.609AsnTrp: 0.609 ± 0.206
2.028AsnTyr: 2.028 ± 0.487
0.0AsnXaa: 0.0 ± 0.0
Pro
2.434ProAla: 2.434 ± 0.651
0.203ProCys: 0.203 ± 0.266
2.84ProAsp: 2.84 ± 1.535
2.84ProGlu: 2.84 ± 1.196
2.231ProPhe: 2.231 ± 0.888
2.637ProGly: 2.637 ± 0.645
0.406ProHis: 0.406 ± 0.354
3.043ProIle: 3.043 ± 1.42
2.434ProLys: 2.434 ± 0.823
3.651ProLeu: 3.651 ± 1.632
1.014ProMet: 1.014 ± 0.716
1.826ProAsn: 1.826 ± 0.394
0.609ProPro: 0.609 ± 0.505
0.609ProGln: 0.609 ± 0.326
1.014ProArg: 1.014 ± 0.446
4.057ProSer: 4.057 ± 1.797
2.434ProThr: 2.434 ± 0.79
3.245ProVal: 3.245 ± 0.49
0.406ProTrp: 0.406 ± 0.234
1.623ProTyr: 1.623 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
1.217GlnAla: 1.217 ± 0.27
0.609GlnCys: 0.609 ± 0.462
1.42GlnAsp: 1.42 ± 0.356
2.637GlnGlu: 2.637 ± 1.055
1.014GlnPhe: 1.014 ± 0.238
1.014GlnGly: 1.014 ± 0.586
0.609GlnHis: 0.609 ± 0.206
2.231GlnIle: 2.231 ± 0.87
3.043GlnLys: 3.043 ± 0.728
3.245GlnLeu: 3.245 ± 1.063
1.217GlnMet: 1.217 ± 0.533
1.826GlnAsn: 1.826 ± 0.555
0.609GlnPro: 0.609 ± 0.602
0.406GlnGln: 0.406 ± 0.351
1.217GlnArg: 1.217 ± 0.524
1.014GlnSer: 1.014 ± 0.408
2.028GlnThr: 2.028 ± 0.664
2.434GlnVal: 2.434 ± 0.878
0.203GlnTrp: 0.203 ± 0.117
0.811GlnTyr: 0.811 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
2.84ArgAla: 2.84 ± 1.044
1.217ArgCys: 1.217 ± 0.453
1.826ArgAsp: 1.826 ± 0.504
3.245ArgGlu: 3.245 ± 0.799
0.609ArgPhe: 0.609 ± 0.351
1.826ArgGly: 1.826 ± 0.39
0.811ArgHis: 0.811 ± 0.462
3.854ArgIle: 3.854 ± 0.99
3.651ArgLys: 3.651 ± 0.656
4.26ArgLeu: 4.26 ± 0.569
1.826ArgMet: 1.826 ± 0.342
1.826ArgAsn: 1.826 ± 0.39
1.42ArgPro: 1.42 ± 0.493
1.014ArgGln: 1.014 ± 0.44
2.028ArgArg: 2.028 ± 0.467
3.651ArgSer: 3.651 ± 0.945
3.043ArgThr: 3.043 ± 1.123
2.637ArgVal: 2.637 ± 0.388
0.811ArgTrp: 0.811 ± 0.469
1.826ArgTyr: 1.826 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
3.043SerAla: 3.043 ± 0.667
0.609SerCys: 0.609 ± 0.799
4.868SerAsp: 4.868 ± 0.795
5.477SerGlu: 5.477 ± 0.608
3.854SerPhe: 3.854 ± 0.654
2.637SerGly: 2.637 ± 0.559
2.028SerHis: 2.028 ± 0.675
5.274SerIle: 5.274 ± 1.326
6.288SerLys: 6.288 ± 2.16
8.519SerLeu: 8.519 ± 1.273
1.217SerMet: 1.217 ± 0.382
4.26SerAsn: 4.26 ± 0.762
2.637SerPro: 2.637 ± 0.785
2.231SerGln: 2.231 ± 0.4
4.26SerArg: 4.26 ± 0.95
6.491SerSer: 6.491 ± 1.251
4.868SerThr: 4.868 ± 0.562
4.26SerVal: 4.26 ± 1.122
0.609SerTrp: 0.609 ± 0.348
4.057SerTyr: 4.057 ± 1.721
0.0SerXaa: 0.0 ± 0.0
Thr
2.434ThrAla: 2.434 ± 0.919
1.217ThrCys: 1.217 ± 0.911
5.477ThrAsp: 5.477 ± 1.347
3.854ThrGlu: 3.854 ± 0.911
3.245ThrPhe: 3.245 ± 1.163
3.245ThrGly: 3.245 ± 1.355
1.217ThrHis: 1.217 ± 0.907
4.868ThrIle: 4.868 ± 0.732
2.434ThrLys: 2.434 ± 0.794
5.882ThrLeu: 5.882 ± 1.154
2.637ThrMet: 2.637 ± 0.55
2.028ThrAsn: 2.028 ± 1.461
2.434ThrPro: 2.434 ± 0.686
3.448ThrGln: 3.448 ± 1.456
3.043ThrArg: 3.043 ± 0.52
5.071ThrSer: 5.071 ± 1.558
4.868ThrThr: 4.868 ± 1.279
2.637ThrVal: 2.637 ± 1.139
0.203ThrTrp: 0.203 ± 0.266
2.231ThrTyr: 2.231 ± 0.743
0.0ThrXaa: 0.0 ± 0.0
Val
3.043ValAla: 3.043 ± 0.484
2.028ValCys: 2.028 ± 1.165
2.434ValAsp: 2.434 ± 0.471
5.071ValGlu: 5.071 ± 1.392
3.043ValPhe: 3.043 ± 0.539
2.637ValGly: 2.637 ± 1.113
2.231ValHis: 2.231 ± 0.43
4.26ValIle: 4.26 ± 0.701
3.448ValLys: 3.448 ± 1.145
4.868ValLeu: 4.868 ± 1.778
1.623ValMet: 1.623 ± 0.864
2.434ValAsn: 2.434 ± 0.887
2.84ValPro: 2.84 ± 0.541
2.637ValGln: 2.637 ± 0.861
4.057ValArg: 4.057 ± 0.938
5.882ValSer: 5.882 ± 1.241
5.274ValThr: 5.274 ± 1.432
4.462ValVal: 4.462 ± 1.301
0.811ValTrp: 0.811 ± 0.387
2.84ValTyr: 2.84 ± 0.96
0.0ValXaa: 0.0 ± 0.0
Trp
0.203TrpAla: 0.203 ± 0.117
0.203TrpCys: 0.203 ± 0.117
0.406TrpAsp: 0.406 ± 0.415
0.0TrpGlu: 0.0 ± 0.0
1.014TrpPhe: 1.014 ± 0.458
0.406TrpGly: 0.406 ± 0.273
0.609TrpHis: 0.609 ± 0.351
0.811TrpIle: 0.811 ± 0.504
0.811TrpLys: 0.811 ± 0.574
1.42TrpLeu: 1.42 ± 0.514
0.811TrpMet: 0.811 ± 0.321
0.406TrpAsn: 0.406 ± 0.234
0.406TrpPro: 0.406 ± 0.234
0.0TrpGln: 0.0 ± 0.0
0.811TrpArg: 0.811 ± 0.469
0.811TrpSer: 0.811 ± 0.263
0.406TrpThr: 0.406 ± 0.275
0.811TrpVal: 0.811 ± 0.263
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.623TyrAla: 1.623 ± 0.672
1.014TyrCys: 1.014 ± 1.008
4.462TyrAsp: 4.462 ± 0.182
2.434TyrGlu: 2.434 ± 0.431
1.014TyrPhe: 1.014 ± 0.408
1.014TyrGly: 1.014 ± 0.455
1.826TyrHis: 1.826 ± 0.56
2.434TyrIle: 2.434 ± 0.505
2.84TyrLys: 2.84 ± 0.762
3.651TyrLeu: 3.651 ± 1.094
1.826TyrMet: 1.826 ± 1.045
2.637TyrAsn: 2.637 ± 0.888
1.014TyrPro: 1.014 ± 0.234
1.014TyrGln: 1.014 ± 1.095
2.028TyrArg: 2.028 ± 0.722
2.637TyrSer: 2.637 ± 0.772
2.028TyrThr: 2.028 ± 0.701
3.448TyrVal: 3.448 ± 0.803
0.406TyrTrp: 0.406 ± 0.234
2.637TyrTyr: 2.637 ± 0.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4931 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski