Amino acid dipepetide frequency for Rice ragged stunt virus (isolate Thailand) (RRSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.594AlaAla: 5.594 ± 0.732
1.941AlaCys: 1.941 ± 0.463
3.539AlaAsp: 3.539 ± 0.578
5.023AlaGlu: 5.023 ± 0.771
4.453AlaPhe: 4.453 ± 1.001
3.539AlaGly: 3.539 ± 0.378
1.028AlaHis: 1.028 ± 0.218
3.653AlaIle: 3.653 ± 0.81
3.882AlaLys: 3.882 ± 0.364
6.508AlaLeu: 6.508 ± 0.982
2.626AlaMet: 2.626 ± 0.319
3.311AlaAsn: 3.311 ± 0.672
2.283AlaPro: 2.283 ± 0.609
2.283AlaGln: 2.283 ± 0.395
4.567AlaArg: 4.567 ± 0.628
6.622AlaSer: 6.622 ± 0.556
3.768AlaThr: 3.768 ± 0.703
4.11AlaVal: 4.11 ± 0.846
0.799AlaTrp: 0.799 ± 0.278
1.598AlaTyr: 1.598 ± 0.354
0.0AlaXaa: 0.0 ± 0.0
Cys
1.713CysAla: 1.713 ± 0.368
0.457CysCys: 0.457 ± 0.317
1.028CysAsp: 1.028 ± 0.402
0.913CysGlu: 0.913 ± 0.277
0.685CysPhe: 0.685 ± 0.233
2.169CysGly: 2.169 ± 0.411
0.343CysHis: 0.343 ± 0.218
0.571CysIle: 0.571 ± 0.312
1.028CysLys: 1.028 ± 0.37
2.398CysLeu: 2.398 ± 0.676
0.114CysMet: 0.114 ± 0.105
0.685CysAsn: 0.685 ± 0.217
0.685CysPro: 0.685 ± 0.258
0.685CysGln: 0.685 ± 0.233
0.685CysArg: 0.685 ± 0.308
0.685CysSer: 0.685 ± 0.308
0.571CysThr: 0.571 ± 0.303
1.598CysVal: 1.598 ± 0.465
0.0CysTrp: 0.0 ± 0.0
1.028CysTyr: 1.028 ± 0.342
0.0CysXaa: 0.0 ± 0.0
Asp
3.539AspAla: 3.539 ± 0.52
0.913AspCys: 0.913 ± 0.328
3.539AspAsp: 3.539 ± 0.47
3.539AspGlu: 3.539 ± 0.515
1.941AspPhe: 1.941 ± 0.489
2.283AspGly: 2.283 ± 0.416
1.028AspHis: 1.028 ± 0.379
3.653AspIle: 3.653 ± 0.781
3.197AspLys: 3.197 ± 0.641
5.594AspLeu: 5.594 ± 0.651
1.256AspMet: 1.256 ± 0.328
1.827AspAsn: 1.827 ± 0.518
2.626AspPro: 2.626 ± 0.384
1.028AspGln: 1.028 ± 0.238
2.398AspArg: 2.398 ± 0.382
4.567AspSer: 4.567 ± 0.745
3.197AspThr: 3.197 ± 0.459
3.653AspVal: 3.653 ± 0.431
0.343AspTrp: 0.343 ± 0.224
3.197AspTyr: 3.197 ± 0.482
0.0AspXaa: 0.0 ± 0.0
Glu
3.996GluAla: 3.996 ± 1.313
0.343GluCys: 0.343 ± 0.152
2.626GluAsp: 2.626 ± 0.481
4.224GluGlu: 4.224 ± 0.824
2.169GluPhe: 2.169 ± 0.36
3.083GluGly: 3.083 ± 0.458
1.256GluHis: 1.256 ± 0.368
5.138GluIle: 5.138 ± 0.791
2.854GluLys: 2.854 ± 0.625
6.85GluLeu: 6.85 ± 0.712
1.598GluMet: 1.598 ± 0.456
2.283GluAsn: 2.283 ± 0.474
2.854GluPro: 2.854 ± 0.369
2.626GluGln: 2.626 ± 0.504
5.366GluArg: 5.366 ± 0.74
2.74GluSer: 2.74 ± 0.831
2.283GluThr: 2.283 ± 0.404
4.338GluVal: 4.338 ± 0.564
0.913GluTrp: 0.913 ± 0.295
2.283GluTyr: 2.283 ± 0.419
0.0GluXaa: 0.0 ± 0.0
Phe
4.338PheAla: 4.338 ± 0.858
0.799PheCys: 0.799 ± 0.195
3.197PheAsp: 3.197 ± 0.435
2.854PheGlu: 2.854 ± 0.526
1.37PhePhe: 1.37 ± 0.395
3.083PheGly: 3.083 ± 0.407
0.913PheHis: 0.913 ± 0.279
2.626PheIle: 2.626 ± 0.504
1.713PheLys: 1.713 ± 0.372
2.854PheLeu: 2.854 ± 0.568
0.571PheMet: 0.571 ± 0.205
1.142PheAsn: 1.142 ± 0.182
1.827PhePro: 1.827 ± 0.451
0.913PheGln: 0.913 ± 0.376
2.055PheArg: 2.055 ± 0.548
3.768PheSer: 3.768 ± 0.63
2.74PheThr: 2.74 ± 0.423
2.968PheVal: 2.968 ± 0.695
0.343PheTrp: 0.343 ± 0.221
1.713PheTyr: 1.713 ± 0.65
0.0PheXaa: 0.0 ± 0.0
Gly
3.653GlyAla: 3.653 ± 0.733
0.799GlyCys: 0.799 ± 0.284
2.854GlyAsp: 2.854 ± 0.332
3.653GlyGlu: 3.653 ± 0.536
2.854GlyPhe: 2.854 ± 0.393
3.083GlyGly: 3.083 ± 0.638
0.685GlyHis: 0.685 ± 0.282
3.083GlyIle: 3.083 ± 0.676
3.539GlyLys: 3.539 ± 0.531
7.078GlyLeu: 7.078 ± 0.972
1.941GlyMet: 1.941 ± 0.395
2.626GlyAsn: 2.626 ± 0.602
1.256GlyPro: 1.256 ± 0.266
1.598GlyGln: 1.598 ± 0.329
2.968GlyArg: 2.968 ± 0.496
4.795GlySer: 4.795 ± 0.834
4.338GlyThr: 4.338 ± 0.672
5.366GlyVal: 5.366 ± 0.598
0.457GlyTrp: 0.457 ± 0.226
1.598GlyTyr: 1.598 ± 0.308
0.0GlyXaa: 0.0 ± 0.0
His
2.055HisAla: 2.055 ± 0.408
0.343HisCys: 0.343 ± 0.221
0.913HisAsp: 0.913 ± 0.431
1.028HisGlu: 1.028 ± 0.389
0.685HisPhe: 0.685 ± 0.259
0.685HisGly: 0.685 ± 0.172
0.228HisHis: 0.228 ± 0.166
0.685HisIle: 0.685 ± 0.336
0.913HisLys: 0.913 ± 0.272
1.827HisLeu: 1.827 ± 0.489
0.571HisMet: 0.571 ± 0.245
0.228HisAsn: 0.228 ± 0.134
0.685HisPro: 0.685 ± 0.312
0.913HisGln: 0.913 ± 0.213
1.484HisArg: 1.484 ± 0.388
1.37HisSer: 1.37 ± 0.424
1.37HisThr: 1.37 ± 0.49
2.169HisVal: 2.169 ± 0.454
0.114HisTrp: 0.114 ± 0.12
1.028HisTyr: 1.028 ± 0.433
0.0HisXaa: 0.0 ± 0.0
Ile
4.681IleAla: 4.681 ± 1.141
1.37IleCys: 1.37 ± 0.446
3.197IleAsp: 3.197 ± 0.471
3.311IleGlu: 3.311 ± 0.384
2.055IlePhe: 2.055 ± 0.473
4.11IleGly: 4.11 ± 0.552
1.827IleHis: 1.827 ± 0.633
3.311IleIle: 3.311 ± 0.804
2.283IleLys: 2.283 ± 0.425
3.882IleLeu: 3.882 ± 0.622
1.598IleMet: 1.598 ± 0.347
1.827IleAsn: 1.827 ± 0.362
3.539IlePro: 3.539 ± 0.481
2.283IleGln: 2.283 ± 0.545
3.197IleArg: 3.197 ± 0.602
5.708IleSer: 5.708 ± 0.562
3.882IleThr: 3.882 ± 0.409
3.996IleVal: 3.996 ± 0.769
0.799IleTrp: 0.799 ± 0.299
2.398IleTyr: 2.398 ± 0.648
0.0IleXaa: 0.0 ± 0.0
Lys
3.768LysAla: 3.768 ± 0.546
1.142LysCys: 1.142 ± 0.205
2.283LysAsp: 2.283 ± 0.49
3.425LysGlu: 3.425 ± 0.727
2.283LysPhe: 2.283 ± 0.368
2.512LysGly: 2.512 ± 0.359
1.028LysHis: 1.028 ± 0.354
2.854LysIle: 2.854 ± 0.782
2.626LysLys: 2.626 ± 0.272
4.795LysLeu: 4.795 ± 0.535
1.37LysMet: 1.37 ± 0.279
1.598LysAsn: 1.598 ± 0.354
2.626LysPro: 2.626 ± 0.368
2.055LysGln: 2.055 ± 0.356
2.854LysArg: 2.854 ± 0.556
2.626LysSer: 2.626 ± 0.489
3.539LysThr: 3.539 ± 0.602
2.626LysVal: 2.626 ± 0.471
1.028LysTrp: 1.028 ± 0.445
1.484LysTyr: 1.484 ± 0.324
0.0LysXaa: 0.0 ± 0.0
Leu
8.791LeuAla: 8.791 ± 0.852
1.142LeuCys: 1.142 ± 0.375
5.252LeuAsp: 5.252 ± 0.808
5.138LeuGlu: 5.138 ± 0.737
2.968LeuPhe: 2.968 ± 0.464
5.252LeuGly: 5.252 ± 0.79
1.028LeuHis: 1.028 ± 0.371
4.795LeuIle: 4.795 ± 0.428
4.11LeuLys: 4.11 ± 0.786
8.563LeuLeu: 8.563 ± 1.129
2.283LeuMet: 2.283 ± 0.454
5.138LeuAsn: 5.138 ± 0.765
5.823LeuPro: 5.823 ± 0.917
3.653LeuGln: 3.653 ± 0.496
6.165LeuArg: 6.165 ± 0.808
10.161LeuSer: 10.161 ± 1.227
5.252LeuThr: 5.252 ± 0.824
6.393LeuVal: 6.393 ± 0.948
0.913LeuTrp: 0.913 ± 0.291
2.74LeuTyr: 2.74 ± 0.311
0.0LeuXaa: 0.0 ± 0.0
Met
1.37MetAla: 1.37 ± 0.34
0.685MetCys: 0.685 ± 0.245
0.799MetAsp: 0.799 ± 0.26
1.256MetGlu: 1.256 ± 0.336
1.598MetPhe: 1.598 ± 0.315
0.343MetGly: 0.343 ± 0.224
0.343MetHis: 0.343 ± 0.227
1.941MetIle: 1.941 ± 0.42
1.37MetLys: 1.37 ± 0.292
2.74MetLeu: 2.74 ± 0.477
1.256MetMet: 1.256 ± 0.324
0.799MetAsn: 0.799 ± 0.323
1.256MetPro: 1.256 ± 0.279
1.484MetGln: 1.484 ± 0.446
1.028MetArg: 1.028 ± 0.358
2.398MetSer: 2.398 ± 0.535
1.598MetThr: 1.598 ± 0.389
0.685MetVal: 0.685 ± 0.219
0.343MetTrp: 0.343 ± 0.204
0.913MetTyr: 0.913 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
3.197AsnAla: 3.197 ± 0.581
1.028AsnCys: 1.028 ± 0.31
2.854AsnAsp: 2.854 ± 0.456
1.941AsnGlu: 1.941 ± 0.36
2.398AsnPhe: 2.398 ± 0.316
1.713AsnGly: 1.713 ± 0.571
0.685AsnHis: 0.685 ± 0.265
2.169AsnIle: 2.169 ± 0.562
1.37AsnLys: 1.37 ± 0.3
4.567AsnLeu: 4.567 ± 0.555
0.457AsnMet: 0.457 ± 0.294
1.713AsnAsn: 1.713 ± 0.409
1.713AsnPro: 1.713 ± 0.494
1.713AsnGln: 1.713 ± 0.388
1.941AsnArg: 1.941 ± 0.585
3.311AsnSer: 3.311 ± 0.602
2.055AsnThr: 2.055 ± 0.679
3.882AsnVal: 3.882 ± 0.59
0.913AsnTrp: 0.913 ± 0.282
2.169AsnTyr: 2.169 ± 0.568
0.0AsnXaa: 0.0 ± 0.0
Pro
3.653ProAla: 3.653 ± 0.906
0.913ProCys: 0.913 ± 0.291
1.598ProAsp: 1.598 ± 0.431
2.74ProGlu: 2.74 ± 0.731
1.713ProPhe: 1.713 ± 0.481
2.055ProGly: 2.055 ± 0.37
0.571ProHis: 0.571 ± 0.322
3.311ProIle: 3.311 ± 0.864
2.398ProLys: 2.398 ± 0.252
3.539ProLeu: 3.539 ± 0.453
0.913ProMet: 0.913 ± 0.192
2.74ProAsn: 2.74 ± 0.526
1.941ProPro: 1.941 ± 0.519
1.713ProGln: 1.713 ± 0.33
2.626ProArg: 2.626 ± 0.52
5.366ProSer: 5.366 ± 1.229
3.653ProThr: 3.653 ± 0.616
2.169ProVal: 2.169 ± 0.594
0.228ProTrp: 0.228 ± 0.134
1.941ProTyr: 1.941 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
2.398GlnAla: 2.398 ± 0.621
0.457GlnCys: 0.457 ± 0.147
1.484GlnAsp: 1.484 ± 0.351
0.913GlnGlu: 0.913 ± 0.279
2.74GlnPhe: 2.74 ± 0.718
2.283GlnGly: 2.283 ± 0.464
0.457GlnHis: 0.457 ± 0.245
2.169GlnIle: 2.169 ± 0.389
1.484GlnLys: 1.484 ± 0.308
3.653GlnLeu: 3.653 ± 0.644
1.37GlnMet: 1.37 ± 0.377
1.598GlnAsn: 1.598 ± 0.292
1.256GlnPro: 1.256 ± 0.277
2.055GlnGln: 2.055 ± 0.525
2.055GlnArg: 2.055 ± 0.601
3.996GlnSer: 3.996 ± 0.574
2.968GlnThr: 2.968 ± 0.652
1.941GlnVal: 1.941 ± 0.469
0.0GlnTrp: 0.0 ± 0.0
1.028GlnTyr: 1.028 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
3.882ArgAla: 3.882 ± 0.504
1.142ArgCys: 1.142 ± 0.542
3.539ArgAsp: 3.539 ± 0.715
3.882ArgGlu: 3.882 ± 0.317
1.713ArgPhe: 1.713 ± 0.525
3.768ArgGly: 3.768 ± 0.769
1.142ArgHis: 1.142 ± 0.289
3.653ArgIle: 3.653 ± 0.686
2.169ArgLys: 2.169 ± 0.381
6.736ArgLeu: 6.736 ± 0.636
2.055ArgMet: 2.055 ± 0.508
2.055ArgAsn: 2.055 ± 0.359
2.626ArgPro: 2.626 ± 0.689
2.968ArgGln: 2.968 ± 0.239
3.197ArgArg: 3.197 ± 0.71
3.768ArgSer: 3.768 ± 0.777
3.882ArgThr: 3.882 ± 0.417
4.224ArgVal: 4.224 ± 0.705
0.571ArgTrp: 0.571 ± 0.35
2.968ArgTyr: 2.968 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
4.453SerAla: 4.453 ± 0.744
1.256SerCys: 1.256 ± 0.422
5.252SerAsp: 5.252 ± 0.682
4.795SerGlu: 4.795 ± 0.663
2.626SerPhe: 2.626 ± 0.46
6.622SerGly: 6.622 ± 0.625
2.626SerHis: 2.626 ± 0.512
5.48SerIle: 5.48 ± 0.628
4.567SerLys: 4.567 ± 0.484
8.334SerLeu: 8.334 ± 0.604
0.457SerMet: 0.457 ± 0.141
3.197SerAsn: 3.197 ± 0.586
3.768SerPro: 3.768 ± 1.014
2.626SerGln: 2.626 ± 0.521
6.622SerArg: 6.622 ± 0.677
7.649SerSer: 7.649 ± 0.718
4.795SerThr: 4.795 ± 0.984
6.051SerVal: 6.051 ± 1.024
0.685SerTrp: 0.685 ± 0.34
3.882SerTyr: 3.882 ± 0.815
0.0SerXaa: 0.0 ± 0.0
Thr
3.083ThrAla: 3.083 ± 0.716
1.484ThrCys: 1.484 ± 0.409
3.311ThrAsp: 3.311 ± 0.536
3.882ThrGlu: 3.882 ± 0.475
2.854ThrPhe: 2.854 ± 0.834
4.567ThrGly: 4.567 ± 0.787
1.598ThrHis: 1.598 ± 0.353
3.996ThrIle: 3.996 ± 0.545
3.311ThrLys: 3.311 ± 0.382
5.823ThrLeu: 5.823 ± 0.69
1.028ThrMet: 1.028 ± 0.293
2.626ThrAsn: 2.626 ± 0.531
2.626ThrPro: 2.626 ± 0.473
2.169ThrGln: 2.169 ± 0.492
3.539ThrArg: 3.539 ± 0.844
4.795ThrSer: 4.795 ± 0.625
3.083ThrThr: 3.083 ± 0.572
5.023ThrVal: 5.023 ± 0.793
0.457ThrTrp: 0.457 ± 0.191
1.941ThrTyr: 1.941 ± 0.518
0.0ThrXaa: 0.0 ± 0.0
Val
3.768ValAla: 3.768 ± 0.908
1.028ValCys: 1.028 ± 0.264
2.968ValAsp: 2.968 ± 0.631
4.567ValGlu: 4.567 ± 0.798
2.854ValPhe: 2.854 ± 0.472
4.795ValGly: 4.795 ± 0.786
1.713ValHis: 1.713 ± 0.598
3.882ValIle: 3.882 ± 0.475
3.996ValLys: 3.996 ± 0.463
5.023ValLeu: 5.023 ± 0.8
1.827ValMet: 1.827 ± 0.442
2.74ValAsn: 2.74 ± 0.586
3.083ValPro: 3.083 ± 0.469
2.055ValGln: 2.055 ± 0.499
4.11ValArg: 4.11 ± 0.52
6.736ValSer: 6.736 ± 0.593
5.594ValThr: 5.594 ± 0.691
5.138ValVal: 5.138 ± 0.812
1.142ValTrp: 1.142 ± 0.315
2.854ValTyr: 2.854 ± 0.753
0.0ValXaa: 0.0 ± 0.0
Trp
0.457TrpAla: 0.457 ± 0.26
0.114TrpCys: 0.114 ± 0.118
0.343TrpAsp: 0.343 ± 0.207
0.343TrpGlu: 0.343 ± 0.241
0.457TrpPhe: 0.457 ± 0.322
0.228TrpGly: 0.228 ± 0.208
0.114TrpHis: 0.114 ± 0.104
0.571TrpIle: 0.571 ± 0.273
0.114TrpLys: 0.114 ± 0.1
1.028TrpLeu: 1.028 ± 0.305
0.343TrpMet: 0.343 ± 0.19
0.913TrpAsn: 0.913 ± 0.377
0.685TrpPro: 0.685 ± 0.265
0.457TrpGln: 0.457 ± 0.175
1.142TrpArg: 1.142 ± 0.368
1.142TrpSer: 1.142 ± 0.283
0.799TrpThr: 0.799 ± 0.327
1.028TrpVal: 1.028 ± 0.381
0.114TrpTrp: 0.114 ± 0.105
0.799TrpTyr: 0.799 ± 0.402
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.74TyrAla: 2.74 ± 0.28
0.685TyrCys: 0.685 ± 0.284
2.74TyrAsp: 2.74 ± 0.298
2.398TyrGlu: 2.398 ± 0.522
1.37TyrPhe: 1.37 ± 0.383
2.169TyrGly: 2.169 ± 0.349
0.685TyrHis: 0.685 ± 0.207
1.713TyrIle: 1.713 ± 0.368
1.598TyrLys: 1.598 ± 0.388
3.768TyrLeu: 3.768 ± 0.971
0.457TyrMet: 0.457 ± 0.219
2.74TyrAsn: 2.74 ± 0.754
2.626TyrPro: 2.626 ± 0.544
1.028TyrGln: 1.028 ± 0.324
1.941TyrArg: 1.941 ± 0.464
3.539TyrSer: 3.539 ± 0.639
1.827TyrThr: 1.827 ± 0.398
2.512TyrVal: 2.512 ± 0.556
1.028TyrTrp: 1.028 ± 0.287
1.484TyrTyr: 1.484 ± 0.282
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (8760 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski