Amino acid dipepetide frequency for Rice black streaked dwarf virus (RBSDV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.933AlaAla: 1.933 ± 0.35
0.215AlaCys: 0.215 ± 0.113
2.577AlaAsp: 2.577 ± 0.656
2.148AlaGlu: 2.148 ± 0.436
2.47AlaPhe: 2.47 ± 0.335
0.752AlaGly: 0.752 ± 0.219
0.644AlaHis: 0.644 ± 0.153
3.007AlaIle: 3.007 ± 0.695
3.114AlaLys: 3.114 ± 0.46
3.544AlaLeu: 3.544 ± 0.694
1.181AlaMet: 1.181 ± 0.26
3.759AlaAsn: 3.759 ± 0.65
1.611AlaPro: 1.611 ± 0.458
1.074AlaGln: 1.074 ± 0.371
1.611AlaArg: 1.611 ± 0.401
3.544AlaSer: 3.544 ± 0.306
1.826AlaThr: 1.826 ± 0.628
2.148AlaVal: 2.148 ± 0.505
0.322AlaTrp: 0.322 ± 0.197
1.933AlaTyr: 1.933 ± 0.262
0.0AlaXaa: 0.0 ± 0.0
Cys
0.752CysAla: 0.752 ± 0.26
0.215CysCys: 0.215 ± 0.162
1.181CysAsp: 1.181 ± 0.33
0.859CysGlu: 0.859 ± 0.379
1.718CysPhe: 1.718 ± 0.251
0.215CysGly: 0.215 ± 0.166
0.537CysHis: 0.537 ± 0.154
0.859CysIle: 0.859 ± 0.237
0.43CysLys: 0.43 ± 0.138
1.826CysLeu: 1.826 ± 0.453
0.215CysMet: 0.215 ± 0.118
0.752CysAsn: 0.752 ± 0.218
0.537CysPro: 0.537 ± 0.265
0.43CysGln: 0.43 ± 0.171
0.752CysArg: 0.752 ± 0.306
1.826CysSer: 1.826 ± 0.236
0.43CysThr: 0.43 ± 0.231
1.503CysVal: 1.503 ± 0.504
0.215CysTrp: 0.215 ± 0.115
0.644CysTyr: 0.644 ± 0.277
0.0CysXaa: 0.0 ± 0.0
Asp
3.329AspAla: 3.329 ± 0.649
0.644AspCys: 0.644 ± 0.195
4.725AspAsp: 4.725 ± 0.919
4.296AspGlu: 4.296 ± 0.6
4.725AspPhe: 4.725 ± 0.312
2.685AspGly: 2.685 ± 0.481
1.611AspHis: 1.611 ± 0.234
4.188AspIle: 4.188 ± 0.467
3.436AspLys: 3.436 ± 0.515
5.906AspLeu: 5.906 ± 0.592
1.074AspMet: 1.074 ± 0.329
2.792AspAsn: 2.792 ± 0.66
2.47AspPro: 2.47 ± 0.569
1.503AspGln: 1.503 ± 0.462
2.148AspArg: 2.148 ± 0.423
4.618AspSer: 4.618 ± 0.469
2.255AspThr: 2.255 ± 0.662
5.262AspVal: 5.262 ± 0.514
0.752AspTrp: 0.752 ± 0.253
3.436AspTyr: 3.436 ± 0.427
0.0AspXaa: 0.0 ± 0.0
Glu
1.396GluAla: 1.396 ± 0.304
0.644GluCys: 0.644 ± 0.145
1.826GluAsp: 1.826 ± 0.426
3.866GluGlu: 3.866 ± 0.482
3.007GluPhe: 3.007 ± 0.477
2.04GluGly: 2.04 ± 0.528
1.289GluHis: 1.289 ± 0.257
4.618GluIle: 4.618 ± 0.697
4.725GluLys: 4.725 ± 0.694
6.014GluLeu: 6.014 ± 0.719
2.04GluMet: 2.04 ± 0.403
4.188GluAsn: 4.188 ± 0.998
0.859GluPro: 0.859 ± 0.28
2.792GluGln: 2.792 ± 0.384
3.007GluArg: 3.007 ± 0.577
3.651GluSer: 3.651 ± 0.553
3.329GluThr: 3.329 ± 0.677
3.973GluVal: 3.973 ± 0.62
0.537GluTrp: 0.537 ± 0.177
2.792GluTyr: 2.792 ± 0.491
0.0GluXaa: 0.0 ± 0.0
Phe
2.255PheAla: 2.255 ± 0.55
1.074PheCys: 1.074 ± 0.365
5.584PheAsp: 5.584 ± 0.824
4.188PheGlu: 4.188 ± 0.371
3.222PhePhe: 3.222 ± 0.6
4.725PheGly: 4.725 ± 0.706
0.752PheHis: 0.752 ± 0.232
4.296PheIle: 4.296 ± 0.297
3.544PheLys: 3.544 ± 0.467
5.477PheLeu: 5.477 ± 0.813
1.503PheMet: 1.503 ± 0.374
4.51PheAsn: 4.51 ± 0.621
1.503PhePro: 1.503 ± 0.274
1.826PheGln: 1.826 ± 0.43
2.04PheArg: 2.04 ± 0.426
5.692PheSer: 5.692 ± 0.516
3.759PheThr: 3.759 ± 0.751
3.973PheVal: 3.973 ± 0.463
0.644PheTrp: 0.644 ± 0.203
2.363PheTyr: 2.363 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
1.611GlyAla: 1.611 ± 0.41
0.644GlyCys: 0.644 ± 0.189
2.148GlyAsp: 2.148 ± 0.534
2.255GlyGlu: 2.255 ± 0.333
2.685GlyPhe: 2.685 ± 0.326
1.074GlyGly: 1.074 ± 0.242
1.289GlyHis: 1.289 ± 0.43
3.436GlyIle: 3.436 ± 0.566
2.792GlyLys: 2.792 ± 0.467
3.222GlyLeu: 3.222 ± 0.42
0.43GlyMet: 0.43 ± 0.202
3.329GlyAsn: 3.329 ± 0.67
0.322GlyPro: 0.322 ± 0.146
1.289GlyGln: 1.289 ± 0.385
1.611GlyArg: 1.611 ± 0.35
2.363GlySer: 2.363 ± 0.299
1.933GlyThr: 1.933 ± 0.364
2.899GlyVal: 2.899 ± 0.571
0.43GlyTrp: 0.43 ± 0.128
2.148GlyTyr: 2.148 ± 0.288
0.0GlyXaa: 0.0 ± 0.0
His
1.289HisAla: 1.289 ± 0.353
0.322HisCys: 0.322 ± 0.111
1.289HisAsp: 1.289 ± 0.253
1.289HisGlu: 1.289 ± 0.383
1.933HisPhe: 1.933 ± 0.39
0.859HisGly: 0.859 ± 0.332
0.752HisHis: 0.752 ± 0.26
0.966HisIle: 0.966 ± 0.256
1.289HisLys: 1.289 ± 0.25
3.651HisLeu: 3.651 ± 0.491
0.215HisMet: 0.215 ± 0.113
1.289HisAsn: 1.289 ± 0.434
1.503HisPro: 1.503 ± 0.28
1.074HisGln: 1.074 ± 0.286
0.644HisArg: 0.644 ± 0.161
1.826HisSer: 1.826 ± 0.53
1.289HisThr: 1.289 ± 0.375
1.396HisVal: 1.396 ± 0.249
0.107HisTrp: 0.107 ± 0.109
1.611HisTyr: 1.611 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
2.363IleAla: 2.363 ± 0.352
0.966IleCys: 0.966 ± 0.286
5.477IleAsp: 5.477 ± 0.604
3.866IleGlu: 3.866 ± 0.749
4.188IlePhe: 4.188 ± 0.606
3.114IleGly: 3.114 ± 0.514
1.718IleHis: 1.718 ± 0.335
4.51IleIle: 4.51 ± 0.997
5.155IleLys: 5.155 ± 0.786
5.692IleLeu: 5.692 ± 0.767
1.503IleMet: 1.503 ± 0.317
5.369IleAsn: 5.369 ± 0.817
2.685IlePro: 2.685 ± 0.526
2.148IleGln: 2.148 ± 0.363
3.114IleArg: 3.114 ± 0.476
7.839IleSer: 7.839 ± 1.709
4.296IleThr: 4.296 ± 0.696
3.973IleVal: 3.973 ± 0.415
0.215IleTrp: 0.215 ± 0.124
2.363IleTyr: 2.363 ± 0.461
0.0IleXaa: 0.0 ± 0.0
Lys
2.255LysAla: 2.255 ± 0.454
0.966LysCys: 0.966 ± 0.243
3.759LysAsp: 3.759 ± 0.629
3.329LysGlu: 3.329 ± 0.775
3.544LysPhe: 3.544 ± 0.348
1.933LysGly: 1.933 ± 0.43
1.826LysHis: 1.826 ± 0.506
5.906LysIle: 5.906 ± 0.766
4.725LysLys: 4.725 ± 0.653
7.195LysLeu: 7.195 ± 0.67
2.255LysMet: 2.255 ± 0.313
4.725LysAsn: 4.725 ± 0.396
2.148LysPro: 2.148 ± 0.385
1.933LysGln: 1.933 ± 0.299
4.081LysArg: 4.081 ± 0.549
4.832LysSer: 4.832 ± 0.675
4.832LysThr: 4.832 ± 0.698
3.329LysVal: 3.329 ± 0.478
0.752LysTrp: 0.752 ± 0.167
2.899LysTyr: 2.899 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
3.544LeuAla: 3.544 ± 0.806
1.933LeuCys: 1.933 ± 0.491
6.121LeuAsp: 6.121 ± 0.842
5.047LeuGlu: 5.047 ± 0.479
7.088LeuPhe: 7.088 ± 0.457
4.081LeuGly: 4.081 ± 0.637
2.47LeuHis: 2.47 ± 0.419
6.765LeuIle: 6.765 ± 0.802
8.698LeuLys: 8.698 ± 0.486
9.88LeuLeu: 9.88 ± 1.184
2.47LeuMet: 2.47 ± 0.469
8.698LeuAsn: 8.698 ± 0.927
3.329LeuPro: 3.329 ± 0.62
2.148LeuGln: 2.148 ± 0.383
4.618LeuArg: 4.618 ± 0.657
10.631LeuSer: 10.631 ± 1.192
5.906LeuThr: 5.906 ± 0.74
4.832LeuVal: 4.832 ± 0.707
0.43LeuTrp: 0.43 ± 0.172
3.007LeuTyr: 3.007 ± 0.644
0.0LeuXaa: 0.0 ± 0.0
Met
0.859MetAla: 0.859 ± 0.212
0.43MetCys: 0.43 ± 0.202
1.289MetAsp: 1.289 ± 0.26
0.43MetGlu: 0.43 ± 0.214
1.611MetPhe: 1.611 ± 0.292
0.859MetGly: 0.859 ± 0.194
0.752MetHis: 0.752 ± 0.215
2.255MetIle: 2.255 ± 0.533
1.611MetLys: 1.611 ± 0.361
2.685MetLeu: 2.685 ± 0.473
0.752MetMet: 0.752 ± 0.266
2.255MetAsn: 2.255 ± 0.568
0.644MetPro: 0.644 ± 0.261
0.322MetGln: 0.322 ± 0.161
0.644MetArg: 0.644 ± 0.265
2.577MetSer: 2.577 ± 0.638
1.289MetThr: 1.289 ± 0.366
1.181MetVal: 1.181 ± 0.211
0.0MetTrp: 0.0 ± 0.0
1.611MetTyr: 1.611 ± 0.31
0.0MetXaa: 0.0 ± 0.0
Asn
3.436AsnAla: 3.436 ± 0.463
1.826AsnCys: 1.826 ± 0.388
4.832AsnAsp: 4.832 ± 0.695
4.081AsnGlu: 4.081 ± 0.619
3.759AsnPhe: 3.759 ± 0.66
3.007AsnGly: 3.007 ± 0.375
2.47AsnHis: 2.47 ± 0.541
3.759AsnIle: 3.759 ± 0.705
4.403AsnLys: 4.403 ± 0.552
8.162AsnLeu: 8.162 ± 0.992
1.074AsnMet: 1.074 ± 0.313
3.222AsnAsn: 3.222 ± 0.459
2.47AsnPro: 2.47 ± 0.495
2.363AsnGln: 2.363 ± 0.405
2.792AsnArg: 2.792 ± 0.347
5.262AsnSer: 5.262 ± 0.702
4.618AsnThr: 4.618 ± 0.864
5.692AsnVal: 5.692 ± 0.644
0.752AsnTrp: 0.752 ± 0.171
3.007AsnTyr: 3.007 ± 0.937
0.0AsnXaa: 0.0 ± 0.0
Pro
0.859ProAla: 0.859 ± 0.36
0.752ProCys: 0.752 ± 0.185
1.611ProAsp: 1.611 ± 0.207
1.396ProGlu: 1.396 ± 0.367
2.577ProPhe: 2.577 ± 0.353
1.181ProGly: 1.181 ± 0.241
0.752ProHis: 0.752 ± 0.312
2.899ProIle: 2.899 ± 0.505
1.826ProLys: 1.826 ± 0.365
3.114ProLeu: 3.114 ± 0.577
0.644ProMet: 0.644 ± 0.18
3.759ProAsn: 3.759 ± 0.547
0.644ProPro: 0.644 ± 0.326
0.859ProGln: 0.859 ± 0.237
0.966ProArg: 0.966 ± 0.38
4.51ProSer: 4.51 ± 0.8
2.363ProThr: 2.363 ± 0.569
2.148ProVal: 2.148 ± 0.216
0.215ProTrp: 0.215 ± 0.118
0.644ProTyr: 0.644 ± 0.273
0.0ProXaa: 0.0 ± 0.0
Gln
2.255GlnAla: 2.255 ± 0.417
0.322GlnCys: 0.322 ± 0.127
0.966GlnAsp: 0.966 ± 0.188
1.611GlnGlu: 1.611 ± 0.272
2.363GlnPhe: 2.363 ± 0.302
1.074GlnGly: 1.074 ± 0.195
0.644GlnHis: 0.644 ± 0.25
2.47GlnIle: 2.47 ± 0.557
1.503GlnLys: 1.503 ± 0.368
4.188GlnLeu: 4.188 ± 0.65
1.074GlnMet: 1.074 ± 0.257
1.396GlnAsn: 1.396 ± 0.381
0.752GlnPro: 0.752 ± 0.218
1.503GlnGln: 1.503 ± 0.433
1.933GlnArg: 1.933 ± 0.428
2.792GlnSer: 2.792 ± 0.544
2.685GlnThr: 2.685 ± 0.537
1.718GlnVal: 1.718 ± 0.455
0.215GlnTrp: 0.215 ± 0.119
0.966GlnTyr: 0.966 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
1.718ArgAla: 1.718 ± 0.331
0.537ArgCys: 0.537 ± 0.279
2.148ArgAsp: 2.148 ± 0.225
1.611ArgGlu: 1.611 ± 0.469
3.222ArgPhe: 3.222 ± 0.513
1.396ArgGly: 1.396 ± 0.338
1.396ArgHis: 1.396 ± 0.211
2.577ArgIle: 2.577 ± 0.448
3.114ArgLys: 3.114 ± 0.424
3.866ArgLeu: 3.866 ± 0.588
1.933ArgMet: 1.933 ± 0.317
2.47ArgAsn: 2.47 ± 0.439
1.289ArgPro: 1.289 ± 0.237
2.148ArgGln: 2.148 ± 0.419
2.148ArgArg: 2.148 ± 0.52
2.899ArgSer: 2.899 ± 0.76
2.47ArgThr: 2.47 ± 0.509
2.255ArgVal: 2.255 ± 0.309
0.215ArgTrp: 0.215 ± 0.129
1.933ArgTyr: 1.933 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
4.296SerAla: 4.296 ± 0.82
1.611SerCys: 1.611 ± 0.33
5.584SerAsp: 5.584 ± 0.424
6.551SerGlu: 6.551 ± 1.083
4.403SerPhe: 4.403 ± 0.856
2.148SerGly: 2.148 ± 0.414
2.04SerHis: 2.04 ± 0.376
6.98SerIle: 6.98 ± 1.108
6.014SerLys: 6.014 ± 0.834
9.128SerLeu: 9.128 ± 1.094
1.503SerMet: 1.503 ± 0.433
6.551SerAsn: 6.551 ± 1.232
4.188SerPro: 4.188 ± 0.392
3.007SerGln: 3.007 ± 0.392
3.651SerArg: 3.651 ± 0.491
9.343SerSer: 9.343 ± 1.351
4.081SerThr: 4.081 ± 0.811
4.725SerVal: 4.725 ± 0.534
0.322SerTrp: 0.322 ± 0.162
4.618SerTyr: 4.618 ± 0.837
0.0SerXaa: 0.0 ± 0.0
Thr
2.255ThrAla: 2.255 ± 0.557
0.859ThrCys: 0.859 ± 0.289
3.759ThrAsp: 3.759 ± 0.782
3.114ThrGlu: 3.114 ± 0.542
3.114ThrPhe: 3.114 ± 0.551
1.933ThrGly: 1.933 ± 0.435
0.859ThrHis: 0.859 ± 0.3
3.866ThrIle: 3.866 ± 0.489
3.651ThrLys: 3.651 ± 0.675
6.336ThrLeu: 6.336 ± 0.658
1.396ThrMet: 1.396 ± 0.513
3.007ThrAsn: 3.007 ± 0.662
1.289ThrPro: 1.289 ± 0.437
2.04ThrGln: 2.04 ± 0.376
2.255ThrArg: 2.255 ± 0.605
7.088ThrSer: 7.088 ± 0.733
2.685ThrThr: 2.685 ± 0.775
4.296ThrVal: 4.296 ± 0.762
0.322ThrTrp: 0.322 ± 0.206
2.04ThrTyr: 2.04 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
1.718ValAla: 1.718 ± 0.381
1.074ValCys: 1.074 ± 0.396
3.114ValAsp: 3.114 ± 0.39
3.866ValGlu: 3.866 ± 0.588
3.973ValPhe: 3.973 ± 0.559
2.47ValGly: 2.47 ± 0.351
1.181ValHis: 1.181 ± 0.411
4.51ValIle: 4.51 ± 0.604
4.51ValLys: 4.51 ± 0.596
6.873ValLeu: 6.873 ± 0.653
1.933ValMet: 1.933 ± 0.531
4.832ValAsn: 4.832 ± 0.687
3.114ValPro: 3.114 ± 0.411
2.04ValGln: 2.04 ± 0.43
1.933ValArg: 1.933 ± 0.324
5.047ValSer: 5.047 ± 0.655
2.792ValThr: 2.792 ± 0.493
4.403ValVal: 4.403 ± 0.574
0.215ValTrp: 0.215 ± 0.119
2.47ValTyr: 2.47 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
0.215TrpAla: 0.215 ± 0.119
0.107TrpCys: 0.107 ± 0.109
0.322TrpAsp: 0.322 ± 0.169
0.752TrpGlu: 0.752 ± 0.258
0.107TrpPhe: 0.107 ± 0.109
0.107TrpGly: 0.107 ± 0.079
0.0TrpHis: 0.0 ± 0.0
0.215TrpIle: 0.215 ± 0.113
1.074TrpLys: 1.074 ± 0.271
0.322TrpLeu: 0.322 ± 0.158
0.0TrpMet: 0.0 ± 0.0
1.074TrpAsn: 1.074 ± 0.325
0.644TrpPro: 0.644 ± 0.217
0.644TrpGln: 0.644 ± 0.212
0.0TrpArg: 0.0 ± 0.0
0.752TrpSer: 0.752 ± 0.337
0.537TrpThr: 0.537 ± 0.196
0.215TrpVal: 0.215 ± 0.129
0.107TrpTrp: 0.107 ± 0.094
0.215TrpTyr: 0.215 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.074TyrAla: 1.074 ± 0.224
0.752TyrCys: 0.752 ± 0.23
3.544TyrAsp: 3.544 ± 0.607
2.04TyrGlu: 2.04 ± 0.461
3.222TyrPhe: 3.222 ± 0.646
2.255TyrGly: 2.255 ± 0.416
1.611TyrHis: 1.611 ± 0.388
2.47TyrIle: 2.47 ± 0.387
1.611TyrLys: 1.611 ± 0.442
4.618TyrLeu: 4.618 ± 0.565
0.752TyrMet: 0.752 ± 0.333
3.007TyrAsn: 3.007 ± 0.434
1.718TyrPro: 1.718 ± 0.262
1.289TyrGln: 1.289 ± 0.373
1.396TyrArg: 1.396 ± 0.246
3.759TyrSer: 3.759 ± 0.632
2.792TyrThr: 2.792 ± 0.842
2.148TyrVal: 2.148 ± 0.396
0.644TyrTrp: 0.644 ± 0.349
1.718TyrTyr: 1.718 ± 0.558
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (9313 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski