Amino acid dipepetide frequency for Southern rice black-streaked dwarf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.537AlaAla: 1.537 ± 0.451
0.439AlaCys: 0.439 ± 0.178
2.636AlaAsp: 2.636 ± 0.696
2.087AlaGlu: 2.087 ± 0.448
2.416AlaPhe: 2.416 ± 0.359
0.879AlaGly: 0.879 ± 0.297
1.098AlaHis: 1.098 ± 0.452
3.404AlaIle: 3.404 ± 0.416
3.185AlaLys: 3.185 ± 0.554
3.953AlaLeu: 3.953 ± 0.831
0.879AlaMet: 0.879 ± 0.34
3.624AlaAsn: 3.624 ± 0.84
1.537AlaPro: 1.537 ± 0.36
1.318AlaGln: 1.318 ± 0.391
1.537AlaArg: 1.537 ± 0.502
2.745AlaSer: 2.745 ± 0.305
1.757AlaThr: 1.757 ± 0.341
2.087AlaVal: 2.087 ± 0.571
0.22AlaTrp: 0.22 ± 0.135
2.306AlaTyr: 2.306 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
0.769CysAla: 0.769 ± 0.218
0.22CysCys: 0.22 ± 0.159
0.879CysAsp: 0.879 ± 0.252
0.439CysGlu: 0.439 ± 0.237
1.537CysPhe: 1.537 ± 0.341
0.439CysGly: 0.439 ± 0.209
0.879CysHis: 0.879 ± 0.247
0.769CysIle: 0.769 ± 0.353
0.329CysLys: 0.329 ± 0.154
1.537CysLeu: 1.537 ± 0.413
0.22CysMet: 0.22 ± 0.132
1.208CysAsn: 1.208 ± 0.402
0.439CysPro: 0.439 ± 0.241
0.329CysGln: 0.329 ± 0.178
0.769CysArg: 0.769 ± 0.342
0.988CysSer: 0.988 ± 0.332
0.769CysThr: 0.769 ± 0.278
1.428CysVal: 1.428 ± 0.474
0.22CysTrp: 0.22 ± 0.152
0.659CysTyr: 0.659 ± 0.29
0.0CysXaa: 0.0 ± 0.0
Asp
3.404AspAla: 3.404 ± 0.616
0.22AspCys: 0.22 ± 0.142
4.832AspAsp: 4.832 ± 0.763
4.393AspGlu: 4.393 ± 0.561
4.283AspPhe: 4.283 ± 0.482
2.745AspGly: 2.745 ± 0.537
1.098AspHis: 1.098 ± 0.479
3.734AspIle: 3.734 ± 0.451
3.844AspLys: 3.844 ± 0.809
5.601AspLeu: 5.601 ± 0.796
1.537AspMet: 1.537 ± 0.279
3.075AspAsn: 3.075 ± 0.547
2.087AspPro: 2.087 ± 0.529
1.977AspGln: 1.977 ± 0.467
2.745AspArg: 2.745 ± 0.441
5.161AspSer: 5.161 ± 0.836
2.855AspThr: 2.855 ± 0.79
4.722AspVal: 4.722 ± 0.969
0.549AspTrp: 0.549 ± 0.148
3.624AspTyr: 3.624 ± 0.64
0.0AspXaa: 0.0 ± 0.0
Glu
1.757GluAla: 1.757 ± 0.609
0.988GluCys: 0.988 ± 0.282
2.745GluAsp: 2.745 ± 0.528
3.514GluGlu: 3.514 ± 0.738
3.075GluPhe: 3.075 ± 0.502
1.757GluGly: 1.757 ± 0.585
1.977GluHis: 1.977 ± 0.422
4.503GluIle: 4.503 ± 0.709
4.942GluLys: 4.942 ± 1.0
6.04GluLeu: 6.04 ± 0.941
1.757GluMet: 1.757 ± 0.46
3.514GluAsn: 3.514 ± 0.8
1.537GluPro: 1.537 ± 0.279
1.867GluGln: 1.867 ± 0.447
3.185GluArg: 3.185 ± 0.572
4.503GluSer: 4.503 ± 0.86
2.636GluThr: 2.636 ± 0.506
4.393GluVal: 4.393 ± 0.745
0.769GluTrp: 0.769 ± 0.284
2.306GluTyr: 2.306 ± 0.432
0.0GluXaa: 0.0 ± 0.0
Phe
2.526PheAla: 2.526 ± 0.387
0.988PheCys: 0.988 ± 0.342
5.381PheAsp: 5.381 ± 0.828
4.722PheGlu: 4.722 ± 0.735
3.514PhePhe: 3.514 ± 0.737
4.612PheGly: 4.612 ± 0.626
0.879PheHis: 0.879 ± 0.34
4.393PheIle: 4.393 ± 0.26
3.185PheLys: 3.185 ± 0.452
5.82PheLeu: 5.82 ± 0.818
1.098PheMet: 1.098 ± 0.358
4.832PheAsn: 4.832 ± 0.728
1.208PhePro: 1.208 ± 0.3
2.196PheGln: 2.196 ± 0.229
1.537PheArg: 1.537 ± 0.382
5.601PheSer: 5.601 ± 0.686
3.844PheThr: 3.844 ± 0.478
4.503PheVal: 4.503 ± 0.521
0.659PheTrp: 0.659 ± 0.203
2.087PheTyr: 2.087 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
1.537GlyAla: 1.537 ± 0.417
0.549GlyCys: 0.549 ± 0.164
2.636GlyAsp: 2.636 ± 0.542
2.416GlyGlu: 2.416 ± 0.355
2.745GlyPhe: 2.745 ± 0.442
1.318GlyGly: 1.318 ± 0.305
1.757GlyHis: 1.757 ± 0.578
3.953GlyIle: 3.953 ± 0.619
2.745GlyLys: 2.745 ± 0.539
3.624GlyLeu: 3.624 ± 0.469
0.22GlyMet: 0.22 ± 0.209
3.075GlyAsn: 3.075 ± 0.651
0.22GlyPro: 0.22 ± 0.138
1.208GlyGln: 1.208 ± 0.342
1.318GlyArg: 1.318 ± 0.406
2.745GlySer: 2.745 ± 0.566
2.416GlyThr: 2.416 ± 0.454
3.075GlyVal: 3.075 ± 0.643
0.439GlyTrp: 0.439 ± 0.146
1.977GlyTyr: 1.977 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.879HisAla: 0.879 ± 0.464
0.439HisCys: 0.439 ± 0.131
1.647HisAsp: 1.647 ± 0.356
1.428HisGlu: 1.428 ± 0.4
2.526HisPhe: 2.526 ± 0.545
1.098HisGly: 1.098 ± 0.27
0.549HisHis: 0.549 ± 0.248
1.428HisIle: 1.428 ± 0.302
1.647HisLys: 1.647 ± 0.257
2.745HisLeu: 2.745 ± 0.527
0.549HisMet: 0.549 ± 0.236
1.537HisAsn: 1.537 ± 0.394
1.318HisPro: 1.318 ± 0.182
0.769HisGln: 0.769 ± 0.253
0.879HisArg: 0.879 ± 0.275
1.977HisSer: 1.977 ± 0.599
1.428HisThr: 1.428 ± 0.412
1.208HisVal: 1.208 ± 0.305
0.329HisTrp: 0.329 ± 0.259
1.318HisTyr: 1.318 ± 0.28
0.0HisXaa: 0.0 ± 0.0
Ile
2.526IleAla: 2.526 ± 0.393
1.098IleCys: 1.098 ± 0.433
5.052IleAsp: 5.052 ± 0.582
3.953IleGlu: 3.953 ± 0.893
4.722IlePhe: 4.722 ± 0.764
3.295IleGly: 3.295 ± 0.576
1.867IleHis: 1.867 ± 0.641
3.295IleIle: 3.295 ± 0.638
5.052IleLys: 5.052 ± 0.854
6.809IleLeu: 6.809 ± 0.612
1.318IleMet: 1.318 ± 0.45
3.953IleAsn: 3.953 ± 1.066
2.855IlePro: 2.855 ± 0.538
2.636IleGln: 2.636 ± 0.461
3.514IleArg: 3.514 ± 0.456
7.028IleSer: 7.028 ± 0.984
4.722IleThr: 4.722 ± 0.869
3.844IleVal: 3.844 ± 0.661
0.22IleTrp: 0.22 ± 0.143
2.745IleTyr: 2.745 ± 0.585
0.0IleXaa: 0.0 ± 0.0
Lys
2.196LysAla: 2.196 ± 0.761
0.879LysCys: 0.879 ± 0.325
3.734LysAsp: 3.734 ± 0.708
3.404LysGlu: 3.404 ± 0.648
3.844LysPhe: 3.844 ± 0.684
2.196LysGly: 2.196 ± 0.437
1.757LysHis: 1.757 ± 0.537
6.26LysIle: 6.26 ± 0.833
4.503LysLys: 4.503 ± 0.693
7.358LysLeu: 7.358 ± 1.039
1.977LysMet: 1.977 ± 0.273
4.612LysAsn: 4.612 ± 0.394
2.416LysPro: 2.416 ± 0.409
2.416LysGln: 2.416 ± 0.267
3.075LysArg: 3.075 ± 0.485
4.063LysSer: 4.063 ± 0.739
5.161LysThr: 5.161 ± 0.727
3.624LysVal: 3.624 ± 0.946
0.22LysTrp: 0.22 ± 0.172
2.965LysTyr: 2.965 ± 0.446
0.0LysXaa: 0.0 ± 0.0
Leu
3.514LeuAla: 3.514 ± 0.764
1.757LeuCys: 1.757 ± 0.487
6.04LeuAsp: 6.04 ± 0.915
5.93LeuGlu: 5.93 ± 0.693
7.138LeuPhe: 7.138 ± 0.599
4.503LeuGly: 4.503 ± 0.758
2.855LeuHis: 2.855 ± 0.341
6.919LeuIle: 6.919 ± 0.607
8.895LeuLys: 8.895 ± 0.84
9.664LeuLeu: 9.664 ± 1.272
2.855LeuMet: 2.855 ± 0.666
8.895LeuAsn: 8.895 ± 1.109
3.514LeuPro: 3.514 ± 0.554
1.977LeuGln: 1.977 ± 0.547
4.283LeuArg: 4.283 ± 0.732
10.213LeuSer: 10.213 ± 1.139
6.589LeuThr: 6.589 ± 0.513
4.173LeuVal: 4.173 ± 0.543
0.549LeuTrp: 0.549 ± 0.224
3.295LeuTyr: 3.295 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
1.098MetAla: 1.098 ± 0.361
0.329MetCys: 0.329 ± 0.218
1.318MetAsp: 1.318 ± 0.358
0.329MetGlu: 0.329 ± 0.212
1.867MetPhe: 1.867 ± 0.374
0.659MetGly: 0.659 ± 0.224
0.879MetHis: 0.879 ± 0.25
2.306MetIle: 2.306 ± 0.574
1.428MetLys: 1.428 ± 0.274
2.526MetLeu: 2.526 ± 0.502
0.769MetMet: 0.769 ± 0.315
2.526MetAsn: 2.526 ± 0.557
0.11MetPro: 0.11 ± 0.114
0.439MetGln: 0.439 ± 0.177
0.659MetArg: 0.659 ± 0.316
1.867MetSer: 1.867 ± 0.622
1.537MetThr: 1.537 ± 0.587
0.879MetVal: 0.879 ± 0.255
0.0MetTrp: 0.0 ± 0.0
1.098MetTyr: 1.098 ± 0.381
0.0MetXaa: 0.0 ± 0.0
Asn
3.404AsnAla: 3.404 ± 0.9
1.428AsnCys: 1.428 ± 0.264
5.491AsnAsp: 5.491 ± 0.85
2.965AsnGlu: 2.965 ± 0.714
3.295AsnPhe: 3.295 ± 0.653
3.404AsnGly: 3.404 ± 0.627
2.306AsnHis: 2.306 ± 0.754
5.271AsnIle: 5.271 ± 0.622
4.393AsnLys: 4.393 ± 0.772
8.236AsnLeu: 8.236 ± 0.847
0.769AsnMet: 0.769 ± 0.222
4.393AsnAsn: 4.393 ± 0.805
2.087AsnPro: 2.087 ± 0.28
2.306AsnGln: 2.306 ± 0.614
2.196AsnArg: 2.196 ± 0.539
5.491AsnSer: 5.491 ± 0.889
3.953AsnThr: 3.953 ± 1.005
4.722AsnVal: 4.722 ± 0.622
0.879AsnTrp: 0.879 ± 0.25
2.965AsnTyr: 2.965 ± 0.885
0.0AsnXaa: 0.0 ± 0.0
Pro
1.098ProAla: 1.098 ± 0.514
0.659ProCys: 0.659 ± 0.202
1.537ProAsp: 1.537 ± 0.243
1.428ProGlu: 1.428 ± 0.406
2.306ProPhe: 2.306 ± 0.376
0.769ProGly: 0.769 ± 0.309
0.439ProHis: 0.439 ± 0.242
2.416ProIle: 2.416 ± 0.523
1.647ProLys: 1.647 ± 0.333
3.185ProLeu: 3.185 ± 0.633
0.659ProMet: 0.659 ± 0.265
3.844ProAsn: 3.844 ± 0.513
0.988ProPro: 0.988 ± 0.405
0.769ProGln: 0.769 ± 0.183
1.098ProArg: 1.098 ± 0.383
3.844ProSer: 3.844 ± 0.851
2.745ProThr: 2.745 ± 0.542
1.977ProVal: 1.977 ± 0.429
0.329ProTrp: 0.329 ± 0.144
0.769ProTyr: 0.769 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
2.087GlnAla: 2.087 ± 0.37
0.439GlnCys: 0.439 ± 0.138
0.879GlnAsp: 0.879 ± 0.252
2.196GlnGlu: 2.196 ± 0.569
1.867GlnPhe: 1.867 ± 0.524
0.769GlnGly: 0.769 ± 0.315
0.439GlnHis: 0.439 ± 0.323
2.196GlnIle: 2.196 ± 0.564
2.416GlnLys: 2.416 ± 0.662
4.173GlnLeu: 4.173 ± 0.829
1.098GlnMet: 1.098 ± 0.299
1.318GlnAsn: 1.318 ± 0.318
1.208GlnPro: 1.208 ± 0.46
1.098GlnGln: 1.098 ± 0.42
1.977GlnArg: 1.977 ± 0.424
2.306GlnSer: 2.306 ± 0.39
1.647GlnThr: 1.647 ± 0.428
1.757GlnVal: 1.757 ± 0.507
0.22GlnTrp: 0.22 ± 0.131
1.098GlnTyr: 1.098 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
1.867ArgAla: 1.867 ± 0.476
0.329ArgCys: 0.329 ± 0.247
1.867ArgAsp: 1.867 ± 0.298
1.428ArgGlu: 1.428 ± 0.363
2.855ArgPhe: 2.855 ± 0.527
1.318ArgGly: 1.318 ± 0.402
0.879ArgHis: 0.879 ± 0.374
2.196ArgIle: 2.196 ± 0.496
2.965ArgLys: 2.965 ± 0.562
4.832ArgLeu: 4.832 ± 0.506
1.977ArgMet: 1.977 ± 0.316
2.745ArgAsn: 2.745 ± 0.402
1.208ArgPro: 1.208 ± 0.334
1.757ArgGln: 1.757 ± 0.442
2.636ArgArg: 2.636 ± 0.769
3.075ArgSer: 3.075 ± 0.631
2.526ArgThr: 2.526 ± 0.417
2.306ArgVal: 2.306 ± 0.434
0.329ArgTrp: 0.329 ± 0.159
1.757ArgTyr: 1.757 ± 0.328
0.0ArgXaa: 0.0 ± 0.0
Ser
3.185SerAla: 3.185 ± 0.556
1.208SerCys: 1.208 ± 0.38
5.711SerAsp: 5.711 ± 0.676
6.699SerGlu: 6.699 ± 1.114
4.283SerPhe: 4.283 ± 0.712
2.965SerGly: 2.965 ± 0.656
1.757SerHis: 1.757 ± 0.409
6.15SerIle: 6.15 ± 0.862
4.503SerLys: 4.503 ± 0.719
10.213SerLeu: 10.213 ± 1.157
1.757SerMet: 1.757 ± 0.49
4.393SerAsn: 4.393 ± 0.785
3.404SerPro: 3.404 ± 0.622
3.075SerGln: 3.075 ± 0.546
3.624SerArg: 3.624 ± 0.704
8.895SerSer: 8.895 ± 1.448
5.052SerThr: 5.052 ± 1.171
5.93SerVal: 5.93 ± 0.657
0.439SerTrp: 0.439 ± 0.311
4.503SerTyr: 4.503 ± 0.938
0.0SerXaa: 0.0 ± 0.0
Thr
2.526ThrAla: 2.526 ± 0.775
0.769ThrCys: 0.769 ± 0.281
2.855ThrAsp: 2.855 ± 0.817
3.953ThrGlu: 3.953 ± 0.456
3.953ThrPhe: 3.953 ± 0.39
2.087ThrGly: 2.087 ± 0.336
0.879ThrHis: 0.879 ± 0.319
4.173ThrIle: 4.173 ± 0.505
3.624ThrLys: 3.624 ± 0.627
5.711ThrLeu: 5.711 ± 0.493
0.988ThrMet: 0.988 ± 0.318
3.185ThrAsn: 3.185 ± 0.528
1.537ThrPro: 1.537 ± 0.477
1.537ThrGln: 1.537 ± 0.428
2.306ThrArg: 2.306 ± 0.737
7.358ThrSer: 7.358 ± 0.887
3.514ThrThr: 3.514 ± 0.933
4.942ThrVal: 4.942 ± 0.65
0.439ThrTrp: 0.439 ± 0.25
2.636ThrTyr: 2.636 ± 0.682
0.0ThrXaa: 0.0 ± 0.0
Val
1.977ValAla: 1.977 ± 0.72
1.098ValCys: 1.098 ± 0.286
3.404ValAsp: 3.404 ± 0.468
3.953ValGlu: 3.953 ± 0.768
4.283ValPhe: 4.283 ± 0.619
2.636ValGly: 2.636 ± 0.487
1.428ValHis: 1.428 ± 0.385
4.612ValIle: 4.612 ± 0.648
4.173ValLys: 4.173 ± 0.653
6.479ValLeu: 6.479 ± 0.791
1.318ValMet: 1.318 ± 0.759
4.832ValAsn: 4.832 ± 1.017
2.636ValPro: 2.636 ± 0.404
1.757ValGln: 1.757 ± 0.42
2.196ValArg: 2.196 ± 0.554
5.052ValSer: 5.052 ± 0.489
3.514ValThr: 3.514 ± 0.591
3.185ValVal: 3.185 ± 0.687
0.11ValTrp: 0.11 ± 0.113
2.526ValTyr: 2.526 ± 0.56
0.0ValXaa: 0.0 ± 0.0
Trp
0.329TrpAla: 0.329 ± 0.164
0.11TrpCys: 0.11 ± 0.134
0.22TrpAsp: 0.22 ± 0.177
0.549TrpGlu: 0.549 ± 0.178
0.11TrpPhe: 0.11 ± 0.134
0.11TrpGly: 0.11 ± 0.105
0.0TrpHis: 0.0 ± 0.0
0.22TrpIle: 0.22 ± 0.156
1.208TrpLys: 1.208 ± 0.338
0.329TrpLeu: 0.329 ± 0.218
0.11TrpMet: 0.11 ± 0.105
0.879TrpAsn: 0.879 ± 0.35
0.659TrpPro: 0.659 ± 0.253
0.659TrpGln: 0.659 ± 0.231
0.11TrpArg: 0.11 ± 0.111
0.879TrpSer: 0.879 ± 0.382
0.549TrpThr: 0.549 ± 0.283
0.11TrpVal: 0.11 ± 0.113
0.11TrpTrp: 0.11 ± 0.102
0.22TrpTyr: 0.22 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.428TyrAla: 1.428 ± 0.285
0.879TyrCys: 0.879 ± 0.332
3.624TyrAsp: 3.624 ± 0.781
1.977TyrGlu: 1.977 ± 0.309
2.965TyrPhe: 2.965 ± 0.581
2.416TyrGly: 2.416 ± 0.569
1.977TyrHis: 1.977 ± 0.462
2.306TyrIle: 2.306 ± 0.404
1.977TyrLys: 1.977 ± 0.501
4.722TyrLeu: 4.722 ± 0.633
0.659TyrMet: 0.659 ± 0.34
3.514TyrAsn: 3.514 ± 0.431
1.537TyrPro: 1.537 ± 0.249
1.098TyrGln: 1.098 ± 0.337
1.208TyrArg: 1.208 ± 0.393
3.953TyrSer: 3.953 ± 0.595
1.757TyrThr: 1.757 ± 0.574
2.416TyrVal: 2.416 ± 0.504
0.439TyrTrp: 0.439 ± 0.25
1.977TyrTyr: 1.977 ± 0.745
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (9107 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski