Amino acid dipepetide frequency for Sanxia atyid shrimp virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.331AlaAla: 2.331 ± 0.447
0.874AlaCys: 0.874 ± 0.257
2.331AlaAsp: 2.331 ± 0.327
2.331AlaGlu: 2.331 ± 1.224
1.748AlaPhe: 1.748 ± 0.114
3.205AlaGly: 3.205 ± 2.931
0.583AlaHis: 0.583 ± 0.337
2.331AlaIle: 2.331 ± 0.327
1.457AlaLys: 1.457 ± 0.582
5.536AlaLeu: 5.536 ± 0.149
0.874AlaMet: 0.874 ± 1.187
3.497AlaAsn: 3.497 ± 0.925
2.622AlaPro: 2.622 ± 1.136
1.457AlaGln: 1.457 ± 0.451
3.205AlaArg: 3.205 ± 0.972
5.536AlaSer: 5.536 ± 1.068
1.748AlaThr: 1.748 ± 0.864
4.079AlaVal: 4.079 ± 1.353
0.874AlaTrp: 0.874 ± 0.683
2.914AlaTyr: 2.914 ± 0.381
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.166CysAsp: 1.166 ± 0.674
1.457CysGlu: 1.457 ± 0.575
0.874CysPhe: 0.874 ± 0.506
0.291CysGly: 0.291 ± 0.396
0.291CysHis: 0.291 ± 0.396
1.166CysIle: 1.166 ± 0.326
2.331CysLys: 2.331 ± 0.221
0.874CysLeu: 0.874 ± 0.671
0.291CysMet: 0.291 ± 0.169
0.874CysAsn: 0.874 ± 0.506
0.583CysPro: 0.583 ± 0.337
0.291CysGln: 0.291 ± 0.169
1.166CysArg: 1.166 ± 0.232
1.457CysSer: 1.457 ± 1.109
0.874CysThr: 0.874 ± 0.683
0.583CysVal: 0.583 ± 0.288
0.291CysTrp: 0.291 ± 0.169
0.874CysTyr: 0.874 ± 0.257
0.0CysXaa: 0.0 ± 0.0
Asp
2.622AspAla: 2.622 ± 0.808
0.874AspCys: 0.874 ± 0.671
4.953AspAsp: 4.953 ± 1.882
2.622AspGlu: 2.622 ± 0.298
1.748AspPhe: 1.748 ± 0.424
2.914AspGly: 2.914 ± 0.661
1.748AspHis: 1.748 ± 0.46
5.245AspIle: 5.245 ± 1.444
2.04AspLys: 2.04 ± 1.18
7.284AspLeu: 7.284 ± 2.008
1.457AspMet: 1.457 ± 0.695
1.748AspAsn: 1.748 ± 0.46
2.331AspPro: 2.331 ± 0.447
2.331AspGln: 2.331 ± 0.757
2.04AspArg: 2.04 ± 0.852
2.914AspSer: 2.914 ± 0.479
4.371AspThr: 4.371 ± 1.193
2.914AspVal: 2.914 ± 0.623
1.166AspTrp: 1.166 ± 0.674
2.914AspTyr: 2.914 ± 0.901
0.0AspXaa: 0.0 ± 0.0
Glu
2.622GluAla: 2.622 ± 0.298
0.874GluCys: 0.874 ± 0.257
4.371GluAsp: 4.371 ± 0.311
4.953GluGlu: 4.953 ± 0.189
2.914GluPhe: 2.914 ± 0.479
4.953GluGly: 4.953 ± 0.974
2.331GluHis: 2.331 ± 0.327
5.245GluIle: 5.245 ± 1.014
1.457GluLys: 1.457 ± 0.843
3.205GluLeu: 3.205 ± 1.451
1.457GluMet: 1.457 ± 0.575
2.331GluAsn: 2.331 ± 0.687
0.291GluPro: 0.291 ± 0.169
0.291GluGln: 0.291 ± 0.169
2.331GluArg: 2.331 ± 0.447
3.205GluSer: 3.205 ± 0.952
3.788GluThr: 3.788 ± 1.288
2.914GluVal: 2.914 ± 0.141
1.166GluTrp: 1.166 ± 0.232
0.874GluTyr: 0.874 ± 0.257
0.0GluXaa: 0.0 ± 0.0
Phe
1.457PheAla: 1.457 ± 0.07
1.748PheCys: 1.748 ± 0.114
2.04PheAsp: 2.04 ± 0.753
1.166PheGlu: 1.166 ± 0.326
2.04PhePhe: 2.04 ± 0.279
2.622PheGly: 2.622 ± 1.67
0.874PheHis: 0.874 ± 0.257
3.788PheIle: 3.788 ± 0.846
2.331PheLys: 2.331 ± 0.652
5.536PheLeu: 5.536 ± 1.398
1.166PheMet: 1.166 ± 0.378
1.166PheAsn: 1.166 ± 1.063
1.457PhePro: 1.457 ± 0.575
0.583PheGln: 0.583 ± 0.288
2.622PheArg: 2.622 ± 0.9
2.622PheSer: 2.622 ± 1.123
2.331PheThr: 2.331 ± 0.757
2.331PheVal: 2.331 ± 0.221
0.874PheTrp: 0.874 ± 0.671
1.457PheTyr: 1.457 ± 0.07
0.0PheXaa: 0.0 ± 0.0
Gly
4.371GlyAla: 4.371 ± 3.488
0.291GlyCys: 0.291 ± 0.169
3.205GlyAsp: 3.205 ± 2.086
4.953GlyGlu: 4.953 ± 1.66
1.748GlyPhe: 1.748 ± 0.114
13.986GlyGly: 13.986 ± 15.934
2.331GlyHis: 2.331 ± 1.212
2.04GlyIle: 2.04 ± 0.279
1.748GlyLys: 1.748 ± 0.514
6.119GlyLeu: 6.119 ± 1.079
2.331GlyMet: 2.331 ± 0.327
3.497GlyAsn: 3.497 ± 1.953
2.04GlyPro: 2.04 ± 1.366
2.914GlyGln: 2.914 ± 1.527
5.245GlyArg: 5.245 ± 4.487
5.536GlySer: 5.536 ± 3.317
4.079GlyThr: 4.079 ± 0.201
3.497GlyVal: 3.497 ± 0.422
1.457GlyTrp: 1.457 ± 0.591
1.457GlyTyr: 1.457 ± 0.519
0.0GlyXaa: 0.0 ± 0.0
His
2.331HisAla: 2.331 ± 0.687
0.0HisCys: 0.0 ± 0.0
1.748HisAsp: 1.748 ± 0.597
2.622HisGlu: 2.622 ± 0.615
1.457HisPhe: 1.457 ± 0.451
2.04HisGly: 2.04 ± 0.799
1.166HisHis: 1.166 ± 0.326
2.622HisIle: 2.622 ± 0.782
2.04HisLys: 2.04 ± 0.817
4.662HisLeu: 4.662 ± 0.893
0.583HisMet: 0.583 ± 0.337
1.166HisAsn: 1.166 ± 0.674
1.457HisPro: 1.457 ± 0.503
1.457HisGln: 1.457 ± 0.575
1.748HisArg: 1.748 ± 0.114
2.04HisSer: 2.04 ± 0.563
1.166HisThr: 1.166 ± 0.674
0.583HisVal: 0.583 ± 0.306
2.04HisTrp: 2.04 ± 0.293
1.748HisTyr: 1.748 ± 1.011
0.0HisXaa: 0.0 ± 0.0
Ile
3.497IleAla: 3.497 ± 0.848
0.583IleCys: 0.583 ± 0.306
4.079IleAsp: 4.079 ± 1.353
2.331IleGlu: 2.331 ± 0.914
2.914IlePhe: 2.914 ± 1.039
2.914IleGly: 2.914 ± 1.039
3.205IleHis: 3.205 ± 0.527
6.993IleIle: 6.993 ± 1.571
1.748IleLys: 1.748 ± 0.597
7.284IleLeu: 7.284 ± 1.57
1.457IleMet: 1.457 ± 0.07
3.497IleAsn: 3.497 ± 0.228
3.497IlePro: 3.497 ± 0.506
4.079IleGln: 4.079 ± 0.201
6.119IleArg: 6.119 ± 1.275
4.662IleSer: 4.662 ± 1.816
6.119IleThr: 6.119 ± 1.394
4.371IleVal: 4.371 ± 0.669
0.291IleTrp: 0.291 ± 0.169
2.04IleTyr: 2.04 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
2.04LysAla: 2.04 ± 1.286
0.874LysCys: 0.874 ± 0.399
2.331LysAsp: 2.331 ± 0.652
3.205LysGlu: 3.205 ± 0.613
3.788LysPhe: 3.788 ± 0.739
0.874LysGly: 0.874 ± 0.3
2.331LysHis: 2.331 ± 0.767
3.788LysIle: 3.788 ± 1.349
1.166LysLys: 1.166 ± 0.378
4.079LysLeu: 4.079 ± 1.599
0.583LysMet: 0.583 ± 0.337
2.622LysAsn: 2.622 ± 0.261
2.331LysPro: 2.331 ± 0.447
1.748LysGln: 1.748 ± 1.011
1.166LysArg: 1.166 ± 0.326
3.205LysSer: 3.205 ± 1.045
3.497LysThr: 3.497 ± 0.978
4.371LysVal: 4.371 ± 1.558
1.166LysTrp: 1.166 ± 0.326
2.914LysTyr: 2.914 ± 1.685
0.0LysXaa: 0.0 ± 0.0
Leu
4.662LeuAla: 4.662 ± 1.794
1.166LeuCys: 1.166 ± 0.674
7.867LeuAsp: 7.867 ± 2.019
6.993LeuGlu: 6.993 ± 2.25
3.788LeuPhe: 3.788 ± 0.739
5.245LeuGly: 5.245 ± 0.876
3.205LeuHis: 3.205 ± 1.854
5.828LeuIle: 5.828 ± 0.718
7.576LeuLys: 7.576 ± 1.692
15.152LeuLeu: 15.152 ± 4.74
4.079LeuMet: 4.079 ± 0.514
3.788LeuAsn: 3.788 ± 0.846
3.497LeuPro: 3.497 ± 0.978
3.497LeuGln: 3.497 ± 0.422
2.914LeuArg: 2.914 ± 1.241
8.741LeuSer: 8.741 ± 1.461
6.702LeuThr: 6.702 ± 2.599
4.662LeuVal: 4.662 ± 1.115
0.291LeuTrp: 0.291 ± 0.169
4.953LeuTyr: 4.953 ± 1.125
0.0LeuXaa: 0.0 ± 0.0
Met
1.166MetAla: 1.166 ± 0.378
0.291MetCys: 0.291 ± 0.169
0.583MetAsp: 0.583 ± 0.306
0.583MetGlu: 0.583 ± 0.306
2.04MetPhe: 2.04 ± 1.239
1.166MetGly: 1.166 ± 1.134
0.291MetHis: 0.291 ± 0.169
2.04MetIle: 2.04 ± 0.293
0.874MetLys: 0.874 ± 0.257
1.457MetLeu: 1.457 ± 0.843
1.457MetMet: 1.457 ± 0.987
2.04MetAsn: 2.04 ± 0.279
2.622MetPro: 2.622 ± 0.374
0.874MetGln: 0.874 ± 0.257
1.166MetArg: 1.166 ± 0.232
1.748MetSer: 1.748 ± 0.597
1.748MetThr: 1.748 ± 0.46
1.457MetVal: 1.457 ± 0.503
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.788AsnAla: 3.788 ± 1.031
0.583AsnCys: 0.583 ± 0.567
2.04AsnAsp: 2.04 ± 0.802
1.457AsnGlu: 1.457 ± 0.519
2.914AsnPhe: 2.914 ± 0.479
2.622AsnGly: 2.622 ± 1.326
1.166AsnHis: 1.166 ± 0.612
2.622AsnIle: 2.622 ± 0.9
4.953AsnLys: 4.953 ± 0.616
4.079AsnLeu: 4.079 ± 1.457
1.166AsnMet: 1.166 ± 0.733
1.166AsnAsn: 1.166 ± 0.674
2.622AsnPro: 2.622 ± 1.077
2.622AsnGln: 2.622 ± 0.374
2.331AsnArg: 2.331 ± 0.961
2.914AsnSer: 2.914 ± 1.241
2.622AsnThr: 2.622 ± 0.772
2.914AsnVal: 2.914 ± 0.783
0.583AsnTrp: 0.583 ± 0.337
0.583AsnTyr: 0.583 ± 0.337
0.0AsnXaa: 0.0 ± 0.0
Pro
1.748ProAla: 1.748 ± 0.114
0.291ProCys: 0.291 ± 0.169
1.748ProAsp: 1.748 ± 0.648
2.04ProGlu: 2.04 ± 0.279
1.166ProPhe: 1.166 ± 0.378
2.331ProGly: 2.331 ± 1.756
2.04ProHis: 2.04 ± 0.293
3.788ProIle: 3.788 ± 2.099
1.166ProLys: 1.166 ± 0.378
3.497ProLeu: 3.497 ± 0.684
0.874ProMet: 0.874 ± 0.257
2.331ProAsn: 2.331 ± 0.447
4.662ProPro: 4.662 ± 0.458
1.166ProGln: 1.166 ± 0.378
2.04ProArg: 2.04 ± 0.563
5.536ProSer: 5.536 ± 1.926
1.748ProThr: 1.748 ± 0.959
2.331ProVal: 2.331 ± 0.961
0.874ProTrp: 0.874 ± 0.3
2.331ProTyr: 2.331 ± 0.757
0.0ProXaa: 0.0 ± 0.0
Gln
1.166GlnAla: 1.166 ± 0.576
0.583GlnCys: 0.583 ± 0.791
4.079GlnAsp: 4.079 ± 1.949
1.748GlnGlu: 1.748 ± 0.959
0.874GlnPhe: 0.874 ± 0.506
4.079GlnGly: 4.079 ± 2.731
1.166GlnHis: 1.166 ± 0.576
2.622GlnIle: 2.622 ± 0.576
1.748GlnLys: 1.748 ± 1.011
3.205GlnLeu: 3.205 ± 0.952
1.166GlnMet: 1.166 ± 0.232
0.583GlnAsn: 0.583 ± 0.306
1.166GlnPro: 1.166 ± 0.576
1.166GlnGln: 1.166 ± 0.232
2.04GlnArg: 2.04 ± 0.362
2.622GlnSer: 2.622 ± 0.374
2.331GlnThr: 2.331 ± 0.767
3.205GlnVal: 3.205 ± 1.035
0.0GlnTrp: 0.0 ± 0.0
2.914GlnTyr: 2.914 ± 0.381
0.0GlnXaa: 0.0 ± 0.0
Arg
2.622ArgAla: 2.622 ± 0.374
1.748ArgCys: 1.748 ± 0.424
2.622ArgAsp: 2.622 ± 1.197
2.622ArgGlu: 2.622 ± 0.261
1.166ArgPhe: 1.166 ± 0.612
5.828ArgGly: 5.828 ± 4.451
2.04ArgHis: 2.04 ± 1.18
2.914ArgIle: 2.914 ± 1.44
4.079ArgLys: 4.079 ± 1.01
6.119ArgLeu: 6.119 ± 1.087
0.291ArgMet: 0.291 ± 0.169
3.205ArgAsn: 3.205 ± 1.035
1.166ArgPro: 1.166 ± 0.733
2.622ArgGln: 2.622 ± 0.261
2.914ArgArg: 2.914 ± 0.661
3.788ArgSer: 3.788 ± 0.81
1.457ArgThr: 1.457 ± 0.451
2.622ArgVal: 2.622 ± 1.846
0.874ArgTrp: 0.874 ± 0.3
1.457ArgTyr: 1.457 ± 0.843
0.0ArgXaa: 0.0 ± 0.0
Ser
3.788SerAla: 3.788 ± 0.709
1.166SerCys: 1.166 ± 0.326
4.662SerAsp: 4.662 ± 1.028
2.622SerGlu: 2.622 ± 1.077
2.04SerPhe: 2.04 ± 0.802
7.867SerGly: 7.867 ± 3.644
2.331SerHis: 2.331 ± 0.652
6.119SerIle: 6.119 ± 1.394
4.953SerLys: 4.953 ± 0.912
7.284SerLeu: 7.284 ± 1.508
1.166SerMet: 1.166 ± 0.378
4.662SerAsn: 4.662 ± 0.822
2.04SerPro: 2.04 ± 1.239
4.662SerGln: 4.662 ± 0.893
4.662SerArg: 4.662 ± 0.597
5.828SerSer: 5.828 ± 0.718
5.536SerThr: 5.536 ± 1.24
3.497SerVal: 3.497 ± 1.317
0.874SerTrp: 0.874 ± 0.399
2.04SerTyr: 2.04 ± 0.753
0.0SerXaa: 0.0 ± 0.0
Thr
4.079ThrAla: 4.079 ± 0.366
0.583ThrCys: 0.583 ± 0.288
1.457ThrAsp: 1.457 ± 0.582
1.457ThrGlu: 1.457 ± 0.451
2.622ThrPhe: 2.622 ± 0.874
3.205ThrGly: 3.205 ± 0.996
3.497ThrHis: 3.497 ± 0.314
2.622ThrIle: 2.622 ± 0.261
1.457ThrLys: 1.457 ± 1.457
6.41ThrLeu: 6.41 ± 0.948
0.583ThrMet: 0.583 ± 0.337
2.331ThrAsn: 2.331 ± 0.447
3.788ThrPro: 3.788 ± 1.031
2.622ThrGln: 2.622 ± 1.082
2.914ThrArg: 2.914 ± 1.44
6.702ThrSer: 6.702 ± 3.105
4.371ThrThr: 4.371 ± 1.724
3.788ThrVal: 3.788 ± 1.349
1.457ThrTrp: 1.457 ± 0.503
4.953ThrTyr: 4.953 ± 1.125
0.0ThrXaa: 0.0 ± 0.0
Val
2.622ValAla: 2.622 ± 0.808
1.457ValCys: 1.457 ± 0.503
2.622ValAsp: 2.622 ± 0.782
4.079ValGlu: 4.079 ± 0.855
2.914ValPhe: 2.914 ± 0.623
1.457ValGly: 1.457 ± 0.582
2.04ValHis: 2.04 ± 0.279
4.079ValIle: 4.079 ± 0.201
2.622ValLys: 2.622 ± 1.525
5.828ValLeu: 5.828 ± 1.239
1.166ValMet: 1.166 ± 0.576
2.622ValAsn: 2.622 ± 0.874
2.622ValPro: 2.622 ± 0.782
2.331ValGln: 2.331 ± 1.467
2.622ValArg: 2.622 ± 0.9
5.536ValSer: 5.536 ± 1.03
4.371ValThr: 4.371 ± 0.211
3.497ValVal: 3.497 ± 1.167
0.583ValTrp: 0.583 ± 0.288
1.748ValTyr: 1.748 ± 1.011
0.0ValXaa: 0.0 ± 0.0
Trp
0.874TrpAla: 0.874 ± 0.257
0.874TrpCys: 0.874 ± 0.506
0.291TrpAsp: 0.291 ± 0.396
0.874TrpGlu: 0.874 ± 0.506
0.583TrpPhe: 0.583 ± 0.791
0.583TrpGly: 0.583 ± 0.337
0.0TrpHis: 0.0 ± 0.0
1.748TrpIle: 1.748 ± 0.798
0.583TrpLys: 0.583 ± 0.337
1.166TrpLeu: 1.166 ± 0.232
0.874TrpMet: 0.874 ± 0.3
0.583TrpAsn: 0.583 ± 0.288
0.874TrpPro: 0.874 ± 0.3
0.0TrpGln: 0.0 ± 0.0
1.457TrpArg: 1.457 ± 0.503
0.874TrpSer: 0.874 ± 0.506
0.583TrpThr: 0.583 ± 0.785
1.457TrpVal: 1.457 ± 0.503
0.0TrpTrp: 0.0 ± 0.0
0.874TrpTyr: 0.874 ± 0.399
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.748TyrAla: 1.748 ± 0.648
0.874TyrCys: 0.874 ± 0.671
2.331TyrAsp: 2.331 ± 0.447
1.457TyrGlu: 1.457 ± 0.843
0.874TyrPhe: 0.874 ± 0.506
4.662TyrGly: 4.662 ± 1.116
2.04TyrHis: 2.04 ± 0.753
4.079TyrIle: 4.079 ± 1.217
1.748TyrLys: 1.748 ± 0.514
5.828TyrLeu: 5.828 ± 2.468
0.291TyrMet: 0.291 ± 0.169
2.04TyrAsn: 2.04 ± 1.18
2.04TyrPro: 2.04 ± 0.631
1.748TyrGln: 1.748 ± 0.597
1.457TyrArg: 1.457 ± 0.503
2.331TyrSer: 2.331 ± 1.152
1.457TyrThr: 1.457 ± 0.451
1.748TyrVal: 1.748 ± 0.597
0.291TyrTrp: 0.291 ± 0.169
1.457TyrTyr: 1.457 ± 0.843
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski