Amino acid dipeptide frequency for Beihai shrimp virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
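As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the code used by the database: the function name dipeptide_per_mille, the one-letter input format, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of one-letter sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the virus proteome).
freqs = dipeptide_per_mille(["MKAAL", "AAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input, "B" is not a standard residue, so the second sequence is read as "AAX" and contributes the pairs AA and AX.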

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
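The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. This is only an illustration: the helper names are hypothetical, and whether the page colours cells against the 400-dipeptide or the 441-dipeptide expectation is not stated, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size: int = 20) -> float:
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def per_mille_to_fraction(value: float) -> float:
    """Convert a table value given in per mille to a plain fraction."""
    return value * 1e-3

def representation(observed_per_mille: float, alphabet_size: int = 20) -> str:
    """Label a value as over- or under-represented relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille())      # 2.5 per mille, i.e. 0.25%
print(representation(11.374))    # GluGlu value from the table
print(representation(0.307))     # CysCys value from the table
```

With 20 amino acids, the GluGlu value of 11.374‰ lies well above the 2.5‰ expectation, while the CysCys value of 0.307‰ lies below it.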

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.148AlaAla: 6.148 ± 1.704
1.844AlaCys: 1.844 ± 0.934
4.611AlaAsp: 4.611 ± 1.63
3.381AlaGlu: 3.381 ± 0.402
1.844AlaPhe: 1.844 ± 0.229
4.304AlaGly: 4.304 ± 1.474
1.23AlaHis: 1.23 ± 0.082
3.381AlaIle: 3.381 ± 1.007
4.304AlaLys: 4.304 ± 0.065
4.304AlaLeu: 4.304 ± 0.64
3.689AlaMet: 3.689 ± 0.246
3.381AlaAsn: 3.381 ± 0.303
7.993AlaPro: 7.993 ± 0.523
1.844AlaGln: 1.844 ± 1.18
5.841AlaArg: 5.841 ± 0.843
7.378AlaSer: 7.378 ± 0.917
4.611AlaThr: 4.611 ± 0.925
3.996AlaVal: 3.996 ± 0.614
0.922AlaTrp: 0.922 ± 0.467
1.23AlaTyr: 1.23 ± 0.787
0.0AlaXaa: 0.0 ± 0.0
Cys
2.767CysAla: 2.767 ± 0.009
0.307CysCys: 0.307 ± 0.156
1.23CysAsp: 1.23 ± 0.623
1.23CysGlu: 1.23 ± 0.623
0.615CysPhe: 0.615 ± 0.311
2.459CysGly: 2.459 ± 0.164
1.537CysHis: 1.537 ± 0.778
0.307CysIle: 0.307 ± 0.156
0.922CysLys: 0.922 ± 0.467
0.615CysLeu: 0.615 ± 0.311
1.537CysMet: 1.537 ± 0.631
0.0CysAsn: 0.0 ± 0.0
0.922CysPro: 0.922 ± 0.467
0.615CysGln: 0.615 ± 0.311
0.922CysArg: 0.922 ± 0.238
0.307CysSer: 0.307 ± 0.156
0.307CysThr: 0.307 ± 0.156
1.844CysVal: 1.844 ± 0.475
0.615CysTrp: 0.615 ± 0.311
0.615CysTyr: 0.615 ± 0.311
0.0CysXaa: 0.0 ± 0.0
Asp
4.304AspAla: 4.304 ± 1.474
1.23AspCys: 1.23 ± 0.623
2.767AspAsp: 2.767 ± 0.713
3.689AspGlu: 3.689 ± 1.163
4.919AspPhe: 4.919 ± 0.376
3.074AspGly: 3.074 ± 0.852
1.844AspHis: 1.844 ± 0.229
3.074AspIle: 3.074 ± 1.556
0.307AspLys: 0.307 ± 0.549
4.919AspLeu: 4.919 ± 0.328
1.537AspMet: 1.537 ± 0.631
0.615AspAsn: 0.615 ± 0.393
2.152AspPro: 2.152 ± 0.32
3.074AspGln: 3.074 ± 1.262
1.537AspArg: 1.537 ± 0.074
3.689AspSer: 3.689 ± 1.163
3.996AspThr: 3.996 ± 0.091
3.996AspVal: 3.996 ± 1.5
1.844AspTrp: 1.844 ± 0.475
2.152AspTyr: 2.152 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
6.148GluAla: 6.148 ± 1.704
1.537GluCys: 1.537 ± 0.631
2.459GluAsp: 2.459 ± 0.164
11.374GluGlu: 11.374 ± 4.35
2.152GluPhe: 2.152 ± 1.09
3.689GluGly: 3.689 ± 1.163
1.23GluHis: 1.23 ± 0.623
4.919GluIle: 4.919 ± 0.328
4.611GluLys: 4.611 ± 1.63
4.919GluLeu: 4.919 ± 1.033
1.844GluMet: 1.844 ± 0.229
3.074GluAsn: 3.074 ± 0.852
1.537GluPro: 1.537 ± 0.074
1.23GluGln: 1.23 ± 0.623
2.152GluArg: 2.152 ± 0.32
6.456GluSer: 6.456 ± 0.45
3.381GluThr: 3.381 ± 0.402
9.53GluVal: 9.53 ± 0.108
0.922GluTrp: 0.922 ± 0.942
1.844GluTyr: 1.844 ± 0.229
0.0GluXaa: 0.0 ± 0.0
Phe
3.381PheAla: 3.381 ± 0.402
0.307PheCys: 0.307 ± 0.156
3.074PheAsp: 3.074 ± 0.558
1.844PheGlu: 1.844 ± 0.229
2.459PhePhe: 2.459 ± 0.164
3.381PheGly: 3.381 ± 0.402
1.537PheHis: 1.537 ± 0.631
2.767PheIle: 2.767 ± 1.401
3.381PheLys: 3.381 ± 0.402
4.919PheLeu: 4.919 ± 1.081
0.922PheMet: 0.922 ± 0.238
1.537PheAsn: 1.537 ± 0.074
1.537PhePro: 1.537 ± 0.631
1.23PheGln: 1.23 ± 0.082
2.152PheArg: 2.152 ± 1.09
2.459PheSer: 2.459 ± 0.869
3.996PheThr: 3.996 ± 2.205
1.844PheVal: 1.844 ± 1.18
1.23PheTrp: 1.23 ± 0.623
3.074PheTyr: 3.074 ± 0.558
0.0PheXaa: 0.0 ± 0.0
Gly
4.611GlyAla: 4.611 ± 0.221
1.23GlyCys: 1.23 ± 0.082
2.459GlyAsp: 2.459 ± 0.164
3.689GlyGlu: 3.689 ± 1.656
2.767GlyPhe: 2.767 ± 0.713
3.074GlyGly: 3.074 ± 1.262
0.615GlyHis: 0.615 ± 0.311
3.381GlyIle: 3.381 ± 1.712
4.919GlyLys: 4.919 ± 1.786
3.689GlyLeu: 3.689 ± 0.246
1.23GlyMet: 1.23 ± 0.623
2.767GlyAsn: 2.767 ± 0.713
1.537GlyPro: 1.537 ± 0.631
2.767GlyGln: 2.767 ± 0.713
3.996GlyArg: 3.996 ± 0.614
2.459GlySer: 2.459 ± 2.278
2.767GlyThr: 2.767 ± 0.009
4.611GlyVal: 4.611 ± 0.221
1.23GlyTrp: 1.23 ± 0.082
2.459GlyTyr: 2.459 ± 0.164
0.0GlyXaa: 0.0 ± 0.0
His
0.922HisAla: 0.922 ± 0.467
0.922HisCys: 0.922 ± 0.467
1.23HisAsp: 1.23 ± 0.082
2.152HisGlu: 2.152 ± 0.385
0.922HisPhe: 0.922 ± 0.238
1.23HisGly: 1.23 ± 0.623
0.615HisHis: 0.615 ± 0.311
1.537HisIle: 1.537 ± 0.778
1.23HisLys: 1.23 ± 1.491
1.23HisLeu: 1.23 ± 0.623
1.537HisMet: 1.537 ± 0.778
0.0HisAsn: 0.0 ± 0.0
1.23HisPro: 1.23 ± 0.787
0.307HisGln: 0.307 ± 0.156
1.537HisArg: 1.537 ± 0.778
1.537HisSer: 1.537 ± 2.041
0.615HisThr: 0.615 ± 0.311
1.23HisVal: 1.23 ± 0.082
0.0HisTrp: 0.0 ± 0.0
1.23HisTyr: 1.23 ± 0.623
0.0HisXaa: 0.0 ± 0.0
Ile
4.919IleAla: 4.919 ± 2.49
0.615IleCys: 0.615 ± 0.311
4.611IleAsp: 4.611 ± 0.484
3.996IleGlu: 3.996 ± 1.5
2.767IlePhe: 2.767 ± 0.009
2.459IleGly: 2.459 ± 1.245
1.844IleHis: 1.844 ± 0.934
3.074IleIle: 3.074 ± 0.147
2.459IleLys: 2.459 ± 0.164
4.919IleLeu: 4.919 ± 0.328
2.152IleMet: 2.152 ± 0.32
2.767IleAsn: 2.767 ± 0.713
3.074IlePro: 3.074 ± 1.262
1.537IleGln: 1.537 ± 0.074
2.459IleArg: 2.459 ± 0.54
3.381IleSer: 3.381 ± 0.303
4.611IleThr: 4.611 ± 1.189
3.381IleVal: 3.381 ± 1.007
0.615IleTrp: 0.615 ± 0.311
1.537IleTyr: 1.537 ± 0.778
0.0IleXaa: 0.0 ± 0.0
Lys
5.841LysAla: 5.841 ± 2.253
0.307LysCys: 0.307 ± 0.156
3.996LysAsp: 3.996 ± 0.091
3.689LysGlu: 3.689 ± 0.246
3.996LysPhe: 3.996 ± 2.205
1.844LysGly: 1.844 ± 0.229
0.922LysHis: 0.922 ± 0.238
3.996LysIle: 3.996 ± 0.795
3.689LysLys: 3.689 ± 0.458
3.381LysLeu: 3.381 ± 1.107
1.23LysMet: 1.23 ± 0.623
1.537LysAsn: 1.537 ± 0.074
2.767LysPro: 2.767 ± 0.696
2.152LysGln: 2.152 ± 1.09
3.074LysArg: 3.074 ± 1.556
4.304LysSer: 4.304 ± 0.64
4.304LysThr: 4.304 ± 2.179
2.459LysVal: 2.459 ± 0.54
1.23LysTrp: 1.23 ± 0.623
1.844LysTyr: 1.844 ± 0.475
0.0LysXaa: 0.0 ± 0.0
Leu
3.996LeuAla: 3.996 ± 0.795
1.844LeuCys: 1.844 ± 0.475
4.919LeuAsp: 4.919 ± 1.786
7.378LeuGlu: 7.378 ± 0.917
3.381LeuPhe: 3.381 ± 1.107
2.767LeuGly: 2.767 ± 0.696
2.152LeuHis: 2.152 ± 0.32
3.074LeuIle: 3.074 ± 0.147
5.841LeuLys: 5.841 ± 0.139
8.3LeuLeu: 8.3 ± 0.73
2.152LeuMet: 2.152 ± 0.953
3.689LeuAsn: 3.689 ± 1.656
3.689LeuPro: 3.689 ± 0.951
3.996LeuGln: 3.996 ± 0.091
3.381LeuArg: 3.381 ± 0.402
5.533LeuSer: 5.533 ± 0.722
5.226LeuThr: 5.226 ± 0.532
4.611LeuVal: 4.611 ± 0.221
0.307LeuTrp: 0.307 ± 0.156
2.767LeuTyr: 2.767 ± 0.696
0.0LeuXaa: 0.0 ± 0.0
Met
2.459MetAla: 2.459 ± 0.54
0.0MetCys: 0.0 ± 0.0
2.152MetAsp: 2.152 ± 0.385
3.381MetGlu: 3.381 ± 0.303
1.844MetPhe: 1.844 ± 0.934
2.152MetGly: 2.152 ± 1.729
0.615MetHis: 0.615 ± 1.098
1.23MetIle: 1.23 ± 0.082
2.767MetLys: 2.767 ± 2.123
0.922MetLeu: 0.922 ± 0.238
1.23MetMet: 1.23 ± 0.082
0.922MetAsn: 0.922 ± 0.942
1.23MetPro: 1.23 ± 0.082
0.615MetGln: 0.615 ± 0.393
1.537MetArg: 1.537 ± 0.074
1.537MetSer: 1.537 ± 0.074
2.152MetThr: 2.152 ± 0.32
2.152MetVal: 2.152 ± 1.09
0.615MetTrp: 0.615 ± 0.311
1.23MetTyr: 1.23 ± 0.082
0.0MetXaa: 0.0 ± 0.0
Asn
2.767AsnAla: 2.767 ± 0.009
2.152AsnCys: 2.152 ± 0.385
0.922AsnAsp: 0.922 ± 0.942
3.996AsnGlu: 3.996 ± 0.614
1.844AsnPhe: 1.844 ± 1.18
4.304AsnGly: 4.304 ± 0.065
0.615AsnHis: 0.615 ± 1.098
1.844AsnIle: 1.844 ± 1.18
1.23AsnLys: 1.23 ± 0.082
2.459AsnLeu: 2.459 ± 0.869
1.537AsnMet: 1.537 ± 0.631
0.922AsnAsn: 0.922 ± 0.467
3.689AsnPro: 3.689 ± 1.656
0.615AsnGln: 0.615 ± 0.393
2.152AsnArg: 2.152 ± 1.09
3.381AsnSer: 3.381 ± 1.107
1.23AsnThr: 1.23 ± 0.787
1.537AsnVal: 1.537 ± 0.778
0.922AsnTrp: 0.922 ± 0.238
2.459AsnTyr: 2.459 ± 0.869
0.0AsnXaa: 0.0 ± 0.0
Pro
4.304ProAla: 4.304 ± 0.77
0.922ProCys: 0.922 ± 0.238
2.459ProAsp: 2.459 ± 0.869
2.459ProGlu: 2.459 ± 0.54
2.459ProPhe: 2.459 ± 0.869
1.23ProGly: 1.23 ± 0.623
0.922ProHis: 0.922 ± 0.467
3.381ProIle: 3.381 ± 1.811
2.767ProLys: 2.767 ± 0.696
4.919ProLeu: 4.919 ± 1.033
0.615ProMet: 0.615 ± 0.311
1.537ProAsn: 1.537 ± 0.074
2.767ProPro: 2.767 ± 1.418
2.152ProGln: 2.152 ± 0.385
1.537ProArg: 1.537 ± 0.631
3.689ProSer: 3.689 ± 1.868
5.533ProThr: 5.533 ± 1.426
4.304ProVal: 4.304 ± 1.344
0.615ProTrp: 0.615 ± 0.393
2.152ProTyr: 2.152 ± 1.729
0.0ProXaa: 0.0 ± 0.0
Gln
3.689GlnAla: 3.689 ± 0.458
0.615GlnCys: 0.615 ± 0.311
0.922GlnAsp: 0.922 ± 0.238
2.767GlnGlu: 2.767 ± 0.009
2.152GlnPhe: 2.152 ± 0.385
0.307GlnGly: 0.307 ± 0.156
0.307GlnHis: 0.307 ± 0.156
3.074GlnIle: 3.074 ± 0.558
1.537GlnLys: 1.537 ± 0.074
2.152GlnLeu: 2.152 ± 0.385
0.922GlnMet: 0.922 ± 0.467
1.844GlnAsn: 1.844 ± 0.229
3.074GlnPro: 3.074 ± 0.558
1.23GlnGln: 1.23 ± 0.623
1.844GlnArg: 1.844 ± 0.934
1.844GlnSer: 1.844 ± 1.885
2.152GlnThr: 2.152 ± 0.32
2.152GlnVal: 2.152 ± 0.385
0.0GlnTrp: 0.0 ± 0.0
1.23GlnTyr: 1.23 ± 0.623
0.0GlnXaa: 0.0 ± 0.0
Arg
1.844ArgAla: 1.844 ± 0.934
0.615ArgCys: 0.615 ± 0.311
2.459ArgAsp: 2.459 ± 0.54
3.996ArgGlu: 3.996 ± 2.023
1.844ArgPhe: 1.844 ± 0.475
3.074ArgGly: 3.074 ± 1.262
1.537ArgHis: 1.537 ± 0.778
3.689ArgIle: 3.689 ± 0.458
3.074ArgLys: 3.074 ± 1.556
4.919ArgLeu: 4.919 ± 1.033
1.537ArgMet: 1.537 ± 0.074
2.459ArgAsn: 2.459 ± 0.869
2.767ArgPro: 2.767 ± 0.009
0.922ArgGln: 0.922 ± 0.467
3.996ArgArg: 3.996 ± 1.319
4.304ArgSer: 4.304 ± 0.065
1.844ArgThr: 1.844 ± 0.229
3.996ArgVal: 3.996 ± 0.614
0.922ArgTrp: 0.922 ± 0.238
2.459ArgTyr: 2.459 ± 0.54
0.0ArgXaa: 0.0 ± 0.0
Ser
3.689SerAla: 3.689 ± 0.951
0.615SerCys: 0.615 ± 0.311
4.611SerAsp: 4.611 ± 0.221
4.919SerGlu: 4.919 ± 0.328
3.074SerPhe: 3.074 ± 0.558
7.378SerGly: 7.378 ± 4.016
0.615SerHis: 0.615 ± 0.311
3.996SerIle: 3.996 ± 0.795
3.074SerLys: 3.074 ± 1.556
5.841SerLeu: 5.841 ± 1.271
0.922SerMet: 0.922 ± 0.238
4.304SerAsn: 4.304 ± 0.065
3.689SerPro: 3.689 ± 0.458
1.537SerGln: 1.537 ± 0.074
4.304SerArg: 4.304 ± 0.64
3.996SerSer: 3.996 ± 0.614
3.381SerThr: 3.381 ± 1.107
4.611SerVal: 4.611 ± 0.221
0.922SerTrp: 0.922 ± 0.238
2.459SerTyr: 2.459 ± 0.54
0.0SerXaa: 0.0 ± 0.0
Thr
7.378ThrAla: 7.378 ± 0.212
1.844ThrCys: 1.844 ± 0.934
3.996ThrAsp: 3.996 ± 0.091
0.922ThrGlu: 0.922 ± 0.467
3.381ThrPhe: 3.381 ± 0.303
3.381ThrGly: 3.381 ± 0.303
1.844ThrHis: 1.844 ± 0.229
3.074ThrIle: 3.074 ± 1.262
3.074ThrLys: 3.074 ± 0.147
4.919ThrLeu: 4.919 ± 0.328
2.459ThrMet: 2.459 ± 2.278
1.844ThrAsn: 1.844 ± 1.18
1.844ThrPro: 1.844 ± 0.229
1.844ThrGln: 1.844 ± 0.229
3.074ThrArg: 3.074 ± 0.147
3.689ThrSer: 3.689 ± 0.246
3.996ThrThr: 3.996 ± 0.091
4.304ThrVal: 4.304 ± 2.049
1.537ThrTrp: 1.537 ± 0.074
2.152ThrTyr: 2.152 ± 1.025
0.0ThrXaa: 0.0 ± 0.0
Val
4.919ValAla: 4.919 ± 3.147
1.844ValCys: 1.844 ± 0.229
0.615ValAsp: 0.615 ± 0.311
6.456ValGlu: 6.456 ± 0.255
1.844ValPhe: 1.844 ± 0.229
3.074ValGly: 3.074 ± 0.558
0.615ValHis: 0.615 ± 0.311
3.381ValIle: 3.381 ± 0.303
3.996ValLys: 3.996 ± 1.319
7.993ValLeu: 7.993 ± 4.047
2.459ValMet: 2.459 ± 0.54
4.304ValAsn: 4.304 ± 2.049
2.459ValPro: 2.459 ± 0.164
3.074ValGln: 3.074 ± 0.147
5.226ValArg: 5.226 ± 0.173
4.304ValSer: 4.304 ± 0.065
3.689ValThr: 3.689 ± 0.951
3.381ValVal: 3.381 ± 1.712
0.307ValTrp: 0.307 ± 0.549
2.767ValTyr: 2.767 ± 0.713
0.0ValXaa: 0.0 ± 0.0
Trp
0.615TrpAla: 0.615 ± 0.311
0.0TrpCys: 0.0 ± 0.0
0.922TrpAsp: 0.922 ± 0.238
1.844TrpGlu: 1.844 ± 0.934
0.615TrpPhe: 0.615 ± 0.311
0.615TrpGly: 0.615 ± 0.393
0.0TrpHis: 0.0 ± 0.0
1.844TrpIle: 1.844 ± 0.229
0.307TrpLys: 0.307 ± 0.156
1.23TrpLeu: 1.23 ± 0.082
0.615TrpMet: 0.615 ± 0.393
0.615TrpAsn: 0.615 ± 0.393
0.307TrpPro: 0.307 ± 0.156
1.23TrpGln: 1.23 ± 0.623
0.0TrpArg: 0.0 ± 0.0
1.537TrpSer: 1.537 ± 1.336
0.922TrpThr: 0.922 ± 0.238
1.537TrpVal: 1.537 ± 0.074
0.0TrpTrp: 0.0 ± 0.0
0.922TrpTyr: 0.922 ± 0.467
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.537TyrAla: 1.537 ± 0.074
0.922TyrCys: 0.922 ± 0.238
4.304TyrAsp: 4.304 ± 0.77
1.844TyrGlu: 1.844 ± 0.934
2.152TyrPhe: 2.152 ± 0.32
2.459TyrGly: 2.459 ± 0.869
0.615TyrHis: 0.615 ± 0.393
2.152TyrIle: 2.152 ± 1.09
3.074TyrLys: 3.074 ± 0.558
2.767TyrLeu: 2.767 ± 0.009
0.307TyrMet: 0.307 ± 0.293
2.459TyrAsn: 2.459 ± 0.869
2.459TyrPro: 2.459 ± 0.164
1.844TyrGln: 1.844 ± 0.934
1.537TyrArg: 1.537 ± 0.631
2.152TyrSer: 2.152 ± 0.32
1.844TyrThr: 1.844 ± 0.475
1.23TyrVal: 1.23 ± 0.082
0.922TyrTrp: 0.922 ± 0.467
1.537TyrTyr: 1.537 ± 0.631
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3254 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
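A protein-level bootstrap of this kind can be sketched as below. The page does not describe the exact procedure, so this is an assumption-laden illustration: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are choices made for this example only.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the dipeptide frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Toy proteome (hypothetical sequences).
print(bootstrap_error(["AAAA", "CCCC"], "AA"))
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so error estimates obtained this way are necessarily coarse.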

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski