Amino acid dipepetide frequency for Wenzhou shrimp virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.139AlaAla: 5.139 ± 1.309
0.907AlaCys: 0.907 ± 0.466
3.023AlaAsp: 3.023 ± 2.395
3.023AlaGlu: 3.023 ± 0.915
1.209AlaPhe: 1.209 ± 0.366
4.232AlaGly: 4.232 ± 0.788
1.209AlaHis: 1.209 ± 0.366
3.628AlaIle: 3.628 ± 0.382
3.628AlaLys: 3.628 ± 0.876
7.557AlaLeu: 7.557 ± 0.92
1.511AlaMet: 1.511 ± 0.283
2.116AlaAsn: 2.116 ± 0.1
2.721AlaPro: 2.721 ± 0.083
2.116AlaGln: 2.116 ± 0.593
3.628AlaArg: 3.628 ± 1.369
6.953AlaSer: 6.953 ± 1.365
4.232AlaThr: 4.232 ± 0.693
3.023AlaVal: 3.023 ± 0.422
1.209AlaTrp: 1.209 ± 0.859
1.814AlaTyr: 1.814 ± 0.056
0.0AlaXaa: 0.0 ± 0.0
Cys
0.907CysAla: 0.907 ± 0.466
0.0CysCys: 0.0 ± 0.0
1.209CysAsp: 1.209 ± 0.127
2.116CysGlu: 2.116 ± 0.1
0.605CysPhe: 0.605 ± 0.31
0.907CysGly: 0.907 ± 0.028
0.0CysHis: 0.0 ± 0.0
0.605CysIle: 0.605 ± 0.31
1.511CysLys: 1.511 ± 0.776
0.605CysLeu: 0.605 ± 0.183
0.605CysMet: 0.605 ± 0.31
0.302CysAsn: 0.302 ± 0.155
1.209CysPro: 1.209 ± 0.859
1.511CysGln: 1.511 ± 0.776
0.302CysArg: 0.302 ± 0.338
0.907CysSer: 0.907 ± 0.028
1.814CysThr: 1.814 ± 0.438
0.605CysVal: 0.605 ± 0.31
0.0CysTrp: 0.0 ± 0.0
0.605CysTyr: 0.605 ± 0.676
0.0CysXaa: 0.0 ± 0.0
Asp
2.418AspAla: 2.418 ± 0.748
1.511AspCys: 1.511 ± 0.211
5.744AspAsp: 5.744 ± 1.469
3.325AspGlu: 3.325 ± 0.72
5.441AspPhe: 5.441 ± 1.313
2.721AspGly: 2.721 ± 0.083
2.116AspHis: 2.116 ± 0.394
4.534AspIle: 4.534 ± 0.354
3.628AspLys: 3.628 ± 1.369
3.325AspLeu: 3.325 ± 1.253
1.814AspMet: 1.814 ± 0.056
4.232AspAsn: 4.232 ± 0.693
3.325AspPro: 3.325 ± 0.227
2.116AspGln: 2.116 ± 1.381
3.023AspArg: 3.023 ± 0.565
4.534AspSer: 4.534 ± 1.341
3.325AspThr: 3.325 ± 1.747
5.139AspVal: 5.139 ± 0.816
0.605AspTrp: 0.605 ± 0.31
3.325AspTyr: 3.325 ± 1.707
0.0AspXaa: 0.0 ± 0.0
Glu
4.534GluAla: 4.534 ± 0.139
1.209GluCys: 1.209 ± 0.127
4.534GluAsp: 4.534 ± 0.354
4.837GluGlu: 4.837 ± 0.51
1.814GluPhe: 1.814 ± 0.931
4.534GluGly: 4.534 ± 0.633
1.209GluHis: 1.209 ± 0.127
3.023GluIle: 3.023 ± 1.059
0.907GluLys: 0.907 ± 0.028
6.046GluLeu: 6.046 ± 0.35
2.721GluMet: 2.721 ± 0.41
2.418GluAsn: 2.418 ± 0.255
1.511GluPro: 1.511 ± 0.211
1.209GluGln: 1.209 ± 0.366
4.232GluArg: 4.232 ± 0.199
2.721GluSer: 2.721 ± 0.903
2.721GluThr: 2.721 ± 1.07
3.93GluVal: 3.93 ± 0.449
0.302GluTrp: 0.302 ± 0.155
1.511GluTyr: 1.511 ± 0.211
0.0GluXaa: 0.0 ± 0.0
Phe
2.116PheAla: 2.116 ± 0.1
1.209PheCys: 1.209 ± 0.621
3.628PheAsp: 3.628 ± 1.369
1.511PheGlu: 1.511 ± 0.211
1.814PhePhe: 1.814 ± 0.438
1.814PheGly: 1.814 ± 0.056
1.814PheHis: 1.814 ± 0.549
2.721PheIle: 2.721 ± 0.41
2.116PheLys: 2.116 ± 0.593
3.325PheLeu: 3.325 ± 0.266
0.605PheMet: 0.605 ± 0.31
2.116PheAsn: 2.116 ± 1.086
1.209PhePro: 1.209 ± 0.621
1.209PheGln: 1.209 ± 0.366
1.511PheArg: 1.511 ± 0.283
4.837PheSer: 4.837 ± 1.464
4.837PheThr: 4.837 ± 0.51
3.93PheVal: 3.93 ± 0.537
0.302PheTrp: 0.302 ± 0.155
0.907PheTyr: 0.907 ± 0.521
0.0PheXaa: 0.0 ± 0.0
Gly
3.325GlyAla: 3.325 ± 1.747
0.302GlyCys: 0.302 ± 0.338
3.628GlyAsp: 3.628 ± 1.098
3.023GlyGlu: 3.023 ± 0.072
2.721GlyPhe: 2.721 ± 0.577
2.721GlyGly: 2.721 ± 0.083
1.511GlyHis: 1.511 ± 0.211
3.023GlyIle: 3.023 ± 0.072
2.116GlyLys: 2.116 ± 1.086
2.721GlyLeu: 2.721 ± 0.41
1.511GlyMet: 1.511 ± 0.704
3.628GlyAsn: 3.628 ± 0.605
1.511GlyPro: 1.511 ± 0.704
2.721GlyGln: 2.721 ± 0.41
1.814GlyArg: 1.814 ± 0.549
2.418GlySer: 2.418 ± 0.732
5.139GlyThr: 5.139 ± 0.171
3.023GlyVal: 3.023 ± 2.889
0.605GlyTrp: 0.605 ± 0.31
1.814GlyTyr: 1.814 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
0.907HisAla: 0.907 ± 1.015
0.302HisCys: 0.302 ± 0.155
1.209HisAsp: 1.209 ± 0.127
1.511HisGlu: 1.511 ± 0.211
0.907HisPhe: 0.907 ± 0.028
0.907HisGly: 0.907 ± 0.466
0.605HisHis: 0.605 ± 0.676
2.116HisIle: 2.116 ± 0.394
1.209HisLys: 1.209 ± 0.621
1.814HisLeu: 1.814 ± 1.042
0.605HisMet: 0.605 ± 0.183
0.302HisAsn: 0.302 ± 0.338
1.511HisPro: 1.511 ± 1.198
2.116HisGln: 2.116 ± 0.593
0.302HisArg: 0.302 ± 0.155
2.418HisSer: 2.418 ± 0.239
2.418HisThr: 2.418 ± 1.226
2.116HisVal: 2.116 ± 0.1
0.302HisTrp: 0.302 ± 0.338
0.605HisTyr: 0.605 ± 0.31
0.0HisXaa: 0.0 ± 0.0
Ile
5.441IleAla: 5.441 ± 0.82
0.907IleCys: 0.907 ± 0.028
4.837IleAsp: 4.837 ± 0.016
3.93IleGlu: 3.93 ± 0.449
2.418IlePhe: 2.418 ± 0.255
2.116IleGly: 2.116 ± 0.887
1.511IleHis: 1.511 ± 0.283
3.93IleIle: 3.93 ± 0.943
3.023IleLys: 3.023 ± 0.565
3.628IleLeu: 3.628 ± 0.876
0.907IleMet: 0.907 ± 0.466
3.023IleAsn: 3.023 ± 0.565
3.628IlePro: 3.628 ± 0.382
1.814IleGln: 1.814 ± 0.438
2.721IleArg: 2.721 ± 0.083
6.651IleSer: 6.651 ± 0.533
6.348IleThr: 6.348 ± 1.182
3.023IleVal: 3.023 ± 0.072
0.302IleTrp: 0.302 ± 0.155
2.721IleTyr: 2.721 ± 0.083
0.0IleXaa: 0.0 ± 0.0
Lys
2.116LysAla: 2.116 ± 0.593
0.605LysCys: 0.605 ± 0.31
3.023LysAsp: 3.023 ± 1.059
3.325LysGlu: 3.325 ± 1.707
3.023LysPhe: 3.023 ± 0.565
1.209LysGly: 1.209 ± 0.127
0.907LysHis: 0.907 ± 0.028
4.534LysIle: 4.534 ± 1.835
3.93LysLys: 3.93 ± 1.524
5.139LysLeu: 5.139 ± 1.652
1.209LysMet: 1.209 ± 0.621
2.721LysAsn: 2.721 ± 0.083
2.721LysPro: 2.721 ± 1.564
1.209LysGln: 1.209 ± 0.127
4.534LysArg: 4.534 ± 2.328
3.325LysSer: 3.325 ± 1.214
3.93LysThr: 3.93 ± 0.537
2.116LysVal: 2.116 ± 0.593
0.907LysTrp: 0.907 ± 0.028
1.511LysTyr: 1.511 ± 0.283
0.0LysXaa: 0.0 ± 0.0
Leu
6.953LeuAla: 6.953 ± 0.378
2.418LeuCys: 2.418 ± 0.748
7.557LeuAsp: 7.557 ± 1.906
4.534LeuGlu: 4.534 ± 0.139
4.534LeuPhe: 4.534 ± 1.835
3.628LeuGly: 3.628 ± 0.382
2.116LeuHis: 2.116 ± 0.1
4.534LeuIle: 4.534 ± 0.848
4.837LeuLys: 4.837 ± 1.003
5.139LeuLeu: 5.139 ± 0.665
1.209LeuMet: 1.209 ± 0.127
6.953LeuAsn: 6.953 ± 2.351
3.325LeuPro: 3.325 ± 0.266
1.814LeuGln: 1.814 ± 0.056
4.837LeuArg: 4.837 ± 0.016
7.557LeuSer: 7.557 ± 1.054
5.441LeuThr: 5.441 ± 0.327
6.348LeuVal: 6.348 ± 0.299
0.302LeuTrp: 0.302 ± 0.338
3.325LeuTyr: 3.325 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
1.209MetAla: 1.209 ± 0.127
0.605MetCys: 0.605 ± 0.31
0.605MetAsp: 0.605 ± 0.31
1.814MetGlu: 1.814 ± 0.438
0.605MetPhe: 0.605 ± 0.31
1.511MetGly: 1.511 ± 0.211
0.605MetHis: 0.605 ± 0.676
0.605MetIle: 0.605 ± 0.31
1.814MetLys: 1.814 ± 0.931
3.023MetLeu: 3.023 ± 0.565
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.814MetPro: 1.814 ± 0.056
0.0MetGln: 0.0 ± 0.0
1.511MetArg: 1.511 ± 0.283
1.209MetSer: 1.209 ± 0.366
2.116MetThr: 2.116 ± 0.1
0.907MetVal: 0.907 ± 0.028
0.907MetTrp: 0.907 ± 0.521
1.814MetTyr: 1.814 ± 0.931
0.0MetXaa: 0.0 ± 0.0
Asn
2.418AsnAla: 2.418 ± 1.242
1.209AsnCys: 1.209 ± 0.621
3.325AsnAsp: 3.325 ± 1.214
0.907AsnGlu: 0.907 ± 0.028
2.418AsnPhe: 2.418 ± 0.748
0.907AsnGly: 0.907 ± 0.028
0.907AsnHis: 0.907 ± 0.521
3.628AsnIle: 3.628 ± 0.382
3.023AsnLys: 3.023 ± 1.059
4.837AsnLeu: 4.837 ± 0.971
1.209AsnMet: 1.209 ± 0.127
1.814AsnAsn: 1.814 ± 0.549
3.023AsnPro: 3.023 ± 0.072
1.511AsnGln: 1.511 ± 0.283
2.721AsnArg: 2.721 ± 0.903
8.162AsnSer: 8.162 ± 2.217
3.628AsnThr: 3.628 ± 2.085
1.814AsnVal: 1.814 ± 0.056
0.302AsnTrp: 0.302 ± 0.338
0.605AsnTyr: 0.605 ± 0.676
0.0AsnXaa: 0.0 ± 0.0
Pro
2.116ProAla: 2.116 ± 1.086
0.907ProCys: 0.907 ± 0.521
2.418ProAsp: 2.418 ± 0.255
1.814ProGlu: 1.814 ± 0.056
2.116ProPhe: 2.116 ± 0.394
3.023ProGly: 3.023 ± 0.915
0.302ProHis: 0.302 ± 0.155
3.628ProIle: 3.628 ± 2.578
1.209ProLys: 1.209 ± 0.621
4.232ProLeu: 4.232 ± 0.199
1.209ProMet: 1.209 ± 0.127
2.116ProAsn: 2.116 ± 0.1
2.116ProPro: 2.116 ± 1.381
0.907ProGln: 0.907 ± 0.028
1.209ProArg: 1.209 ± 0.366
6.953ProSer: 6.953 ± 0.609
3.93ProThr: 3.93 ± 1.93
3.023ProVal: 3.023 ± 1.409
0.302ProTrp: 0.302 ± 0.338
3.023ProTyr: 3.023 ± 0.565
0.0ProXaa: 0.0 ± 0.0
Gln
3.325GlnAla: 3.325 ± 0.227
0.605GlnCys: 0.605 ± 0.183
2.418GlnAsp: 2.418 ± 0.239
1.209GlnGlu: 1.209 ± 0.621
0.907GlnPhe: 0.907 ± 0.028
2.418GlnGly: 2.418 ± 0.255
0.605GlnHis: 0.605 ± 0.676
2.116GlnIle: 2.116 ± 0.1
1.511GlnLys: 1.511 ± 0.776
4.232GlnLeu: 4.232 ± 0.294
1.814GlnMet: 1.814 ± 0.056
3.325GlnAsn: 3.325 ± 0.72
1.209GlnPro: 1.209 ± 0.127
1.814GlnGln: 1.814 ± 0.549
2.721GlnArg: 2.721 ± 0.41
1.511GlnSer: 1.511 ± 0.211
2.721GlnThr: 2.721 ± 0.41
2.418GlnVal: 2.418 ± 0.239
0.0GlnTrp: 0.0 ± 0.0
1.511GlnTyr: 1.511 ± 0.211
0.0GlnXaa: 0.0 ± 0.0
Arg
3.628ArgAla: 3.628 ± 0.111
0.302ArgCys: 0.302 ± 0.338
2.721ArgAsp: 2.721 ± 0.41
1.209ArgGlu: 1.209 ± 0.859
1.814ArgPhe: 1.814 ± 0.056
2.721ArgGly: 2.721 ± 1.07
1.814ArgHis: 1.814 ± 0.438
2.721ArgIle: 2.721 ± 0.903
2.116ArgLys: 2.116 ± 1.086
4.232ArgLeu: 4.232 ± 0.788
1.209ArgMet: 1.209 ± 0.621
2.116ArgAsn: 2.116 ± 0.593
1.814ArgPro: 1.814 ± 0.438
3.325ArgGln: 3.325 ± 0.72
2.418ArgArg: 2.418 ± 0.255
3.023ArgSer: 3.023 ± 0.072
5.744ArgThr: 5.744 ± 2.455
1.209ArgVal: 1.209 ± 0.127
0.605ArgTrp: 0.605 ± 0.183
1.209ArgTyr: 1.209 ± 0.127
0.0ArgXaa: 0.0 ± 0.0
Ser
5.139SerAla: 5.139 ± 1.802
0.907SerCys: 0.907 ± 0.466
2.721SerAsp: 2.721 ± 0.083
3.628SerGlu: 3.628 ± 0.605
3.628SerPhe: 3.628 ± 0.605
4.837SerGly: 4.837 ± 0.971
1.511SerHis: 1.511 ± 0.283
5.441SerIle: 5.441 ± 0.167
6.046SerLys: 6.046 ± 0.143
9.674SerLeu: 9.674 ± 1.512
1.814SerMet: 1.814 ± 1.016
5.139SerAsn: 5.139 ± 1.652
3.93SerPro: 3.93 ± 0.449
5.139SerGln: 5.139 ± 1.158
1.511SerArg: 1.511 ± 0.283
6.651SerSer: 6.651 ± 0.533
8.162SerThr: 8.162 ± 2.224
6.651SerVal: 6.651 ± 2.013
2.116SerTrp: 2.116 ± 0.887
2.721SerTyr: 2.721 ± 0.903
0.0SerXaa: 0.0 ± 0.0
Thr
5.441ThrAla: 5.441 ± 0.327
1.209ThrCys: 1.209 ± 0.127
5.441ThrAsp: 5.441 ± 0.66
6.046ThrGlu: 6.046 ± 1.13
3.325ThrPhe: 3.325 ± 0.266
3.93ThrGly: 3.93 ± 1.436
2.418ThrHis: 2.418 ± 0.732
6.348ThrIle: 6.348 ± 1.182
3.93ThrLys: 3.93 ± 0.449
8.464ThrLeu: 8.464 ± 1.385
0.907ThrMet: 0.907 ± 0.371
2.116ThrAsn: 2.116 ± 0.1
3.93ThrPro: 3.93 ± 0.943
3.93ThrGln: 3.93 ± 0.449
2.721ThrArg: 2.721 ± 0.41
9.069ThrSer: 9.069 ± 1.265
6.651ThrThr: 6.651 ± 0.454
5.744ThrVal: 5.744 ± 2.479
0.605ThrTrp: 0.605 ± 0.183
1.511ThrTyr: 1.511 ± 0.283
0.0ThrXaa: 0.0 ± 0.0
Val
3.628ValAla: 3.628 ± 2.085
0.0ValCys: 0.0 ± 0.0
4.232ValAsp: 4.232 ± 0.788
6.953ValGlu: 6.953 ± 0.116
1.814ValPhe: 1.814 ± 0.549
3.628ValGly: 3.628 ± 2.085
1.209ValHis: 1.209 ± 0.127
2.721ValIle: 2.721 ± 1.07
2.721ValLys: 2.721 ± 0.903
5.139ValLeu: 5.139 ± 0.322
0.907ValMet: 0.907 ± 0.466
2.418ValAsn: 2.418 ± 1.242
3.93ValPro: 3.93 ± 0.537
2.721ValGln: 2.721 ± 0.577
2.418ValArg: 2.418 ± 0.239
4.232ValSer: 4.232 ± 1.775
5.139ValThr: 5.139 ± 0.816
4.232ValVal: 4.232 ± 1.281
0.605ValTrp: 0.605 ± 0.183
2.418ValTyr: 2.418 ± 0.732
0.0ValXaa: 0.0 ± 0.0
Trp
0.605TrpAla: 0.605 ± 0.183
0.302TrpCys: 0.302 ± 0.338
0.907TrpAsp: 0.907 ± 0.028
0.0TrpGlu: 0.0 ± 0.0
0.302TrpPhe: 0.302 ± 0.155
0.302TrpGly: 0.302 ± 0.155
0.302TrpHis: 0.302 ± 0.338
0.907TrpIle: 0.907 ± 0.521
1.209TrpLys: 1.209 ± 0.366
0.907TrpLeu: 0.907 ± 0.028
0.605TrpMet: 0.605 ± 0.676
0.302TrpAsn: 0.302 ± 0.155
0.0TrpPro: 0.0 ± 0.0
0.302TrpGln: 0.302 ± 0.338
1.511TrpArg: 1.511 ± 0.704
0.605TrpSer: 0.605 ± 0.183
0.605TrpThr: 0.605 ± 0.183
0.302TrpVal: 0.302 ± 0.155
0.0TrpTrp: 0.0 ± 0.0
0.605TrpTyr: 0.605 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.511TyrAla: 1.511 ± 0.211
0.907TyrCys: 0.907 ± 0.028
3.023TyrAsp: 3.023 ± 1.552
1.209TyrGlu: 1.209 ± 0.366
2.116TyrPhe: 2.116 ± 0.394
1.209TyrGly: 1.209 ± 0.127
1.511TyrHis: 1.511 ± 0.704
2.116TyrIle: 2.116 ± 0.593
1.511TyrLys: 1.511 ± 0.283
3.325TyrLeu: 3.325 ± 0.72
0.0TyrMet: 0.0 ± 0.0
1.209TyrAsn: 1.209 ± 0.127
2.116TyrPro: 2.116 ± 0.887
0.907TyrGln: 0.907 ± 0.028
0.302TyrArg: 0.302 ± 0.338
3.93TyrSer: 3.93 ± 0.537
4.837TyrThr: 4.837 ± 0.016
1.511TyrVal: 1.511 ± 0.776
0.302TyrTrp: 0.302 ± 0.338
1.209TyrTyr: 1.209 ± 0.127
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3309 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski