Amino acid dipepetide frequency for Nodamura virus (strain Mag115) (NoV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.991AlaAla: 18.991 ± 8.845
1.628AlaCys: 1.628 ± 1.646
6.511AlaAsp: 6.511 ± 2.479
5.969AlaGlu: 5.969 ± 0.967
5.426AlaPhe: 5.426 ± 0.709
10.852AlaGly: 10.852 ± 4.544
2.713AlaHis: 2.713 ± 1.584
5.426AlaIle: 5.426 ± 1.747
4.883AlaLys: 4.883 ± 1.39
5.426AlaLeu: 5.426 ± 0.928
0.543AlaMet: 0.543 ± 0.299
3.256AlaAsn: 3.256 ± 2.087
4.883AlaPro: 4.883 ± 2.188
6.511AlaGln: 6.511 ± 3.998
10.309AlaArg: 10.309 ± 0.996
11.394AlaSer: 11.394 ± 3.865
7.596AlaThr: 7.596 ± 2.639
2.713AlaVal: 2.713 ± 1.383
1.085AlaTrp: 1.085 ± 0.598
4.883AlaTyr: 4.883 ± 2.456
0.0AlaXaa: 0.0 ± 0.0
Cys
2.713CysAla: 2.713 ± 1.196
0.0CysCys: 0.0 ± 0.0
0.543CysAsp: 0.543 ± 0.299
1.085CysGlu: 1.085 ± 1.628
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.543CysLys: 0.543 ± 0.549
1.628CysLeu: 1.628 ± 0.953
0.0CysMet: 0.0 ± 0.0
1.085CysAsn: 1.085 ± 0.445
1.085CysPro: 1.085 ± 0.598
0.0CysGln: 0.0 ± 0.0
1.085CysArg: 1.085 ± 0.598
0.543CysSer: 0.543 ± 0.299
0.0CysThr: 0.0 ± 0.0
1.628CysVal: 1.628 ± 0.524
0.0CysTrp: 0.0 ± 0.0
0.543CysTyr: 0.543 ± 0.549
0.0CysXaa: 0.0 ± 0.0
Asp
4.883AspAla: 4.883 ± 1.255
0.0AspCys: 0.0 ± 0.0
1.085AspAsp: 1.085 ± 0.598
2.713AspGlu: 2.713 ± 0.78
1.628AspPhe: 1.628 ± 0.953
4.341AspGly: 4.341 ± 1.378
0.543AspHis: 0.543 ± 0.299
2.17AspIle: 2.17 ± 0.727
2.17AspLys: 2.17 ± 0.727
6.511AspLeu: 6.511 ± 2.479
2.713AspMet: 2.713 ± 0.981
1.628AspAsn: 1.628 ± 0.524
5.426AspPro: 5.426 ± 0.885
1.628AspGln: 1.628 ± 0.524
5.426AspArg: 5.426 ± 2.368
1.085AspSer: 1.085 ± 0.598
2.17AspThr: 2.17 ± 0.727
4.341AspVal: 4.341 ± 1.432
0.543AspTrp: 0.543 ± 0.299
1.085AspTyr: 1.085 ± 0.598
0.0AspXaa: 0.0 ± 0.0
Glu
4.341GluAla: 4.341 ± 0.87
0.0GluCys: 0.0 ± 0.0
2.17GluAsp: 2.17 ± 1.195
1.628GluGlu: 1.628 ± 1.451
4.341GluPhe: 4.341 ± 2.03
4.883GluGly: 4.883 ± 1.255
1.085GluHis: 1.085 ± 0.598
2.713GluIle: 2.713 ± 1.244
0.543GluLys: 0.543 ± 0.299
3.798GluLeu: 3.798 ± 4.695
2.17GluMet: 2.17 ± 2.961
0.543GluAsn: 0.543 ± 0.299
2.17GluPro: 2.17 ± 0.727
2.713GluGln: 2.713 ± 1.531
2.17GluArg: 2.17 ± 0.727
0.543GluSer: 0.543 ± 0.549
4.883GluThr: 4.883 ± 0.988
3.256GluVal: 3.256 ± 1.793
0.543GluTrp: 0.543 ± 0.299
1.628GluTyr: 1.628 ± 0.953
0.0GluXaa: 0.0 ± 0.0
Phe
4.341PheAla: 4.341 ± 1.781
1.628PheCys: 1.628 ± 1.451
2.17PheAsp: 2.17 ± 0.727
1.085PheGlu: 1.085 ± 0.445
0.543PhePhe: 0.543 ± 0.299
2.17PheGly: 2.17 ± 0.727
0.543PheHis: 0.543 ± 0.549
1.628PheIle: 1.628 ± 0.524
0.0PheLys: 0.0 ± 0.0
3.798PheLeu: 3.798 ± 0.92
0.543PheMet: 0.543 ± 0.299
1.085PheAsn: 1.085 ± 0.598
1.085PhePro: 1.085 ± 0.445
2.713PheGln: 2.713 ± 0.981
2.713PheArg: 2.713 ± 0.981
2.713PheSer: 2.713 ± 1.383
1.628PheThr: 1.628 ± 0.953
1.085PheVal: 1.085 ± 0.598
0.543PheTrp: 0.543 ± 0.549
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
7.596GlyAla: 7.596 ± 2.254
0.543GlyCys: 0.543 ± 0.549
3.798GlyAsp: 3.798 ± 1.136
3.256GlyGlu: 3.256 ± 2.902
2.17GlyPhe: 2.17 ± 0.727
5.969GlyGly: 5.969 ± 2.765
1.628GlyHis: 1.628 ± 1.451
2.713GlyIle: 2.713 ± 2.034
3.256GlyLys: 3.256 ± 0.928
6.511GlyLeu: 6.511 ± 2.276
0.543GlyMet: 0.543 ± 0.299
1.085GlyAsn: 1.085 ± 0.779
5.969GlyPro: 5.969 ± 2.981
0.0GlyGln: 0.0 ± 0.0
9.767GlyArg: 9.767 ± 3.593
8.139GlySer: 8.139 ± 2.88
1.628GlyThr: 1.628 ± 0.524
7.596GlyVal: 7.596 ± 0.067
0.543GlyTrp: 0.543 ± 0.299
1.628GlyTyr: 1.628 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
1.085HisAla: 1.085 ± 0.598
1.085HisCys: 1.085 ± 0.445
1.085HisAsp: 1.085 ± 0.779
0.543HisGlu: 0.543 ± 0.299
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.543HisLys: 0.543 ± 0.299
0.543HisLeu: 0.543 ± 0.299
0.0HisMet: 0.0 ± 0.0
0.543HisAsn: 0.543 ± 0.299
1.628HisPro: 1.628 ± 0.524
3.798HisGln: 3.798 ± 3.049
2.17HisArg: 2.17 ± 1.195
0.543HisSer: 0.543 ± 0.299
0.543HisThr: 0.543 ± 0.299
1.628HisVal: 1.628 ± 0.897
1.085HisTrp: 1.085 ± 0.598
2.17HisTyr: 2.17 ± 1.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.341IleAla: 4.341 ± 2.748
0.0IleCys: 0.0 ± 0.0
3.256IleAsp: 3.256 ± 1.254
3.798IleGlu: 3.798 ± 1.231
0.0IlePhe: 0.0 ± 0.0
2.713IleGly: 2.713 ± 0.981
0.543IleHis: 0.543 ± 0.299
2.713IleIle: 2.713 ± 0.981
4.341IleLys: 4.341 ± 4.635
1.085IleLeu: 1.085 ± 1.097
0.543IleMet: 0.543 ± 0.549
1.628IleAsn: 1.628 ± 0.897
4.341IlePro: 4.341 ± 2.334
2.713IleGln: 2.713 ± 1.196
1.085IleArg: 1.085 ± 0.598
3.256IleSer: 3.256 ± 1.793
2.713IleThr: 2.713 ± 0.981
2.17IleVal: 2.17 ± 1.195
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.256LysAla: 3.256 ± 1.314
0.543LysCys: 0.543 ± 0.549
2.713LysAsp: 2.713 ± 1.196
0.0LysGlu: 0.0 ± 0.0
0.543LysPhe: 0.543 ± 0.549
1.085LysGly: 1.085 ± 1.097
1.085LysHis: 1.085 ± 0.445
2.713LysIle: 2.713 ± 0.925
0.543LysLys: 0.543 ± 0.549
1.628LysLeu: 1.628 ± 1.451
1.085LysMet: 1.085 ± 0.703
1.628LysAsn: 1.628 ± 0.897
3.256LysPro: 3.256 ± 2.902
2.17LysGln: 2.17 ± 1.846
3.798LysArg: 3.798 ± 1.681
3.798LysSer: 3.798 ± 0.92
1.085LysThr: 1.085 ± 0.779
3.798LysVal: 3.798 ± 1.537
0.0LysTrp: 0.0 ± 0.0
0.543LysTyr: 0.543 ± 0.299
0.0LysXaa: 0.0 ± 0.0
Leu
9.224LeuAla: 9.224 ± 1.557
0.0LeuCys: 0.0 ± 0.0
4.341LeuAsp: 4.341 ± 0.87
4.341LeuGlu: 4.341 ± 2.621
1.628LeuPhe: 1.628 ± 0.524
4.883LeuGly: 4.883 ± 2.456
1.628LeuHis: 1.628 ± 0.897
3.256LeuIle: 3.256 ± 2.902
5.426LeuLys: 5.426 ± 2.987
2.17LeuLeu: 2.17 ± 1.411
2.713LeuMet: 2.713 ± 1.196
3.256LeuAsn: 3.256 ± 1.047
3.256LeuPro: 3.256 ± 1.314
1.628LeuGln: 1.628 ± 0.897
8.681LeuArg: 8.681 ± 2.659
2.713LeuSer: 2.713 ± 1.494
5.426LeuThr: 5.426 ± 0.709
6.511LeuVal: 6.511 ± 2.181
0.543LeuTrp: 0.543 ± 0.549
0.543LeuTyr: 0.543 ± 0.549
0.0LeuXaa: 0.0 ± 0.0
Met
2.17MetAla: 2.17 ± 0.89
0.0MetCys: 0.0 ± 0.0
0.543MetAsp: 0.543 ± 0.549
1.628MetGlu: 1.628 ± 1.652
1.085MetPhe: 1.085 ± 0.445
0.543MetGly: 0.543 ± 0.299
0.0MetHis: 0.0 ± 0.0
2.17MetIle: 2.17 ± 1.318
0.543MetLys: 0.543 ± 0.299
3.256MetLeu: 3.256 ± 1.047
0.543MetMet: 0.543 ± 0.299
0.543MetAsn: 0.543 ± 0.299
1.085MetPro: 1.085 ± 0.598
0.543MetGln: 0.543 ± 0.549
1.628MetArg: 1.628 ± 0.832
1.628MetSer: 1.628 ± 1.652
1.085MetThr: 1.085 ± 0.861
1.628MetVal: 1.628 ± 0.524
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.713AsnAla: 2.713 ± 0.99
0.0AsnCys: 0.0 ± 0.0
2.17AsnAsp: 2.17 ± 0.732
2.713AsnGlu: 2.713 ± 1.196
1.085AsnPhe: 1.085 ± 0.445
1.628AsnGly: 1.628 ± 0.897
0.543AsnHis: 0.543 ± 0.299
1.085AsnIle: 1.085 ± 0.598
1.085AsnLys: 1.085 ± 1.628
1.628AsnLeu: 1.628 ± 1.646
1.628AsnMet: 1.628 ± 0.89
2.17AsnAsn: 2.17 ± 1.491
2.713AsnPro: 2.713 ± 0.981
3.256AsnGln: 3.256 ± 1.444
1.628AsnArg: 1.628 ± 0.524
3.256AsnSer: 3.256 ± 1.314
1.628AsnThr: 1.628 ± 0.953
3.256AsnVal: 3.256 ± 1.889
0.0AsnTrp: 0.0 ± 0.0
1.628AsnTyr: 1.628 ± 0.524
0.0AsnXaa: 0.0 ± 0.0
Pro
9.767ProAla: 9.767 ± 2.559
0.0ProCys: 0.0 ± 0.0
2.713ProAsp: 2.713 ± 0.99
2.17ProGlu: 2.17 ± 0.727
0.543ProPhe: 0.543 ± 0.299
8.139ProGly: 8.139 ± 2.463
0.543ProHis: 0.543 ± 0.299
2.713ProIle: 2.713 ± 1.383
2.17ProLys: 2.17 ± 0.727
3.798ProLeu: 3.798 ± 1.231
1.085ProMet: 1.085 ± 0.598
4.341ProAsn: 4.341 ± 1.734
2.713ProPro: 2.713 ± 0.99
1.628ProGln: 1.628 ± 0.897
2.713ProArg: 2.713 ± 1.574
2.713ProSer: 2.713 ± 1.184
4.883ProThr: 4.883 ± 1.276
2.713ProVal: 2.713 ± 1.196
0.543ProTrp: 0.543 ± 0.299
2.713ProTyr: 2.713 ± 1.494
0.0ProXaa: 0.0 ± 0.0
Gln
4.883GlnAla: 4.883 ± 2.509
0.543GlnCys: 0.543 ± 0.299
2.17GlnAsp: 2.17 ± 1.195
1.085GlnGlu: 1.085 ± 1.628
0.543GlnPhe: 0.543 ± 0.549
2.713GlnGly: 2.713 ± 0.78
2.17GlnHis: 2.17 ± 1.318
1.085GlnIle: 1.085 ± 0.598
0.0GlnLys: 0.0 ± 0.0
4.883GlnLeu: 4.883 ± 6.468
0.0GlnMet: 0.0 ± 0.0
1.628GlnAsn: 1.628 ± 0.524
1.628GlnPro: 1.628 ± 0.801
5.969GlnGln: 5.969 ± 3.239
5.969GlnArg: 5.969 ± 0.486
3.256GlnSer: 3.256 ± 1.336
3.798GlnThr: 3.798 ± 1.478
5.969GlnVal: 5.969 ± 2.467
0.0GlnTrp: 0.0 ± 0.0
1.085GlnTyr: 1.085 ± 0.598
0.0GlnXaa: 0.0 ± 0.0
Arg
13.565ArgAla: 13.565 ± 2.889
0.0ArgCys: 0.0 ± 0.0
5.969ArgAsp: 5.969 ± 2.19
3.256ArgGlu: 3.256 ± 1.254
3.256ArgPhe: 3.256 ± 1.793
8.681ArgGly: 8.681 ± 4.62
1.628ArgHis: 1.628 ± 0.897
1.628ArgIle: 1.628 ± 0.897
3.256ArgLys: 3.256 ± 1.024
3.798ArgLeu: 3.798 ± 1.681
2.17ArgMet: 2.17 ± 0.727
0.543ArgAsn: 0.543 ± 0.299
2.713ArgPro: 2.713 ± 0.78
5.426ArgGln: 5.426 ± 1.913
10.852ArgArg: 10.852 ± 2.776
5.426ArgSer: 5.426 ± 2.272
2.713ArgThr: 2.713 ± 0.981
9.767ArgVal: 9.767 ± 3.324
0.543ArgTrp: 0.543 ± 0.299
2.713ArgTyr: 2.713 ± 0.925
0.0ArgXaa: 0.0 ± 0.0
Ser
5.969SerAla: 5.969 ± 1.688
2.17SerCys: 2.17 ± 1.318
1.628SerAsp: 1.628 ± 0.832
1.628SerGlu: 1.628 ± 0.897
2.713SerPhe: 2.713 ± 0.925
7.596SerGly: 7.596 ± 1.389
1.628SerHis: 1.628 ± 0.897
3.256SerIle: 3.256 ± 1.336
1.628SerLys: 1.628 ± 0.801
8.139SerLeu: 8.139 ± 2.18
1.085SerMet: 1.085 ± 1.097
3.256SerAsn: 3.256 ± 1.423
4.341SerPro: 4.341 ± 1.454
4.341SerGln: 4.341 ± 1.463
8.139SerArg: 8.139 ± 4.162
7.054SerSer: 7.054 ± 3.446
2.713SerThr: 2.713 ± 1.383
3.256SerVal: 3.256 ± 1.423
1.085SerTrp: 1.085 ± 0.598
2.17SerTyr: 2.17 ± 0.727
0.0SerXaa: 0.0 ± 0.0
Thr
4.883ThrAla: 4.883 ± 1.652
0.543ThrCys: 0.543 ± 0.299
3.798ThrAsp: 3.798 ± 1.231
0.543ThrGlu: 0.543 ± 0.299
4.341ThrPhe: 4.341 ± 1.436
3.256ThrGly: 3.256 ± 1.336
0.543ThrHis: 0.543 ± 0.299
5.426ThrIle: 5.426 ± 0.928
1.085ThrLys: 1.085 ± 0.598
5.426ThrLeu: 5.426 ± 1.144
1.085ThrMet: 1.085 ± 0.445
4.341ThrAsn: 4.341 ± 2.844
2.17ThrPro: 2.17 ± 0.89
1.628ThrGln: 1.628 ± 1.652
3.256ThrArg: 3.256 ± 1.254
4.883ThrSer: 4.883 ± 1.483
3.256ThrThr: 3.256 ± 0.927
5.426ThrVal: 5.426 ± 0.709
0.543ThrTrp: 0.543 ± 0.299
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
10.852ValAla: 10.852 ± 4.784
1.085ValCys: 1.085 ± 0.445
3.798ValAsp: 3.798 ± 2.092
3.798ValGlu: 3.798 ± 1.231
0.543ValPhe: 0.543 ± 0.299
4.883ValGly: 4.883 ± 0.735
2.17ValHis: 2.17 ± 1.195
0.543ValIle: 0.543 ± 0.299
2.17ValLys: 2.17 ± 1.195
4.341ValLeu: 4.341 ± 1.258
1.628ValMet: 1.628 ± 0.541
2.713ValAsn: 2.713 ± 3.234
5.426ValPro: 5.426 ± 1.362
1.628ValGln: 1.628 ± 1.451
2.713ValArg: 2.713 ± 1.196
8.139ValSer: 8.139 ± 1.824
4.883ValThr: 4.883 ± 1.276
4.341ValVal: 4.341 ± 1.432
2.713ValTrp: 2.713 ± 0.981
3.798ValTyr: 3.798 ± 1.231
0.0ValXaa: 0.0 ± 0.0
Trp
1.085TrpAla: 1.085 ± 0.445
1.085TrpCys: 1.085 ± 0.598
0.543TrpAsp: 0.543 ± 0.299
1.628TrpGlu: 1.628 ± 0.524
1.085TrpPhe: 1.085 ± 0.598
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.628TrpLeu: 1.628 ± 0.897
0.0TrpMet: 0.0 ± 0.0
0.543TrpAsn: 0.543 ± 0.299
1.628TrpPro: 1.628 ± 0.897
0.0TrpGln: 0.0 ± 0.0
1.085TrpArg: 1.085 ± 0.598
0.543TrpSer: 0.543 ± 0.299
0.0TrpThr: 0.0 ± 0.0
0.543TrpVal: 0.543 ± 0.549
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.341TyrAla: 4.341 ± 1.436
1.628TyrCys: 1.628 ± 0.897
1.085TyrAsp: 1.085 ± 0.598
3.256TyrGlu: 3.256 ± 1.024
0.543TyrPhe: 0.543 ± 0.299
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
1.628TyrLys: 1.628 ± 0.524
2.17TyrLeu: 2.17 ± 1.195
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.085TyrPro: 1.085 ± 0.445
1.085TyrGln: 1.085 ± 0.598
2.713TyrArg: 2.713 ± 1.574
2.17TyrSer: 2.17 ± 1.491
3.798TyrThr: 3.798 ± 1.537
0.543TyrVal: 0.543 ± 0.299
1.085TyrTrp: 1.085 ± 0.598
1.085TyrTyr: 1.085 ± 0.598
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1844 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski