Amino acid dipeptide frequency for Schmallenberg virus (SBV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
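As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and normalises by the total number of pairs. The function name `dipeptide_per_mille`, the one-letter input format, and the toy sequences are assumptions made for illustration; the exact procedure used to build this page is not described here.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of length 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not the SBV proteins.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

Here the toy input contains six overlapping pairs in total, two of which are `KK`.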

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
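The comparison against the uniform expectation can be sketched as follows. The 0.25% expectation corresponds to 2.5‰, the unit used in the table. The function `classify` and the simple greater-than/less-than rule are assumptions; the page does not state whether the colouring also takes the error estimate into account.

```python
# Uniform expectation for 20 standard amino acids: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Three values taken from the table (per mille).
examples = {"AlaAla": 3.086, "AlaCys": 1.8, "IleLys": 8.228}
labels = {name: classify(value) for name, value in examples.items()}
```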

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.086 ± 1.821
AlaCys: 1.8 ± 0.592
AlaAsp: 2.314 ± 1.513
AlaGlu: 3.857 ± 2.476
AlaPhe: 2.828 ± 1.934
AlaGly: 1.286 ± 0.445
AlaHis: 1.543 ± 0.381
AlaIle: 3.343 ± 0.285
AlaLys: 4.371 ± 1.005
AlaLeu: 4.886 ± 0.924
AlaMet: 1.029 ± 0.293
AlaAsn: 3.857 ± 0.536
AlaPro: 0.771 ± 0.701
AlaGln: 1.286 ± 1.401
AlaArg: 2.057 ± 2.019
AlaSer: 3.343 ± 0.586
AlaThr: 3.857 ± 1.313
AlaVal: 2.057 ± 0.508
AlaTrp: 0.514 ± 0.709
AlaTyr: 2.828 ± 0.735
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.286 ± 0.239
CysCys: 0.771 ± 0.159
CysAsp: 0.771 ± 0.159
CysGlu: 1.543 ± 0.703
CysPhe: 1.029 ± 0.59
CysGly: 1.543 ± 1.072
CysHis: 0.257 ± 0.242
CysIle: 2.314 ± 0.467
CysLys: 2.314 ± 1.228
CysLeu: 3.857 ± 2.121
CysMet: 0.771 ± 0.159
CysAsn: 2.314 ± 1.797
CysPro: 1.543 ± 1.072
CysGln: 0.514 ± 0.127
CysArg: 1.8 ± 0.348
CysSer: 3.343 ± 1.644
CysThr: 2.057 ± 1.023
CysVal: 0.771 ± 0.726
CysTrp: 0.0 ± 0.0
CysTyr: 1.8 ± 0.444
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.8 ± 0.663
AspCys: 0.514 ± 0.321
AspAsp: 4.114 ± 1.491
AspGlu: 4.371 ± 1.042
AspPhe: 3.6 ± 1.164
AspGly: 2.057 ± 0.443
AspHis: 1.286 ± 0.239
AspIle: 7.2 ± 1.391
AspLys: 2.314 ± 0.477
AspLeu: 4.114 ± 1.074
AspMet: 1.029 ± 0.293
AspAsn: 4.371 ± 1.005
AspPro: 1.8 ± 0.444
AspGln: 2.057 ± 0.386
AspArg: 2.828 ± 0.539
AspSer: 2.057 ± 0.919
AspThr: 3.343 ± 0.244
AspVal: 3.6 ± 1.082
AspTrp: 0.257 ± 0.242
AspTyr: 3.086 ± 0.636
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.314 ± 1.228
GluCys: 1.286 ± 0.47
GluAsp: 2.314 ± 1.228
GluGlu: 3.857 ± 0.924
GluPhe: 3.343 ± 1.718
GluGly: 1.543 ± 0.703
GluHis: 1.543 ± 0.703
GluIle: 5.4 ± 1.078
GluLys: 4.628 ± 1.474
GluLeu: 4.114 ± 0.913
GluMet: 3.6 ± 1.327
GluAsn: 2.314 ± 0.467
GluPro: 2.571 ± 1.238
GluGln: 2.057 ± 0.508
GluArg: 3.086 ± 0.88
GluSer: 3.6 ± 1.882
GluThr: 2.057 ± 0.508
GluVal: 3.343 ± 0.975
GluTrp: 0.514 ± 0.76
GluTyr: 2.571 ± 0.636
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.543 ± 0.604
PheCys: 1.8 ± 0.348
PheAsp: 2.571 ± 0.355
PheGlu: 3.857 ± 0.924
PhePhe: 1.286 ± 0.584
PheGly: 3.086 ± 1.871
PheHis: 1.029 ± 0.254
PheIle: 4.371 ± 0.954
PheLys: 3.086 ± 0.763
PheLeu: 4.886 ± 1.654
PheMet: 1.029 ± 0.714
PheAsn: 2.314 ± 1.299
PhePro: 0.257 ± 0.752
PheGln: 1.029 ± 0.293
PheArg: 2.314 ± 1.078
PheSer: 3.343 ± 0.638
PheThr: 3.857 ± 0.924
PheVal: 2.314 ± 0.737
PheTrp: 0.257 ± 0.161
PheTyr: 2.057 ± 0.685
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.543 ± 0.516
GlyCys: 2.057 ± 1.023
GlyAsp: 3.086 ± 0.584
GlyGlu: 3.6 ± 1.182
GlyPhe: 1.286 ± 1.357
GlyGly: 1.286 ± 0.584
GlyHis: 0.771 ± 0.352
GlyIle: 3.343 ± 0.586
GlyLys: 2.571 ± 0.355
GlyLeu: 3.6 ± 1.185
GlyMet: 2.057 ± 0.386
GlyAsn: 1.543 ± 0.602
GlyPro: 2.057 ± 0.587
GlyGln: 1.8 ± 1.422
GlyArg: 1.543 ± 0.602
GlySer: 3.086 ± 2.144
GlyThr: 2.314 ± 1.228
GlyVal: 2.057 ± 1.266
GlyTrp: 0.771 ± 0.352
GlyTyr: 1.543 ± 2.109
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.286 ± 0.47
HisCys: 0.257 ± 0.242
HisAsp: 1.8 ± 0.348
HisGlu: 1.8 ± 0.592
HisPhe: 0.771 ± 0.701
HisGly: 1.543 ± 0.318
HisHis: 0.514 ± 0.321
HisIle: 1.286 ± 0.47
HisLys: 1.029 ± 0.714
HisLeu: 1.286 ± 0.445
HisMet: 0.771 ± 0.159
HisAsn: 1.543 ± 0.602
HisPro: 1.286 ± 0.831
HisGln: 0.771 ± 0.352
HisArg: 1.286 ± 1.586
HisSer: 2.828 ± 1.046
HisThr: 1.8 ± 0.444
HisVal: 1.8 ± 0.76
HisTrp: 0.0 ± 0.0
HisTyr: 1.286 ± 0.47
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.428 ± 0.638
IleCys: 2.057 ± 0.508
IleAsp: 5.4 ± 1.615
IleGlu: 6.171 ± 0.895
IlePhe: 4.114 ± 0.294
IleGly: 3.086 ± 0.512
IleHis: 2.057 ± 0.919
IleIle: 5.4 ± 2.462
IleLys: 8.228 ± 1.077
IleLeu: 7.971 ± 1.847
IleMet: 3.086 ± 0.151
IleAsn: 6.171 ± 0.556
IlePro: 3.343 ± 1.29
IleGln: 4.371 ± 0.963
IleArg: 3.086 ± 0.88
IleSer: 6.171 ± 1.525
IleThr: 4.628 ± 0.736
IleVal: 4.371 ± 0.726
IleTrp: 0.771 ± 0.482
IleTyr: 4.628 ± 0.933
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.886 ± 1.524
LysCys: 1.543 ± 1.452
LysAsp: 4.371 ± 0.336
LysGlu: 3.6 ± 1.52
LysPhe: 4.114 ± 1.074
LysGly: 3.6 ± 0.13
LysHis: 1.286 ± 0.239
LysIle: 6.171 ± 0.35
LysLys: 6.943 ± 1.036
LysLeu: 7.971 ± 1.235
LysMet: 2.057 ± 0.411
LysAsn: 2.828 ± 0.589
LysPro: 2.571 ± 0.891
LysGln: 2.057 ± 0.587
LysArg: 3.086 ± 0.88
LysSer: 6.171 ± 1.643
LysThr: 5.914 ± 0.66
LysVal: 3.343 ± 0.244
LysTrp: 1.543 ± 0.602
LysTyr: 2.314 ± 0.368
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.657 ± 2.219
LeuCys: 2.828 ± 0.841
LeuAsp: 6.171 ± 1.18
LeuGlu: 5.657 ± 1.229
LeuPhe: 4.114 ± 0.913
LeuGly: 3.343 ± 0.965
LeuHis: 2.057 ± 1.284
LeuIle: 5.657 ± 1.682
LeuLys: 5.914 ± 0.642
LeuLeu: 5.914 ± 1.161
LeuMet: 1.8 ± 0.76
LeuAsn: 7.714 ± 0.199
LeuPro: 3.6 ± 0.695
LeuGln: 3.086 ± 0.235
LeuArg: 3.343 ± 0.638
LeuSer: 6.171 ± 1.273
LeuThr: 6.428 ± 0.41
LeuVal: 3.857 ± 0.161
LeuTrp: 0.257 ± 0.161
LeuTyr: 4.628 ± 0.933
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.057 ± 0.386
MetCys: 1.029 ± 0.254
MetAsp: 1.8 ± 0.471
MetGlu: 1.543 ± 0.318
MetPhe: 1.286 ± 0.584
MetGly: 1.029 ± 0.293
MetHis: 1.286 ± 0.83
MetIle: 2.828 ± 0.539
MetLys: 1.8 ± 0.982
MetLeu: 3.086 ± 0.636
MetMet: 0.514 ± 0.484
MetAsn: 1.543 ± 0.602
MetPro: 1.286 ± 0.602
MetGln: 1.286 ± 0.445
MetArg: 1.286 ± 0.445
MetSer: 2.314 ± 1.299
MetThr: 2.571 ± 1.119
MetVal: 1.543 ± 0.604
MetTrp: 0.0 ± 0.0
MetTyr: 0.771 ± 0.842
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.571 ± 1.203
AsnCys: 2.571 ± 2.039
AsnAsp: 2.057 ± 0.386
AsnGlu: 0.771 ± 0.159
AsnPhe: 2.057 ± 1.214
AsnGly: 2.314 ± 1.421
AsnHis: 1.8 ± 0.471
AsnIle: 6.943 ± 0.644
AsnLys: 4.371 ± 1.218
AsnLeu: 6.428 ± 1.622
AsnMet: 1.8 ± 0.663
AsnAsn: 2.571 ± 0.631
AsnPro: 2.828 ± 1.177
AsnGln: 3.086 ± 1.647
AsnArg: 1.8 ± 0.348
AsnSer: 3.857 ± 0.161
AsnThr: 2.571 ± 0.636
AsnVal: 2.571 ± 0.399
AsnTrp: 1.029 ± 0.293
AsnTyr: 2.314 ± 0.737
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.828 ± 0.41
ProCys: 0.0 ± 0.0
ProAsp: 2.314 ± 0.368
ProGlu: 2.057 ± 0.443
ProPhe: 1.543 ± 0.318
ProGly: 1.8 ± 0.471
ProHis: 1.286 ± 0.47
ProIle: 3.857 ± 0.795
ProLys: 2.571 ± 0.94
ProLeu: 3.343 ± 1.513
ProMet: 0.771 ± 0.159
ProAsn: 1.286 ± 0.239
ProPro: 0.514 ± 0.321
ProGln: 1.8 ± 1.362
ProArg: 0.771 ± 0.159
ProSer: 3.086 ± 0.392
ProThr: 1.8 ± 0.444
ProVal: 1.286 ± 0.584
ProTrp: 0.514 ± 0.321
ProTyr: 0.771 ± 0.159
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.029 ± 0.254
GlnCys: 0.771 ± 0.159
GlnAsp: 2.571 ± 0.599
GlnGlu: 1.029 ± 0.254
GlnPhe: 1.286 ± 1.401
GlnGly: 1.543 ± 0.602
GlnHis: 0.514 ± 0.127
GlnIle: 4.114 ± 1.464
GlnLys: 4.114 ± 1.2
GlnLeu: 2.828 ± 1.269
GlnMet: 1.286 ± 0.504
GlnAsn: 1.8 ± 0.76
GlnPro: 1.029 ± 0.714
GlnGln: 2.057 ± 1.214
GlnArg: 2.057 ± 1.459
GlnSer: 2.314 ± 1.055
GlnThr: 1.8 ± 0.348
GlnVal: 1.8 ± 0.348
GlnTrp: 1.029 ± 0.254
GlnTyr: 1.286 ± 0.584
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.8 ± 0.444
ArgCys: 1.8 ± 0.592
ArgAsp: 3.086 ± 1.558
ArgGlu: 2.057 ± 0.587
ArgPhe: 2.314 ± 0.467
ArgGly: 1.286 ± 0.239
ArgHis: 1.543 ± 0.602
ArgIle: 4.371 ± 1.322
ArgLys: 2.314 ± 0.737
ArgLeu: 3.857 ± 1.028
ArgMet: 0.771 ± 0.669
ArgAsn: 2.314 ± 0.368
ArgPro: 0.514 ± 0.484
ArgGln: 0.771 ± 0.701
ArgArg: 1.029 ± 0.59
ArgSer: 3.6 ± 1.182
ArgThr: 0.771 ± 0.669
ArgVal: 2.314 ± 1.31
ArgTrp: 0.257 ± 0.752
ArgTyr: 2.828 ± 0.41
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.6 ± 0.425
SerCys: 3.857 ± 2.121
SerAsp: 3.6 ± 0.696
SerGlu: 2.828 ± 0.589
SerPhe: 2.828 ± 1.531
SerGly: 2.571 ± 1.396
SerHis: 1.8 ± 0.444
SerIle: 7.2 ± 1.549
SerLys: 7.457 ± 1.913
SerLeu: 7.457 ± 1.486
SerMet: 1.286 ± 0.445
SerAsn: 3.857 ± 1.426
SerPro: 2.828 ± 0.907
SerGln: 1.543 ± 0.688
SerArg: 3.6 ± 1.879
SerSer: 6.171 ± 0.556
SerThr: 6.171 ± 1.157
SerVal: 4.114 ± 1.369
SerTrp: 0.0 ± 0.0
SerTyr: 3.086 ± 1.062
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.6 ± 1.629
ThrCys: 2.571 ± 1.067
ThrAsp: 2.571 ± 0.891
ThrGlu: 2.057 ± 0.479
ThrPhe: 3.6 ± 1.877
ThrGly: 2.828 ± 0.841
ThrHis: 1.029 ± 0.59
ThrIle: 6.686 ± 1.652
ThrLys: 4.628 ± 1.178
ThrLeu: 3.857 ± 0.647
ThrMet: 2.314 ± 1.31
ThrAsn: 3.086 ± 0.763
ThrPro: 2.828 ± 0.539
ThrGln: 2.571 ± 0.355
ThrArg: 1.029 ± 0.59
ThrSer: 5.914 ± 1.169
ThrThr: 2.571 ± 0.636
ThrVal: 1.8 ± 0.837
ThrTrp: 1.8 ± 0.663
ThrTyr: 3.086 ± 0.636
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.543 ± 1.403
ValCys: 2.571 ± 1.292
ValAsp: 2.828 ± 0.285
ValGlu: 1.543 ± 0.318
ValPhe: 1.8 ± 0.663
ValGly: 2.314 ± 1.184
ValHis: 1.543 ± 0.703
ValIle: 4.114 ± 0.49
ValLys: 3.343 ± 1.24
ValLeu: 4.886 ± 2.375
ValMet: 2.571 ± 0.355
ValAsn: 1.8 ± 1.289
ValPro: 1.543 ± 1.314
ValGln: 1.286 ± 0.445
ValArg: 1.543 ± 0.602
ValSer: 5.143 ± 0.309
ValThr: 2.828 ± 1.172
ValVal: 2.314 ± 0.368
ValTrp: 0.257 ± 0.242
ValTyr: 2.571 ± 0.636
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.771 ± 1.453
TrpCys: 0.0 ± 0.0
TrpAsp: 1.029 ± 0.293
TrpGlu: 0.257 ± 0.161
TrpPhe: 0.771 ± 0.159
TrpGly: 0.771 ± 0.352
TrpHis: 0.257 ± 0.242
TrpIle: 1.286 ± 0.79
TrpLys: 0.257 ± 0.242
TrpLeu: 0.771 ± 0.159
TrpMet: 0.514 ± 0.709
TrpAsn: 0.771 ± 0.482
TrpPro: 0.257 ± 0.242
TrpGln: 0.514 ± 0.127
TrpArg: 0.257 ± 0.242
TrpSer: 0.771 ± 0.482
TrpThr: 0.257 ± 0.161
TrpVal: 0.771 ± 0.159
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.257 ± 0.161
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.286 ± 0.239
TyrCys: 0.514 ± 0.484
TyrAsp: 1.029 ± 0.254
TyrGlu: 3.6 ± 1.52
TyrPhe: 2.314 ± 0.477
TyrGly: 3.086 ± 1.053
TyrHis: 1.543 ± 0.963
TyrIle: 5.657 ± 1.682
TyrLys: 4.371 ± 0.963
TyrLeu: 3.086 ± 1.753
TyrMet: 1.543 ± 0.602
TyrAsn: 2.314 ± 0.467
TyrPro: 0.771 ± 0.669
TyrGln: 2.314 ± 0.467
TyrArg: 1.8 ± 1.313
TyrSer: 2.571 ± 1.661
TyrThr: 2.828 ± 0.749
TyrVal: 2.314 ± 1.529
TyrTrp: 0.771 ± 0.159
TyrTyr: 1.286 ± 0.831
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3890 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
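A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The helper names, the use of the sample standard deviation as the error, and the toy sequences are assumptions; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not the SBV proteins.
toy_proteome = ["MKKLA", "AKK", "GGGG"]
kk_error = bootstrap_error(toy_proteome, "KK")
```

With only three proteins, as in this proteome, the resamples differ a lot from one another, which is consistent with the fairly large errors relative to the values in the table.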

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski