Amino acid dipeptide frequency for Nerine virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
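As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter input format, and the choice to map every non-standard letter to X (Xaa) are assumptions made for illustration; the exact counting convention used for the table is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (not taken from the proteome); U is treated as Xaa.
freqs = dipeptide_permille(["MAAL", "ALKU"])
```

Here `freqs` holds one entry per observed pair, and the values sum to 1000‰.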

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
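The expected value and the over/under-representation labels can be sketched as follows. This assumes the comparison is made against the uniform expectation of one over the number of possible dipeptides; the helper names `expected_permille` and `classify` are hypothetical, and the page does not spell out the exact rule behind the colouring.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

# Two values taken from the table: AlaLeu and CysCys.
ala_leu = classify(10.493)
cys_cys = classify(0.456)
```

With 20 amino acids the expectation is 2.5‰; with Xaa included (21 symbols) it drops to about 2.27‰.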

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
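For illustration, the unit conversion amounts to the following; the helper names are hypothetical.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to percent."""
    return value * 0.1

# AlaAla from the table: 9.124 per mille.
ala_ala_fraction = permille_to_fraction(9.124)
ala_ala_percent = permille_to_percent(9.124)
```

So the AlaAla entry of 9.124‰ corresponds to a fraction of about 0.009124, or roughly 0.91%.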

For more information, see the sequence space article on Wikipedia.

Rows give the first residue, columns the second (Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa). Each entry reads: dipeptide: frequency ± error (‰).
Ala
AlaAla: 9.124 ± 7.215
AlaCys: 1.369 ± 0.76
AlaAsp: 4.562 ± 2.706
AlaGlu: 2.737 ± 0.65
AlaPhe: 1.825 ± 0.678
AlaGly: 4.106 ± 2.078
AlaHis: 3.193 ± 1.273
AlaIle: 5.931 ± 2.681
AlaLys: 4.562 ± 1.585
AlaLeu: 10.493 ± 2.636
AlaMet: 0.912 ± 0.507
AlaAsn: 5.474 ± 1.809
AlaPro: 5.931 ± 1.706
AlaGln: 4.562 ± 2.676
AlaArg: 2.281 ± 0.873
AlaSer: 4.106 ± 2.979
AlaThr: 6.387 ± 0.742
AlaVal: 5.018 ± 1.191
AlaTrp: 0.456 ± 0.851
AlaTyr: 3.65 ± 1.18
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.369 ± 0.993
CysCys: 0.456 ± 0.674
CysAsp: 0.456 ± 0.253
CysGlu: 1.369 ± 0.505
CysPhe: 1.369 ± 1.206
CysGly: 0.912 ± 0.507
CysHis: 0.456 ± 0.674
CysIle: 0.456 ± 0.674
CysLys: 0.0 ± 0.0
CysLeu: 0.912 ± 0.507
CysMet: 0.912 ± 0.713
CysAsn: 0.456 ± 0.674
CysPro: 0.912 ± 0.961
CysGln: 1.369 ± 0.88
CysArg: 1.369 ± 0.76
CysSer: 2.737 ± 2.261
CysThr: 1.369 ± 0.648
CysVal: 1.369 ± 1.22
CysTrp: 0.0 ± 0.0
CysTyr: 0.456 ± 0.253
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.474 ± 1.771
AspCys: 0.912 ± 0.507
AspAsp: 2.281 ± 0.792
AspGlu: 3.193 ± 1.182
AspPhe: 2.281 ± 0.792
AspGly: 2.737 ± 1.426
AspHis: 0.0 ± 0.0
AspIle: 3.193 ± 1.182
AspLys: 0.912 ± 0.507
AspLeu: 3.65 ± 1.23
AspMet: 0.912 ± 0.713
AspAsn: 3.193 ± 0.831
AspPro: 2.281 ± 0.755
AspGln: 2.737 ± 1.295
AspArg: 0.0 ± 0.0
AspSer: 2.737 ± 1.618
AspThr: 1.369 ± 0.76
AspVal: 4.106 ± 1.007
AspTrp: 0.912 ± 0.507
AspTyr: 2.281 ± 1.513
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.931 ± 1.367
GluCys: 0.456 ± 0.253
GluAsp: 4.562 ± 2.534
GluGlu: 1.825 ± 1.014
GluPhe: 1.825 ± 0.678
GluGly: 1.825 ± 0.59
GluHis: 0.912 ± 0.507
GluIle: 3.193 ± 1.246
GluLys: 4.106 ± 1.655
GluLeu: 4.562 ± 2.672
GluMet: 0.912 ± 0.507
GluAsn: 2.737 ± 0.899
GluPro: 6.843 ± 2.155
GluGln: 2.281 ± 0.755
GluArg: 3.65 ± 1.356
GluSer: 3.193 ± 2.044
GluThr: 5.931 ± 2.138
GluVal: 2.281 ± 0.755
GluTrp: 0.456 ± 0.253
GluTyr: 1.825 ± 1.079
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.65 ± 1.23
PheCys: 1.825 ± 0.819
PheAsp: 2.737 ± 1.295
PheGlu: 3.193 ± 1.774
PhePhe: 0.912 ± 0.713
PheGly: 1.369 ± 0.505
PheHis: 1.825 ± 0.59
PheIle: 3.65 ± 1.828
PheLys: 1.825 ± 1.014
PheLeu: 3.193 ± 1.774
PheMet: 0.912 ± 0.507
PheAsn: 1.369 ± 2.422
PhePro: 0.912 ± 1.4
PheGln: 1.825 ± 1.014
PheArg: 1.369 ± 0.76
PheSer: 1.369 ± 0.505
PheThr: 2.281 ± 0.694
PheVal: 2.281 ± 1.267
PheTrp: 0.0 ± 0.0
PheTyr: 0.456 ± 0.253
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.65 ± 1.416
GlyCys: 0.912 ± 0.507
GlyAsp: 2.737 ± 1.048
GlyGlu: 1.825 ± 0.678
GlyPhe: 1.369 ± 0.76
GlyGly: 2.737 ± 1.111
GlyHis: 2.281 ± 1.267
GlyIle: 1.825 ± 1.079
GlyLys: 4.106 ± 1.134
GlyLeu: 3.65 ± 1.474
GlyMet: 0.0 ± 0.0
GlyAsn: 1.825 ± 0.881
GlyPro: 2.281 ± 0.694
GlyGln: 0.912 ± 0.507
GlyArg: 3.193 ± 0.846
GlySer: 1.825 ± 0.819
GlyThr: 3.193 ± 2.809
GlyVal: 1.825 ± 1.425
GlyTrp: 1.825 ± 1.014
GlyTyr: 2.281 ± 1.513
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.825 ± 0.678
HisCys: 1.825 ± 2.378
HisAsp: 0.456 ± 0.253
HisGlu: 1.825 ± 1.014
HisPhe: 2.737 ± 2.1
HisGly: 2.737 ± 0.841
HisHis: 2.737 ± 0.841
HisIle: 3.65 ± 1.085
HisLys: 2.281 ± 1.267
HisLeu: 2.737 ± 0.899
HisMet: 0.912 ± 0.814
HisAsn: 1.825 ± 0.881
HisPro: 2.737 ± 1.111
HisGln: 1.825 ± 1.014
HisArg: 0.912 ± 0.949
HisSer: 2.737 ± 2.086
HisThr: 3.65 ± 0.88
HisVal: 3.193 ± 1.246
HisTrp: 0.456 ± 0.674
HisTyr: 0.456 ± 0.253
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.931 ± 3.38
IleCys: 1.825 ± 1.541
IleAsp: 2.281 ± 1.727
IleGlu: 4.562 ± 1.904
IlePhe: 1.369 ± 0.76
IleGly: 1.369 ± 1.55
IleHis: 2.281 ± 1.329
IleIle: 4.562 ± 1.536
IleLys: 4.562 ± 1.136
IleLeu: 6.387 ± 2.387
IleMet: 1.369 ± 0.76
IleAsn: 2.737 ± 0.959
IlePro: 2.281 ± 1.329
IleGln: 2.281 ± 1.267
IleArg: 3.193 ± 1.061
IleSer: 4.562 ± 1.565
IleThr: 5.018 ± 1.638
IleVal: 1.825 ± 1.1
IleTrp: 0.456 ± 0.253
IleTyr: 1.825 ± 1.845
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.65 ± 1.38
LysCys: 0.912 ± 0.961
LysAsp: 2.281 ± 0.792
LysGlu: 3.193 ± 1.774
LysPhe: 0.912 ± 0.507
LysGly: 1.825 ± 0.678
LysHis: 2.737 ± 1.079
LysIle: 2.737 ± 0.962
LysLys: 1.369 ± 0.76
LysLeu: 7.755 ± 1.873
LysMet: 0.456 ± 0.48
LysAsn: 1.825 ± 0.678
LysPro: 3.65 ± 0.999
LysGln: 0.456 ± 0.253
LysArg: 1.825 ± 1.014
LysSer: 5.474 ± 0.981
LysThr: 4.562 ± 1.017
LysVal: 4.106 ± 2.281
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.843 ± 3.509
LeuCys: 0.912 ± 0.507
LeuAsp: 3.193 ± 0.916
LeuGlu: 5.474 ± 0.892
LeuPhe: 3.65 ± 2.027
LeuGly: 4.562 ± 1.017
LeuHis: 4.562 ± 1.585
LeuIle: 4.562 ± 1.505
LeuLys: 6.843 ± 3.802
LeuLeu: 5.931 ± 0.787
LeuMet: 0.912 ± 0.507
LeuAsn: 4.562 ± 1.447
LeuPro: 10.949 ± 2.046
LeuGln: 5.018 ± 1.122
LeuArg: 4.106 ± 1.608
LeuSer: 5.018 ± 3.564
LeuThr: 9.124 ± 3.118
LeuVal: 1.369 ± 1.22
LeuTrp: 1.369 ± 0.648
LeuTyr: 2.737 ± 1.048
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.456 ± 0.851
MetCys: 0.0 ± 0.0
MetAsp: 0.456 ± 0.253
MetGlu: 1.369 ± 1.22
MetPhe: 0.456 ± 0.253
MetGly: 0.912 ± 0.507
MetHis: 0.456 ± 0.253
MetIle: 0.456 ± 0.851
MetLys: 0.456 ± 0.253
MetLeu: 2.281 ± 0.792
MetMet: 0.456 ± 0.851
MetAsn: 1.369 ± 0.76
MetPro: 0.912 ± 0.961
MetGln: 0.912 ± 0.507
MetArg: 1.825 ± 0.868
MetSer: 2.737 ± 1.521
MetThr: 1.369 ± 0.648
MetVal: 1.369 ± 0.76
MetTrp: 0.0 ± 0.0
MetTyr: 1.825 ± 0.678
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.931 ± 1.527
AsnCys: 1.369 ± 1.22
AsnAsp: 2.281 ± 1.267
AsnGlu: 2.281 ± 1.267
AsnPhe: 1.369 ± 0.505
AsnGly: 0.912 ± 0.507
AsnHis: 1.369 ± 0.88
AsnIle: 3.193 ± 1.21
AsnLys: 1.825 ± 1.541
AsnLeu: 4.106 ± 1.057
AsnMet: 0.912 ± 1.522
AsnAsn: 1.369 ± 0.76
AsnPro: 2.737 ± 1.521
AsnGln: 1.825 ± 1.02
AsnArg: 0.912 ± 0.507
AsnSer: 3.65 ± 3.467
AsnThr: 5.474 ± 0.868
AsnVal: 1.369 ± 0.505
AsnTrp: 0.912 ± 0.713
AsnTyr: 2.281 ± 0.792
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.018 ± 2.426
ProCys: 0.912 ± 1.404
ProAsp: 4.562 ± 1.152
ProGlu: 9.124 ± 2.32
ProPhe: 1.825 ± 0.868
ProGly: 3.65 ± 0.83
ProHis: 3.65 ± 3.176
ProIle: 4.562 ± 2.376
ProLys: 2.737 ± 0.959
ProLeu: 4.106 ± 1.605
ProMet: 0.456 ± 0.253
ProAsn: 2.737 ± 0.857
ProPro: 8.212 ± 3.908
ProGln: 2.281 ± 1.332
ProArg: 1.825 ± 1.014
ProSer: 5.474 ± 1.563
ProThr: 4.562 ± 1.229
ProVal: 5.018 ± 1.335
ProTrp: 0.912 ± 0.507
ProTyr: 1.825 ± 0.819
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.562 ± 1.212
GlnCys: 0.0 ± 0.0
GlnAsp: 1.825 ± 0.59
GlnGlu: 1.369 ± 0.76
GlnPhe: 3.65 ± 1.353
GlnGly: 0.912 ± 0.507
GlnHis: 2.281 ± 0.952
GlnIle: 2.281 ± 0.792
GlnLys: 2.281 ± 1.338
GlnLeu: 4.106 ± 1.608
GlnMet: 1.369 ± 0.648
GlnAsn: 1.369 ± 0.76
GlnPro: 4.106 ± 1.187
GlnGln: 1.825 ± 0.678
GlnArg: 1.369 ± 1.608
GlnSer: 3.65 ± 1.356
GlnThr: 3.65 ± 0.999
GlnVal: 2.281 ± 0.694
GlnTrp: 0.912 ± 0.507
GlnTyr: 1.369 ± 0.993
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.737 ± 0.962
ArgCys: 0.456 ± 0.253
ArgAsp: 1.825 ± 0.868
ArgGlu: 3.193 ± 0.701
ArgPhe: 3.193 ± 1.805
ArgGly: 1.369 ± 1.608
ArgHis: 3.193 ± 1.561
ArgIle: 2.281 ± 1.267
ArgLys: 0.912 ± 0.507
ArgLeu: 2.281 ± 0.952
ArgMet: 0.912 ± 0.507
ArgAsn: 3.65 ± 1.707
ArgPro: 4.106 ± 1.608
ArgGln: 3.65 ± 1.209
ArgArg: 2.281 ± 1.267
ArgSer: 1.825 ± 0.59
ArgThr: 3.193 ± 0.846
ArgVal: 1.825 ± 0.678
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.281 ± 0.928
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.018 ± 3.634
SerCys: 0.912 ± 0.961
SerAsp: 4.106 ± 1.516
SerGlu: 2.281 ± 1.592
SerPhe: 1.369 ± 0.76
SerGly: 3.193 ± 1.182
SerHis: 0.912 ± 0.961
SerIle: 5.474 ± 4.193
SerLys: 5.474 ± 1.82
SerLeu: 7.299 ± 0.567
SerMet: 1.369 ± 0.655
SerAsn: 3.193 ± 1.705
SerPro: 5.018 ± 2.851
SerGln: 3.65 ± 1.85
SerArg: 4.106 ± 1.057
SerSer: 8.212 ± 0.664
SerThr: 4.106 ± 0.993
SerVal: 2.281 ± 0.952
SerTrp: 0.0 ± 0.0
SerTyr: 2.737 ± 1.944
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.843 ± 4.575
ThrCys: 2.737 ± 0.959
ThrAsp: 2.281 ± 1.338
ThrGlu: 5.474 ± 2.327
ThrPhe: 3.65 ± 1.416
ThrGly: 3.65 ± 2.049
ThrHis: 4.562 ± 1.596
ThrIle: 3.65 ± 0.923
ThrLys: 0.912 ± 0.961
ThrLeu: 9.58 ± 0.495
ThrMet: 2.281 ± 0.948
ThrAsn: 2.737 ± 1.521
ThrPro: 5.474 ± 0.892
ThrGln: 2.281 ± 0.755
ThrArg: 5.018 ± 2.987
ThrSer: 4.106 ± 1.362
ThrThr: 3.65 ± 1.416
ThrVal: 1.825 ± 1.014
ThrTrp: 0.456 ± 0.253
ThrTyr: 3.65 ± 0.923
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.65 ± 1.316
ValCys: 0.0 ± 0.0
ValAsp: 0.912 ± 0.539
ValGlu: 2.737 ± 0.962
ValPhe: 1.825 ± 1.014
ValGly: 3.193 ± 1.301
ValHis: 1.369 ± 0.88
ValIle: 3.193 ± 0.916
ValLys: 3.193 ± 1.774
ValLeu: 2.737 ± 1.521
ValMet: 1.369 ± 0.76
ValAsn: 2.281 ± 0.952
ValPro: 3.193 ± 0.701
ValGln: 1.825 ± 0.819
ValArg: 4.106 ± 1.134
ValSer: 3.65 ± 1.208
ValThr: 3.65 ± 0.833
ValVal: 2.281 ± 1.285
ValTrp: 0.456 ± 0.851
ValTyr: 0.912 ± 0.507
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.456 ± 0.253
TrpCys: 0.0 ± 0.0
TrpAsp: 0.912 ± 0.713
TrpGlu: 1.369 ± 0.505
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.456 ± 0.253
TrpIle: 0.0 ± 0.0
TrpLys: 0.912 ± 0.713
TrpLeu: 0.912 ± 0.507
TrpMet: 0.0 ± 0.0
TrpAsn: 0.912 ± 0.713
TrpPro: 0.456 ± 0.253
TrpGln: 1.369 ± 0.76
TrpArg: 0.0 ± 0.0
TrpSer: 0.912 ± 0.713
TrpThr: 0.456 ± 0.253
TrpVal: 0.912 ± 0.507
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.562 ± 1.843
TyrCys: 0.912 ± 0.961
TyrAsp: 0.456 ± 0.253
TyrGlu: 0.912 ± 0.507
TyrPhe: 1.369 ± 0.648
TyrGly: 2.281 ± 0.952
TyrHis: 2.281 ± 0.952
TyrIle: 1.825 ± 0.678
TyrLys: 0.912 ± 0.713
TyrLeu: 5.018 ± 2.878
TyrMet: 2.281 ± 1.267
TyrAsn: 0.456 ± 0.674
TyrPro: 0.456 ± 0.253
TyrGln: 2.281 ± 1.727
TyrArg: 1.825 ± 1.557
TyrSer: 2.737 ± 1.251
TyrThr: 1.825 ± 2.24
TyrVal: 0.0 ± 0.0
TyrTrp: 0.456 ± 0.253
TyrTyr: 0.456 ± 0.253
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2193 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
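A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Using the sample standard deviation as the error measure, the fixed seed, and the names `pair_permille` and `bootstrap_error` are assumptions made for illustration; the page does not describe the procedure in more detail.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(replicates)

# Toy sequences, not taken from the proteome.
example = ["MAAL", "ALKA", "GGGG"]
err = bootstrap_error(example, "AL")
```

With only 5 proteins in this proteome, each replicate can differ substantially from the original set, which is consistent with the relatively large errors in the table.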

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski