Amino acid dipepetide frequency for Youcai mosaic virus (YoMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.929AlaAla: 7.929 ± 2.069
0.991AlaCys: 0.991 ± 0.509
2.973AlaAsp: 2.973 ± 0.827
4.955AlaGlu: 4.955 ± 2.487
2.973AlaPhe: 2.973 ± 0.827
4.955AlaGly: 4.955 ± 1.113
1.487AlaHis: 1.487 ± 0.829
5.451AlaIle: 5.451 ± 2.028
4.46AlaLys: 4.46 ± 1.47
6.442AlaLeu: 6.442 ± 1.76
5.451AlaMet: 5.451 ± 2.293
3.469AlaAsn: 3.469 ± 1.317
1.487AlaPro: 1.487 ± 1.369
0.496AlaGln: 0.496 ± 0.255
2.478AlaArg: 2.478 ± 1.243
4.46AlaSer: 4.46 ± 1.563
3.964AlaThr: 3.964 ± 5.977
5.946AlaVal: 5.946 ± 0.76
0.991AlaTrp: 0.991 ± 0.509
1.982AlaTyr: 1.982 ± 1.018
0.0AlaXaa: 0.0 ± 0.0
Cys
0.991CysAla: 0.991 ± 0.509
0.991CysCys: 0.991 ± 0.509
3.964CysAsp: 3.964 ± 1.154
0.496CysGlu: 0.496 ± 0.255
0.0CysPhe: 0.0 ± 0.0
1.982CysGly: 1.982 ± 1.018
0.496CysHis: 0.496 ± 0.255
0.496CysIle: 0.496 ± 1.156
2.478CysLys: 2.478 ± 1.273
0.991CysLeu: 0.991 ± 0.509
0.496CysMet: 0.496 ± 0.255
0.0CysAsn: 0.0 ± 0.0
0.991CysPro: 0.991 ± 0.509
0.496CysGln: 0.496 ± 0.255
1.487CysArg: 1.487 ± 0.764
0.496CysSer: 0.496 ± 0.255
0.991CysThr: 0.991 ± 0.509
1.487CysVal: 1.487 ± 1.369
0.0CysTrp: 0.0 ± 0.0
0.496CysTyr: 0.496 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
5.946AspAla: 5.946 ± 0.624
0.991AspCys: 0.991 ± 0.509
2.478AspAsp: 2.478 ± 1.317
5.451AspGlu: 5.451 ± 2.8
2.973AspPhe: 2.973 ± 2.92
1.982AspGly: 1.982 ± 1.018
0.0AspHis: 0.0 ± 0.0
4.955AspIle: 4.955 ± 1.58
4.955AspLys: 4.955 ± 1.491
5.451AspLeu: 5.451 ± 1.809
1.982AspMet: 1.982 ± 1.018
0.991AspAsn: 0.991 ± 0.509
2.478AspPro: 2.478 ± 0.746
0.0AspGln: 0.0 ± 0.0
3.469AspArg: 3.469 ± 0.971
4.955AspSer: 4.955 ± 2.38
4.955AspThr: 4.955 ± 2.487
5.946AspVal: 5.946 ± 2.238
0.991AspTrp: 0.991 ± 0.509
1.487AspTyr: 1.487 ± 0.764
0.0AspXaa: 0.0 ± 0.0
Glu
3.469GluAla: 3.469 ± 0.869
0.496GluCys: 0.496 ± 1.156
2.478GluAsp: 2.478 ± 1.273
6.442GluGlu: 6.442 ± 1.951
5.451GluPhe: 5.451 ± 1.916
3.964GluGly: 3.964 ± 1.492
0.0GluHis: 0.0 ± 0.0
3.469GluIle: 3.469 ± 0.971
3.964GluLys: 3.964 ± 1.154
7.433GluLeu: 7.433 ± 1.424
0.991GluMet: 0.991 ± 0.509
2.973GluAsn: 2.973 ± 1.658
1.982GluPro: 1.982 ± 1.555
0.496GluGln: 0.496 ± 0.255
1.982GluArg: 1.982 ± 0.746
6.442GluSer: 6.442 ± 2.828
2.478GluThr: 2.478 ± 1.243
5.451GluVal: 5.451 ± 2.028
0.991GluTrp: 0.991 ± 0.973
3.469GluTyr: 3.469 ± 1.782
0.0GluXaa: 0.0 ± 0.0
Phe
1.487PheAla: 1.487 ± 1.369
0.991PheCys: 0.991 ± 0.509
2.973PheAsp: 2.973 ± 1.255
2.478PheGlu: 2.478 ± 1.317
2.973PhePhe: 2.973 ± 1.527
1.487PheGly: 1.487 ± 0.764
2.973PheHis: 2.973 ± 1.527
1.487PheIle: 1.487 ± 0.764
4.46PheLys: 4.46 ± 1.47
3.964PheLeu: 3.964 ± 1.154
0.496PheMet: 0.496 ± 0.255
0.496PheAsn: 0.496 ± 0.255
2.478PhePro: 2.478 ± 1.243
2.973PheGln: 2.973 ± 0.827
2.478PheArg: 2.478 ± 1.79
5.451PheSer: 5.451 ± 1.916
2.973PheThr: 2.973 ± 0.827
5.451PheVal: 5.451 ± 2.296
0.991PheTrp: 0.991 ± 0.509
1.487PheTyr: 1.487 ± 0.764
0.0PheXaa: 0.0 ± 0.0
Gly
3.964GlyAla: 3.964 ± 2.036
1.487GlyCys: 1.487 ± 0.764
1.982GlyAsp: 1.982 ± 1.018
1.982GlyGlu: 1.982 ± 0.746
0.991GlyPhe: 0.991 ± 0.973
2.973GlyGly: 2.973 ± 1.658
1.487GlyHis: 1.487 ± 1.369
1.487GlyIle: 1.487 ± 0.764
3.964GlyLys: 3.964 ± 2.036
4.955GlyLeu: 4.955 ± 1.113
0.496GlyMet: 0.496 ± 0.255
1.487GlyAsn: 1.487 ± 0.764
1.487GlyPro: 1.487 ± 0.829
0.496GlyGln: 0.496 ± 0.255
3.469GlyArg: 3.469 ± 1.557
1.487GlySer: 1.487 ± 0.764
2.478GlyThr: 2.478 ± 0.746
5.451GlyVal: 5.451 ± 3.441
0.496GlyTrp: 0.496 ± 1.156
1.487GlyTyr: 1.487 ± 1.369
0.0GlyXaa: 0.0 ± 0.0
His
0.496HisAla: 0.496 ± 0.255
1.982HisCys: 1.982 ± 1.018
0.496HisAsp: 0.496 ± 0.255
1.982HisGlu: 1.982 ± 0.746
1.982HisPhe: 1.982 ± 1.018
0.991HisGly: 0.991 ± 1.494
0.991HisHis: 0.991 ± 0.973
0.991HisIle: 0.991 ± 0.509
1.982HisLys: 1.982 ± 1.018
1.487HisLeu: 1.487 ± 0.764
0.496HisMet: 0.496 ± 0.255
0.991HisAsn: 0.991 ± 0.509
0.496HisPro: 0.496 ± 0.255
0.0HisGln: 0.0 ± 0.0
0.496HisArg: 0.496 ± 0.255
2.478HisSer: 2.478 ± 0.746
1.487HisThr: 1.487 ± 0.764
1.487HisVal: 1.487 ± 0.829
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.982IleAla: 1.982 ± 0.746
0.991IleCys: 0.991 ± 0.509
4.46IleAsp: 4.46 ± 2.487
1.982IleGlu: 1.982 ± 1.555
1.487IlePhe: 1.487 ± 0.764
2.478IleGly: 2.478 ± 1.79
0.496IleHis: 0.496 ± 0.255
4.955IleIle: 4.955 ± 0.51
4.46IleLys: 4.46 ± 2.291
2.973IleLeu: 2.973 ± 0.827
0.496IleMet: 0.496 ± 0.255
1.982IleAsn: 1.982 ± 1.283
3.469IlePro: 3.469 ± 1.782
1.982IleGln: 1.982 ± 1.555
1.982IleArg: 1.982 ± 1.283
4.46IleSer: 4.46 ± 1.47
1.982IleThr: 1.982 ± 1.283
2.973IleVal: 2.973 ± 1.255
0.991IleTrp: 0.991 ± 0.509
0.991IleTyr: 0.991 ± 0.509
0.0IleXaa: 0.0 ± 0.0
Lys
3.469LysAla: 3.469 ± 1.557
0.496LysCys: 0.496 ± 0.255
4.46LysAsp: 4.46 ± 1.47
7.433LysGlu: 7.433 ± 3.039
2.973LysPhe: 2.973 ± 1.527
4.46LysGly: 4.46 ± 1.47
0.991LysHis: 0.991 ± 0.509
3.469LysIle: 3.469 ± 0.971
5.451LysLys: 5.451 ± 3.441
7.433LysLeu: 7.433 ± 2.77
0.496LysMet: 0.496 ± 0.255
1.487LysAsn: 1.487 ± 0.764
3.469LysPro: 3.469 ± 0.869
0.991LysGln: 0.991 ± 0.509
3.964LysArg: 3.964 ± 2.614
4.46LysSer: 4.46 ± 1.563
5.946LysThr: 5.946 ± 2.044
4.955LysVal: 4.955 ± 1.491
0.0LysTrp: 0.0 ± 0.0
4.46LysTyr: 4.46 ± 1.47
0.0LysXaa: 0.0 ± 0.0
Leu
5.946LeuAla: 5.946 ± 1.655
3.469LeuCys: 3.469 ± 0.971
5.946LeuAsp: 5.946 ± 2.044
8.92LeuGlu: 8.92 ± 2.482
4.46LeuPhe: 4.46 ± 2.291
2.973LeuGly: 2.973 ± 0.827
0.991LeuHis: 0.991 ± 0.509
1.487LeuIle: 1.487 ± 0.764
5.451LeuLys: 5.451 ± 0.866
7.929LeuLeu: 7.929 ± 4.883
2.973LeuMet: 2.973 ± 2.738
6.442LeuAsn: 6.442 ± 2.561
3.469LeuPro: 3.469 ± 0.869
2.478LeuGln: 2.478 ± 1.79
4.46LeuArg: 4.46 ± 1.359
9.911LeuSer: 9.911 ± 3.875
5.946LeuThr: 5.946 ± 1.655
9.415LeuVal: 9.415 ± 2.609
0.496LeuTrp: 0.496 ± 0.255
0.991LeuTyr: 0.991 ± 1.494
0.0LeuXaa: 0.0 ± 0.0
Met
0.991MetAla: 0.991 ± 0.509
0.0MetCys: 0.0 ± 0.0
1.487MetAsp: 1.487 ± 0.764
0.496MetGlu: 0.496 ± 0.255
0.496MetPhe: 0.496 ± 0.255
0.496MetGly: 0.496 ± 0.255
0.496MetHis: 0.496 ± 0.255
1.982MetIle: 1.982 ± 1.018
1.487MetLys: 1.487 ± 1.369
2.973MetLeu: 2.973 ± 1.255
0.991MetMet: 0.991 ± 0.509
1.982MetAsn: 1.982 ± 1.283
0.991MetPro: 0.991 ± 0.509
1.487MetGln: 1.487 ± 0.764
0.0MetArg: 0.0 ± 0.0
1.487MetSer: 1.487 ± 0.829
0.991MetThr: 0.991 ± 0.509
1.982MetVal: 1.982 ± 1.555
0.496MetTrp: 0.496 ± 1.65
0.496MetTyr: 0.496 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
3.964AsnAla: 3.964 ± 1.423
0.496AsnCys: 0.496 ± 0.255
0.991AsnAsp: 0.991 ± 0.973
1.982AsnGlu: 1.982 ± 1.283
4.955AsnPhe: 4.955 ± 2.546
1.982AsnGly: 1.982 ± 1.555
0.991AsnHis: 0.991 ± 0.509
0.991AsnIle: 0.991 ± 1.494
1.487AsnLys: 1.487 ± 0.829
5.451AsnLeu: 5.451 ± 0.595
1.487AsnMet: 1.487 ± 0.764
0.991AsnAsn: 0.991 ± 0.973
0.991AsnPro: 0.991 ± 0.509
2.478AsnGln: 2.478 ± 4.629
1.487AsnArg: 1.487 ± 3.137
3.469AsnSer: 3.469 ± 1.867
1.487AsnThr: 1.487 ± 0.764
4.46AsnVal: 4.46 ± 2.487
0.496AsnTrp: 0.496 ± 0.255
1.982AsnTyr: 1.982 ± 0.746
0.0AsnXaa: 0.0 ± 0.0
Pro
3.964ProAla: 3.964 ± 1.423
0.496ProCys: 0.496 ± 0.255
2.478ProAsp: 2.478 ± 1.79
3.469ProGlu: 3.469 ± 0.869
2.478ProPhe: 2.478 ± 1.273
1.487ProGly: 1.487 ± 0.764
0.991ProHis: 0.991 ± 0.509
0.496ProIle: 0.496 ± 0.255
4.955ProLys: 4.955 ± 1.491
4.46ProLeu: 4.46 ± 1.364
1.487ProMet: 1.487 ± 0.764
2.973ProAsn: 2.973 ± 2.12
0.991ProPro: 0.991 ± 0.509
0.0ProGln: 0.0 ± 0.0
0.496ProArg: 0.496 ± 0.255
0.991ProSer: 0.991 ± 1.494
2.478ProThr: 2.478 ± 1.243
1.982ProVal: 1.982 ± 0.746
0.991ProTrp: 0.991 ± 2.043
0.496ProTyr: 0.496 ± 0.255
0.0ProXaa: 0.0 ± 0.0
Gln
3.964GlnAla: 3.964 ± 1.423
0.991GlnCys: 0.991 ± 1.494
1.487GlnAsp: 1.487 ± 0.764
0.991GlnGlu: 0.991 ± 0.509
1.982GlnPhe: 1.982 ± 1.555
0.991GlnGly: 0.991 ± 0.509
0.496GlnHis: 0.496 ± 1.156
1.982GlnIle: 1.982 ± 1.283
1.982GlnLys: 1.982 ± 1.018
1.982GlnLeu: 1.982 ± 1.283
0.496GlnMet: 0.496 ± 0.255
0.991GlnAsn: 0.991 ± 0.973
0.991GlnPro: 0.991 ± 0.973
2.973GlnGln: 2.973 ± 1.255
3.469GlnArg: 3.469 ± 2.641
0.991GlnSer: 0.991 ± 1.494
2.478GlnThr: 2.478 ± 1.243
1.487GlnVal: 1.487 ± 0.829
0.0GlnTrp: 0.0 ± 0.0
1.487GlnTyr: 1.487 ± 3.528
0.496GlnXaa: 0.496 ± 0.255
Arg
3.964ArgAla: 3.964 ± 1.615
0.991ArgCys: 0.991 ± 0.509
4.955ArgAsp: 4.955 ± 0.51
1.487ArgGlu: 1.487 ± 0.829
0.991ArgPhe: 0.991 ± 2.043
2.478ArgGly: 2.478 ± 1.273
0.991ArgHis: 0.991 ± 0.509
1.982ArgIle: 1.982 ± 2.627
3.469ArgLys: 3.469 ± 1.557
4.46ArgLeu: 4.46 ± 1.47
0.991ArgMet: 0.991 ± 0.509
1.487ArgAsn: 1.487 ± 1.369
2.973ArgPro: 2.973 ± 1.086
1.487ArgGln: 1.487 ± 1.369
4.46ArgArg: 4.46 ± 1.359
3.469ArgSer: 3.469 ± 0.869
2.973ArgThr: 2.973 ± 1.527
4.955ArgVal: 4.955 ± 2.487
1.487ArgTrp: 1.487 ± 0.764
2.478ArgTyr: 2.478 ± 3.093
0.0ArgXaa: 0.0 ± 0.0
Ser
4.46SerAla: 4.46 ± 5.839
0.991SerCys: 0.991 ± 0.509
3.469SerAsp: 3.469 ± 2.76
5.946SerGlu: 5.946 ± 0.76
3.964SerPhe: 3.964 ± 0.679
2.973SerGly: 2.973 ± 1.527
0.991SerHis: 0.991 ± 0.509
2.973SerIle: 2.973 ± 0.827
4.955SerLys: 4.955 ± 1.58
6.938SerLeu: 6.938 ± 0.239
0.496SerMet: 0.496 ± 0.255
5.451SerAsn: 5.451 ± 3.815
0.496SerPro: 0.496 ± 0.255
3.964SerGln: 3.964 ± 2.565
3.964SerArg: 3.964 ± 1.423
3.469SerSer: 3.469 ± 1.317
2.973SerThr: 2.973 ± 0.827
7.929SerVal: 7.929 ± 5.229
0.0SerTrp: 0.0 ± 0.0
1.487SerTyr: 1.487 ± 1.797
0.0SerXaa: 0.0 ± 0.0
Thr
5.946ThrAla: 5.946 ± 2.511
0.496ThrCys: 0.496 ± 0.255
1.487ThrAsp: 1.487 ± 0.764
1.982ThrGlu: 1.982 ± 1.283
3.469ThrPhe: 3.469 ± 1.782
2.478ThrGly: 2.478 ± 0.746
0.991ThrHis: 0.991 ± 0.509
2.973ThrIle: 2.973 ± 1.527
1.982ThrLys: 1.982 ± 1.947
6.938ThrLeu: 6.938 ± 3.564
0.496ThrMet: 0.496 ± 1.036
0.991ThrAsn: 0.991 ± 0.973
1.982ThrPro: 1.982 ± 1.283
3.469ThrGln: 3.469 ± 2.641
4.955ThrArg: 4.955 ± 0.51
2.973ThrSer: 2.973 ± 1.255
1.982ThrThr: 1.982 ± 1.018
6.442ThrVal: 6.442 ± 1.951
1.487ThrTrp: 1.487 ± 0.764
3.469ThrTyr: 3.469 ± 0.971
0.0ThrXaa: 0.0 ± 0.0
Val
6.938ValAla: 6.938 ± 2.575
0.991ValCys: 0.991 ± 0.509
7.929ValAsp: 7.929 ± 0.494
3.469ValGlu: 3.469 ± 0.971
2.478ValPhe: 2.478 ± 1.79
1.982ValGly: 1.982 ± 1.947
4.46ValHis: 4.46 ± 1.359
4.955ValIle: 4.955 ± 2.38
4.46ValLys: 4.46 ± 2.487
7.929ValLeu: 7.929 ± 1.765
0.496ValMet: 0.496 ± 0.225
5.946ValAsn: 5.946 ± 2.238
5.451ValPro: 5.451 ± 3.441
2.478ValGln: 2.478 ± 1.273
3.964ValArg: 3.964 ± 0.679
5.451ValSer: 5.451 ± 4.92
5.946ValThr: 5.946 ± 0.624
6.442ValVal: 6.442 ± 4.404
0.991ValTrp: 0.991 ± 0.509
5.946ValTyr: 5.946 ± 2.511
0.0ValXaa: 0.0 ± 0.0
Trp
0.991TrpAla: 0.991 ± 3.299
0.0TrpCys: 0.0 ± 0.0
2.478TrpAsp: 2.478 ± 1.273
0.991TrpGlu: 0.991 ± 0.509
0.991TrpPhe: 0.991 ± 0.509
0.496TrpGly: 0.496 ± 0.255
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.982TrpLeu: 1.982 ± 0.746
0.0TrpMet: 0.0 ± 0.0
0.496TrpAsn: 0.496 ± 0.255
0.0TrpPro: 0.0 ± 0.0
0.991TrpGln: 0.991 ± 2.313
0.991TrpArg: 0.991 ± 0.509
0.496TrpSer: 0.496 ± 0.255
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.991TrpTyr: 0.991 ± 0.509
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.478TyrAla: 2.478 ± 0.746
1.487TyrCys: 1.487 ± 0.764
3.469TyrAsp: 3.469 ± 1.782
1.487TyrGlu: 1.487 ± 1.797
2.478TyrPhe: 2.478 ± 1.317
0.496TyrGly: 0.496 ± 0.255
0.991TyrHis: 0.991 ± 0.509
0.991TyrIle: 0.991 ± 1.494
4.46TyrLys: 4.46 ± 1.359
1.982TyrLeu: 1.982 ± 1.018
0.0TyrMet: 0.0 ± 0.0
0.991TyrAsn: 0.991 ± 1.494
1.487TyrPro: 1.487 ± 0.764
2.478TyrGln: 2.478 ± 2.855
2.478TyrArg: 2.478 ± 4.957
0.496TyrSer: 0.496 ± 0.255
2.973TyrThr: 2.973 ± 1.527
4.46TyrVal: 4.46 ± 1.359
0.0TyrTrp: 0.0 ± 0.0
1.982TyrTyr: 1.982 ± 1.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.496XaaGln: 0.496 ± 0.255
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2019 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski