Amino acid dipeptide frequency for Maize-associated totivirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
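
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a few sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille` and the toy sequences are hypothetical, and it assumes one-letter codes and that pairs are never counted across protein boundaries.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not the real proteome.
toy = ["MALAL", "AAK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example AlaLeu rather than AL); the counting principle is the same.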

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
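
The unit conversion and the uniform expectation can be checked with a few lines of Python. This is only a sketch: the helper names are hypothetical, and using exactly 1/400 as the cut-off between over- and underrepresented dipeptides is an assumption based on the text above, not a documented rule of the database.

```python
# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > EXPECTED_PERMILLE:
        return "over"
    if value < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
for name, value in [("AlaAla", 10.296), ("CysCys", 0.0)]:
    print(name, permille_to_fraction(value), classify(value))
```

With these assumptions AlaAla (10.296‰) falls above the expectation and CysCys (0.0‰) falls below it.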

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.296AlaAla: 10.296 ± 0.498
1.872AlaCys: 1.872 ± 0.285
3.744AlaAsp: 3.744 ± 0.941
5.616AlaGlu: 5.616 ± 0.855
4.368AlaPhe: 4.368 ± 1.337
3.12AlaGly: 3.12 ± 0.811
1.248AlaHis: 1.248 ± 0.022
5.928AlaIle: 5.928 ± 0.686
5.616AlaLys: 5.616 ± 0.656
10.608AlaLeu: 10.608 ± 0.944
4.368AlaMet: 4.368 ± 0.537
1.872AlaAsn: 1.872 ± 0.722
1.248AlaPro: 1.248 ± 0.508
3.12AlaGln: 3.12 ± 0.7
5.616AlaArg: 5.616 ± 0.153
1.872AlaSer: 1.872 ± 0.219
7.176AlaThr: 7.176 ± 1.395
6.24AlaVal: 6.24 ± 0.897
1.872AlaTrp: 1.872 ± 0.285
2.496AlaTyr: 2.496 ± 0.963
0.0AlaXaa: 0.0 ± 0.0
Cys
1.872CysAla: 1.872 ± 0.219
0.0CysCys: 0.0 ± 0.0
1.872CysAsp: 1.872 ± 0.219
2.496CysGlu: 2.496 ± 0.46
0.624CysPhe: 0.624 ± 0.241
0.0CysGly: 0.0 ± 0.0
0.624CysHis: 0.624 ± 0.263
0.0CysIle: 0.0 ± 0.0
0.624CysLys: 0.624 ± 0.263
1.248CysLeu: 1.248 ± 0.48
0.0CysMet: 0.0 ± 0.0
0.936CysAsn: 0.936 ± 0.318
0.936CysPro: 0.936 ± 0.332
1.248CysGln: 1.248 ± 0.526
0.312CysArg: 0.312 ± 0.207
0.624CysSer: 0.624 ± 0.263
0.624CysThr: 0.624 ± 0.263
0.624CysVal: 0.624 ± 0.241
0.0CysTrp: 0.0 ± 0.0
0.624CysTyr: 0.624 ± 0.241
0.0CysXaa: 0.0 ± 0.0
Asp
5.304AspAla: 5.304 ± 1.378
0.624AspCys: 0.624 ± 0.263
4.368AspAsp: 4.368 ± 0.175
3.12AspGlu: 3.12 ± 0.307
3.744AspPhe: 3.744 ± 0.438
3.12AspGly: 3.12 ± 0.197
0.0AspHis: 0.0 ± 0.0
4.368AspIle: 4.368 ± 0.833
2.496AspLys: 2.496 ± 0.963
4.368AspLeu: 4.368 ± 0.175
1.248AspMet: 1.248 ± 0.482
3.12AspAsn: 3.12 ± 0.307
1.872AspPro: 1.872 ± 0.789
3.432AspGln: 3.432 ± 0.833
3.12AspArg: 3.12 ± 0.307
4.368AspSer: 4.368 ± 0.175
3.432AspThr: 3.432 ± 0.282
6.864AspVal: 6.864 ± 0.131
0.624AspTrp: 0.624 ± 0.241
0.624AspTyr: 0.624 ± 0.241
0.0AspXaa: 0.0 ± 0.0
Glu
4.056GluAla: 4.056 ± 0.964
1.872GluCys: 1.872 ± 0.789
4.368GluAsp: 4.368 ± 0.329
2.496GluGlu: 2.496 ± 0.46
3.744GluPhe: 3.744 ± 0.57
3.12GluGly: 3.12 ± 0.307
3.12GluHis: 3.12 ± 0.197
4.368GluIle: 4.368 ± 1.182
1.872GluLys: 1.872 ± 0.285
4.992GluLeu: 4.992 ± 0.415
0.624GluMet: 0.624 ± 0.263
0.624GluAsn: 0.624 ± 0.263
1.872GluPro: 1.872 ± 0.722
2.184GluGln: 2.184 ± 0.505
4.992GluArg: 4.992 ± 0.919
2.496GluSer: 2.496 ± 0.045
2.808GluThr: 2.808 ± 1.334
3.432GluVal: 3.432 ± 0.282
1.872GluTrp: 1.872 ± 0.219
3.744GluTyr: 3.744 ± 0.57
0.0GluXaa: 0.0 ± 0.0
Phe
2.496PheAla: 2.496 ± 0.045
1.872PheCys: 1.872 ± 0.285
4.992PheAsp: 4.992 ± 0.089
4.368PheGlu: 4.368 ± 0.175
1.872PhePhe: 1.872 ± 0.789
0.624PheGly: 0.624 ± 0.241
1.248PheHis: 1.248 ± 0.482
1.872PheIle: 1.872 ± 0.219
4.992PheLys: 4.992 ± 0.592
4.368PheLeu: 4.368 ± 0.175
1.872PheMet: 1.872 ± 0.285
2.496PheAsn: 2.496 ± 0.045
1.248PhePro: 1.248 ± 0.022
1.872PheGln: 1.872 ± 0.285
1.56PheArg: 1.56 ± 0.818
2.808PheSer: 2.808 ± 0.206
0.624PheThr: 0.624 ± 0.263
4.368PheVal: 4.368 ± 0.833
0.624PheTrp: 0.624 ± 0.263
1.248PheTyr: 1.248 ± 0.022
0.0PheXaa: 0.0 ± 0.0
Gly
6.24GlyAla: 6.24 ± 0.614
0.936GlyCys: 0.936 ± 0.318
2.496GlyAsp: 2.496 ± 0.045
3.12GlyGlu: 3.12 ± 0.618
3.12GlyPhe: 3.12 ± 0.307
4.68GlyGly: 4.68 ± 0.236
0.0GlyHis: 0.0 ± 0.0
2.496GlyIle: 2.496 ± 0.46
3.12GlyLys: 3.12 ± 0.307
6.24GlyLeu: 6.24 ± 1.118
2.496GlyMet: 2.496 ± 0.548
0.312GlyAsn: 0.312 ± 0.207
1.56GlyPro: 1.56 ± 0.356
2.184GlyGln: 2.184 ± 0.217
3.744GlyArg: 3.744 ± 0.57
1.56GlySer: 1.56 ± 0.205
3.744GlyThr: 3.744 ± 1.074
5.304GlyVal: 5.304 ± 0.416
0.624GlyTrp: 0.624 ± 0.241
1.248GlyTyr: 1.248 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
2.496HisAla: 2.496 ± 0.46
0.0HisCys: 0.0 ± 0.0
2.496HisAsp: 2.496 ± 0.045
0.624HisGlu: 0.624 ± 0.241
0.624HisPhe: 0.624 ± 0.263
1.248HisGly: 1.248 ± 0.482
1.248HisHis: 1.248 ± 0.482
0.624HisIle: 0.624 ± 0.241
1.248HisLys: 1.248 ± 0.022
2.496HisLeu: 2.496 ± 0.46
0.0HisMet: 0.0 ± 0.0
1.872HisAsn: 1.872 ± 0.219
0.0HisPro: 0.0 ± 0.0
0.624HisGln: 0.624 ± 0.241
3.744HisArg: 3.744 ± 0.941
1.872HisSer: 1.872 ± 0.722
2.496HisThr: 2.496 ± 0.045
1.872HisVal: 1.872 ± 0.285
0.0HisTrp: 0.0 ± 0.0
1.248HisTyr: 1.248 ± 0.482
0.0HisXaa: 0.0 ± 0.0
Ile
4.368IleAla: 4.368 ± 0.678
0.624IleCys: 0.624 ± 0.241
3.744IleAsp: 3.744 ± 0.067
2.496IleGlu: 2.496 ± 0.548
3.12IlePhe: 3.12 ± 0.197
3.12IleGly: 3.12 ± 0.307
3.12IleHis: 3.12 ± 0.7
1.248IleIle: 1.248 ± 0.482
2.496IleLys: 2.496 ± 0.46
2.496IleLeu: 2.496 ± 0.46
1.248IleMet: 1.248 ± 0.482
3.744IleAsn: 3.744 ± 0.57
4.056IlePro: 4.056 ± 0.71
0.624IleGln: 0.624 ± 0.241
3.12IleArg: 3.12 ± 0.197
4.368IleSer: 4.368 ± 0.175
4.056IleThr: 4.056 ± 0.482
3.12IleVal: 3.12 ± 0.7
1.872IleTrp: 1.872 ± 0.219
1.872IleTyr: 1.872 ± 0.285
0.0IleXaa: 0.0 ± 0.0
Lys
1.872LysAla: 1.872 ± 0.722
0.624LysCys: 0.624 ± 0.241
0.624LysAsp: 0.624 ± 0.241
4.992LysGlu: 4.992 ± 0.415
2.496LysPhe: 2.496 ± 0.548
1.248LysGly: 1.248 ± 0.022
1.248LysHis: 1.248 ± 0.482
4.992LysIle: 4.992 ± 0.089
0.624LysLys: 0.624 ± 0.263
1.248LysLeu: 1.248 ± 0.526
1.872LysMet: 1.872 ± 0.219
2.496LysAsn: 2.496 ± 0.46
1.248LysPro: 1.248 ± 0.482
3.12LysGln: 3.12 ± 0.307
2.496LysArg: 2.496 ± 0.963
1.872LysSer: 1.872 ± 0.219
3.12LysThr: 3.12 ± 0.7
2.496LysVal: 2.496 ± 0.045
0.624LysTrp: 0.624 ± 0.263
4.368LysTyr: 4.368 ± 0.678
0.0LysXaa: 0.0 ± 0.0
Leu
5.928LeuAla: 5.928 ± 0.247
0.0LeuCys: 0.0 ± 0.0
5.616LeuAsp: 5.616 ± 0.855
3.12LeuGlu: 3.12 ± 0.307
1.872LeuPhe: 1.872 ± 0.285
4.056LeuGly: 4.056 ± 0.209
1.248LeuHis: 1.248 ± 0.022
1.248LeuIle: 1.248 ± 0.022
3.744LeuLys: 3.744 ± 0.067
7.488LeuLeu: 7.488 ± 0.134
1.872LeuMet: 1.872 ± 0.381
3.744LeuAsn: 3.744 ± 0.438
6.24LeuPro: 6.24 ± 1.622
1.872LeuGln: 1.872 ± 0.285
7.488LeuArg: 7.488 ± 0.134
4.68LeuSer: 4.68 ± 0.382
9.36LeuThr: 9.36 ± 0.088
6.24LeuVal: 6.24 ± 1.904
0.624LeuTrp: 0.624 ± 0.241
4.368LeuTyr: 4.368 ± 0.329
0.0LeuXaa: 0.0 ± 0.0
Met
1.248MetAla: 1.248 ± 0.022
0.0MetCys: 0.0 ± 0.0
1.872MetAsp: 1.872 ± 0.285
0.624MetGlu: 0.624 ± 0.263
1.248MetPhe: 1.248 ± 0.482
1.248MetGly: 1.248 ± 0.022
1.248MetHis: 1.248 ± 0.022
1.248MetIle: 1.248 ± 0.022
0.624MetLys: 0.624 ± 0.263
3.744MetLeu: 3.744 ± 0.067
1.248MetMet: 1.248 ± 0.022
1.872MetAsn: 1.872 ± 0.285
1.872MetPro: 1.872 ± 0.285
0.624MetGln: 0.624 ± 0.241
1.248MetArg: 1.248 ± 0.526
2.496MetSer: 2.496 ± 0.045
1.872MetThr: 1.872 ± 0.285
0.624MetVal: 0.624 ± 0.241
0.624MetTrp: 0.624 ± 0.241
1.248MetTyr: 1.248 ± 0.482
0.0MetXaa: 0.0 ± 0.0
Asn
3.12AsnAla: 3.12 ± 0.197
0.0AsnCys: 0.0 ± 0.0
1.872AsnAsp: 1.872 ± 0.219
3.744AsnGlu: 3.744 ± 0.57
2.496AsnPhe: 2.496 ± 0.548
1.872AsnGly: 1.872 ± 0.285
1.248AsnHis: 1.248 ± 0.022
4.368AsnIle: 4.368 ± 0.678
0.0AsnLys: 0.0 ± 0.0
2.496AsnLeu: 2.496 ± 0.548
3.12AsnMet: 3.12 ± 0.307
1.872AsnAsn: 1.872 ± 0.285
0.624AsnPro: 0.624 ± 0.263
0.624AsnGln: 0.624 ± 0.263
2.184AsnArg: 2.184 ± 0.348
0.624AsnSer: 0.624 ± 0.241
1.872AsnThr: 1.872 ± 0.285
1.872AsnVal: 1.872 ± 0.722
0.936AsnTrp: 0.936 ± 0.332
2.496AsnTyr: 2.496 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
3.744ProAla: 3.744 ± 0.57
0.624ProCys: 0.624 ± 0.241
2.808ProAsp: 2.808 ± 0.84
6.864ProGlu: 6.864 ± 0.374
1.872ProPhe: 1.872 ± 0.789
4.368ProGly: 4.368 ± 0.329
0.624ProHis: 0.624 ± 0.241
3.744ProIle: 3.744 ± 0.941
0.624ProLys: 0.624 ± 0.241
3.12ProLeu: 3.12 ± 0.307
1.248ProMet: 1.248 ± 0.526
0.624ProAsn: 0.624 ± 0.263
4.056ProPro: 4.056 ± 0.277
0.624ProGln: 0.624 ± 0.241
1.872ProArg: 1.872 ± 0.285
0.0ProSer: 0.0 ± 0.0
3.744ProThr: 3.744 ± 0.57
3.744ProVal: 3.744 ± 0.938
0.0ProTrp: 0.0 ± 0.0
1.872ProTyr: 1.872 ± 0.219
0.0ProXaa: 0.0 ± 0.0
Gln
6.552GlnAla: 6.552 ± 0.256
1.56GlnCys: 1.56 ± 0.205
2.496GlnAsp: 2.496 ± 0.548
2.184GlnGlu: 2.184 ± 0.589
1.248GlnPhe: 1.248 ± 0.526
1.248GlnGly: 1.248 ± 0.022
2.496GlnHis: 2.496 ± 0.963
1.248GlnIle: 1.248 ± 0.482
1.248GlnLys: 1.248 ± 0.482
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
1.248GlnAsn: 1.248 ± 0.526
1.872GlnPro: 1.872 ± 0.219
1.248GlnGln: 1.248 ± 0.022
1.872GlnArg: 1.872 ± 0.219
2.496GlnSer: 2.496 ± 0.548
1.872GlnThr: 1.872 ± 0.789
0.624GlnVal: 0.624 ± 0.241
0.0GlnTrp: 0.0 ± 0.0
3.12GlnTyr: 3.12 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
7.176ArgAla: 7.176 ± 1.655
2.808ArgCys: 2.808 ± 0.986
4.056ArgAsp: 4.056 ± 1.09
1.872ArgGlu: 1.872 ± 0.722
4.368ArgPhe: 4.368 ± 0.833
7.176ArgGly: 7.176 ± 2.394
1.872ArgHis: 1.872 ± 0.219
3.12ArgIle: 3.12 ± 0.307
4.368ArgLys: 4.368 ± 1.686
3.432ArgLeu: 3.432 ± 0.611
0.624ArgMet: 0.624 ± 0.241
1.872ArgAsn: 1.872 ± 0.219
3.12ArgPro: 3.12 ± 0.7
2.496ArgGln: 2.496 ± 0.46
5.616ArgArg: 5.616 ± 0.855
5.304ArgSer: 5.304 ± 0.461
3.12ArgThr: 3.12 ± 0.197
6.24ArgVal: 6.24 ± 0.614
0.624ArgTrp: 0.624 ± 0.241
2.496ArgTyr: 2.496 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
6.864SerAla: 6.864 ± 0.771
0.624SerCys: 0.624 ± 0.414
1.872SerAsp: 1.872 ± 0.722
0.0SerGlu: 0.0 ± 0.0
1.248SerPhe: 1.248 ± 0.482
1.872SerGly: 1.872 ± 0.285
1.248SerHis: 1.248 ± 0.482
4.368SerIle: 4.368 ± 0.329
1.248SerLys: 1.248 ± 0.482
3.12SerLeu: 3.12 ± 0.811
2.496SerMet: 2.496 ± 0.46
3.12SerAsn: 3.12 ± 0.811
0.624SerPro: 0.624 ± 0.241
2.496SerGln: 2.496 ± 0.46
3.744SerArg: 3.744 ± 0.813
4.056SerSer: 4.056 ± 1.09
3.744SerThr: 3.744 ± 0.067
3.744SerVal: 3.744 ± 1.445
2.184SerTrp: 2.184 ± 0.813
2.496SerTyr: 2.496 ± 1.052
0.0SerXaa: 0.0 ± 0.0
Thr
3.432ThrAla: 3.432 ± 0.282
1.248ThrCys: 1.248 ± 0.022
1.872ThrAsp: 1.872 ± 0.285
4.368ThrGlu: 4.368 ± 0.678
4.368ThrPhe: 4.368 ± 0.175
4.056ThrGly: 4.056 ± 0.395
1.248ThrHis: 1.248 ± 0.022
1.872ThrIle: 1.872 ± 0.219
3.744ThrLys: 3.744 ± 0.067
6.864ThrLeu: 6.864 ± 0.131
0.312ThrMet: 0.312 ± 0.207
2.496ThrAsn: 2.496 ± 0.548
4.992ThrPro: 4.992 ± 0.592
3.12ThrGln: 3.12 ± 0.307
6.864ThrArg: 6.864 ± 0.374
3.432ThrSer: 3.432 ± 0.226
6.864ThrThr: 6.864 ± 0.131
5.616ThrVal: 5.616 ± 0.855
1.872ThrTrp: 1.872 ± 0.285
2.496ThrTyr: 2.496 ± 1.052
0.0ThrXaa: 0.0 ± 0.0
Val
7.176ValAla: 7.176 ± 0.432
0.0ValCys: 0.0 ± 0.0
6.24ValAsp: 6.24 ± 0.393
2.808ValGlu: 2.808 ± 0.731
2.496ValPhe: 2.496 ± 0.963
5.928ValGly: 5.928 ± 1.178
1.248ValHis: 1.248 ± 0.482
4.056ValIle: 4.056 ± 0.277
3.12ValLys: 3.12 ± 1.204
6.864ValLeu: 6.864 ± 0.131
1.872ValMet: 1.872 ± 0.789
2.496ValAsn: 2.496 ± 0.045
4.368ValPro: 4.368 ± 1.337
0.624ValGln: 0.624 ± 0.241
4.992ValArg: 4.992 ± 0.089
4.992ValSer: 4.992 ± 0.089
8.112ValThr: 8.112 ± 0.899
4.368ValVal: 4.368 ± 0.175
0.624ValTrp: 0.624 ± 0.241
0.624ValTyr: 0.624 ± 0.241
0.0ValXaa: 0.0 ± 0.0
Trp
1.248TrpAla: 1.248 ± 0.482
0.0TrpCys: 0.0 ± 0.0
0.312TrpAsp: 0.312 ± 0.207
1.872TrpGlu: 1.872 ± 0.285
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.248TrpHis: 1.248 ± 0.526
0.0TrpIle: 0.0 ± 0.0
0.624TrpLys: 0.624 ± 0.263
1.872TrpLeu: 1.872 ± 0.219
0.0TrpMet: 0.0 ± 0.0
0.624TrpAsn: 0.624 ± 0.263
1.872TrpPro: 1.872 ± 0.789
0.0TrpGln: 0.0 ± 0.0
3.432TrpArg: 3.432 ± 0.282
0.624TrpSer: 0.624 ± 0.241
1.872TrpThr: 1.872 ± 0.285
1.248TrpVal: 1.248 ± 0.482
0.312TrpTrp: 0.312 ± 0.207
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.12TyrAla: 3.12 ± 0.197
0.0TyrCys: 0.0 ± 0.0
2.496TyrAsp: 2.496 ± 0.045
1.872TyrGlu: 1.872 ± 0.722
2.496TyrPhe: 2.496 ± 0.46
3.12TyrGly: 3.12 ± 0.307
1.248TyrHis: 1.248 ± 0.482
3.744TyrIle: 3.744 ± 0.438
1.248TyrLys: 1.248 ± 0.022
3.744TyrLeu: 3.744 ± 0.438
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
2.496TyrPro: 2.496 ± 0.045
2.496TyrGln: 2.496 ± 1.052
3.744TyrArg: 3.744 ± 0.941
0.624TyrSer: 0.624 ± 0.263
0.624TyrThr: 0.624 ± 0.263
4.368TyrVal: 4.368 ± 0.329
1.248TyrTrp: 1.248 ± 0.526
0.624TyrTyr: 0.624 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3206 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
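
One possible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The exact procedure used by Proteome-pI is not described on this page, so the function names, the use of the standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rep=100, seed=0):
    """Resample whole proteins with replacement n_rep times.

    Returns (mean, standard deviation) of the replicate frequencies.
    Assumption: the reported error is the replicate standard deviation.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rep):
        sample = [rng.choice(proteins) for _ in proteins]
        replicates.append(pair_permille(sample, pair))
    return statistics.mean(replicates), statistics.stdev(replicates)


# Hypothetical toy proteome of three short sequences (one-letter codes).
toy = ["MALAL", "AAK", "ALALA"]
mean, err = bootstrap_error(toy, "AL")
print(f"AL: {mean:.3f} ± {err:.3f}")
```

With only three proteins, as in this proteome, each replicate draws from very few distinct units, so error estimates of this kind should be read with caution.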

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski