Amino acid dipepetide frequency for Maize-associated pteridovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.915AlaAla: 5.915 ± 3.693
2.218AlaCys: 2.218 ± 0.372
3.327AlaAsp: 3.327 ± 0.157
4.806AlaGlu: 4.806 ± 0.821
2.218AlaPhe: 2.218 ± 1.489
2.957AlaGly: 2.957 ± 2.082
1.848AlaHis: 1.848 ± 0.52
4.067AlaIle: 4.067 ± 1.432
2.957AlaLys: 2.957 ± 1.466
8.503AlaLeu: 8.503 ± 2.195
2.218AlaMet: 2.218 ± 0.453
1.109AlaAsn: 1.109 ± 1.093
2.588AlaPro: 2.588 ± 0.813
2.218AlaGln: 2.218 ± 0.669
4.067AlaArg: 4.067 ± 2.083
7.394AlaSer: 7.394 ± 3.071
2.588AlaThr: 2.588 ± 0.42
5.915AlaVal: 5.915 ± 1.23
0.739AlaTrp: 0.739 ± 0.503
1.479AlaTyr: 1.479 ± 0.561
0.0AlaXaa: 0.0 ± 0.0
Cys
0.739CysAla: 0.739 ± 0.473
0.739CysCys: 0.739 ± 0.355
1.479CysAsp: 1.479 ± 0.71
0.739CysGlu: 0.739 ± 0.481
1.479CysPhe: 1.479 ± 0.71
2.218CysGly: 2.218 ± 0.861
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.848CysLeu: 1.848 ± 1.1
1.848CysMet: 1.848 ± 0.546
0.739CysAsn: 0.739 ± 0.355
1.109CysPro: 1.109 ± 0.532
0.739CysGln: 0.739 ± 0.355
0.739CysArg: 0.739 ± 0.355
0.739CysSer: 0.739 ± 0.481
1.848CysThr: 1.848 ± 0.579
2.588CysVal: 2.588 ± 0.871
0.0CysTrp: 0.0 ± 0.0
1.479CysTyr: 1.479 ± 0.71
0.0CysXaa: 0.0 ± 0.0
Asp
4.067AspAla: 4.067 ± 1.965
1.848AspCys: 1.848 ± 0.887
4.806AspAsp: 4.806 ± 1.062
2.957AspGlu: 2.957 ± 0.937
3.327AspPhe: 3.327 ± 1.124
4.436AspGly: 4.436 ± 0.842
2.218AspHis: 2.218 ± 0.533
3.327AspIle: 3.327 ± 0.589
4.067AspLys: 4.067 ± 0.731
6.654AspLeu: 6.654 ± 0.597
1.479AspMet: 1.479 ± 0.71
1.848AspAsn: 1.848 ± 0.89
1.848AspPro: 1.848 ± 0.52
1.479AspGln: 1.479 ± 0.476
2.218AspArg: 2.218 ± 0.669
5.545AspSer: 5.545 ± 0.964
1.848AspThr: 1.848 ± 0.52
6.654AspVal: 6.654 ± 0.54
0.739AspTrp: 0.739 ± 0.481
1.109AspTyr: 1.109 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
1.479GluAla: 1.479 ± 0.71
1.109GluCys: 1.109 ± 0.431
3.327GluAsp: 3.327 ± 1.277
2.218GluGlu: 2.218 ± 0.669
3.697GluPhe: 3.697 ± 1.372
2.588GluGly: 2.588 ± 1.889
2.218GluHis: 2.218 ± 0.932
3.327GluIle: 3.327 ± 0.712
2.957GluLys: 2.957 ± 0.964
3.327GluLeu: 3.327 ± 0.762
1.479GluMet: 1.479 ± 0.441
2.957GluAsn: 2.957 ± 0.49
3.327GluPro: 3.327 ± 0.894
1.479GluGln: 1.479 ± 0.476
2.588GluArg: 2.588 ± 1.242
4.436GluSer: 4.436 ± 1.563
3.327GluThr: 3.327 ± 0.589
4.436GluVal: 4.436 ± 0.84
1.109GluTrp: 1.109 ± 1.093
4.806GluTyr: 4.806 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
2.218PheAla: 2.218 ± 1.284
0.739PheCys: 0.739 ± 0.355
1.848PheAsp: 1.848 ± 0.579
3.697PheGlu: 3.697 ± 1.091
0.739PhePhe: 0.739 ± 0.481
2.218PheGly: 2.218 ± 0.372
2.218PheHis: 2.218 ± 1.065
2.588PheIle: 2.588 ± 0.848
2.218PheLys: 2.218 ± 0.669
2.588PheLeu: 2.588 ± 0.42
1.109PheMet: 1.109 ± 0.532
2.957PheAsn: 2.957 ± 0.815
3.697PhePro: 3.697 ± 1.171
0.739PheGln: 0.739 ± 0.355
2.218PheArg: 2.218 ± 0.669
5.915PheSer: 5.915 ± 1.13
4.067PheThr: 4.067 ± 1.465
2.957PheVal: 2.957 ± 0.926
0.37PheTrp: 0.37 ± 0.177
0.739PheTyr: 0.739 ± 0.473
0.0PheXaa: 0.0 ± 0.0
Gly
4.067GlyAla: 4.067 ± 1.385
1.479GlyCys: 1.479 ± 0.441
3.697GlyAsp: 3.697 ± 1.864
3.697GlyGlu: 3.697 ± 0.133
2.588GlyPhe: 2.588 ± 0.544
3.697GlyGly: 3.697 ± 1.781
0.739GlyHis: 0.739 ± 0.355
1.848GlyIle: 1.848 ± 1.049
5.176GlyLys: 5.176 ± 1.511
3.327GlyLeu: 3.327 ± 0.674
2.218GlyMet: 2.218 ± 0.64
2.218GlyAsn: 2.218 ± 1.489
1.479GlyPro: 1.479 ± 0.561
1.848GlyGln: 1.848 ± 1.438
3.697GlyArg: 3.697 ± 1.171
4.436GlySer: 4.436 ± 2.43
4.067GlyThr: 4.067 ± 0.272
5.176GlyVal: 5.176 ± 1.125
0.739GlyTrp: 0.739 ± 0.481
1.109GlyTyr: 1.109 ± 1.339
0.0GlyXaa: 0.0 ± 0.0
His
1.109HisAla: 1.109 ± 0.456
0.37HisCys: 0.37 ± 0.177
1.479HisAsp: 1.479 ± 0.458
1.479HisGlu: 1.479 ± 0.458
0.37HisPhe: 0.37 ± 0.177
1.848HisGly: 1.848 ± 0.887
0.37HisHis: 0.37 ± 0.57
0.37HisIle: 0.37 ± 0.177
1.848HisLys: 1.848 ± 0.579
3.697HisLeu: 3.697 ± 1.041
1.109HisMet: 1.109 ± 0.532
1.109HisAsn: 1.109 ± 0.532
0.37HisPro: 0.37 ± 0.177
0.739HisGln: 0.739 ± 0.355
0.739HisArg: 0.739 ± 0.355
2.218HisSer: 2.218 ± 0.851
1.109HisThr: 1.109 ± 0.456
2.218HisVal: 2.218 ± 0.42
0.37HisTrp: 0.37 ± 0.57
1.109HisTyr: 1.109 ± 0.532
0.0HisXaa: 0.0 ± 0.0
Ile
4.436IleAla: 4.436 ± 0.439
0.739IleCys: 0.739 ± 0.355
4.806IleAsp: 4.806 ± 1.568
3.327IleGlu: 3.327 ± 1.098
2.588IlePhe: 2.588 ± 0.381
2.588IleGly: 2.588 ± 0.477
0.37IleHis: 0.37 ± 0.177
1.848IleIle: 1.848 ± 0.489
2.218IleLys: 2.218 ± 0.42
2.957IleLeu: 2.957 ± 0.964
1.109IleMet: 1.109 ± 0.532
1.109IleAsn: 1.109 ± 0.532
4.067IlePro: 4.067 ± 1.572
1.848IleGln: 1.848 ± 0.555
3.697IleArg: 3.697 ± 1.296
5.176IleSer: 5.176 ± 2.189
2.957IleThr: 2.957 ± 0.728
2.588IleVal: 2.588 ± 0.813
0.37IleTrp: 0.37 ± 0.57
1.479IleTyr: 1.479 ± 0.71
0.0IleXaa: 0.0 ± 0.0
Lys
5.545LysAla: 5.545 ± 2.253
1.109LysCys: 1.109 ± 0.532
4.067LysAsp: 4.067 ± 1.952
2.957LysGlu: 2.957 ± 0.815
1.479LysPhe: 1.479 ± 0.71
5.176LysGly: 5.176 ± 3.439
1.848LysHis: 1.848 ± 0.887
2.218LysIle: 2.218 ± 0.912
3.697LysLys: 3.697 ± 1.209
3.697LysLeu: 3.697 ± 1.365
0.739LysMet: 0.739 ± 0.355
1.109LysAsn: 1.109 ± 0.745
3.327LysPro: 3.327 ± 0.589
1.848LysGln: 1.848 ± 0.52
2.218LysArg: 2.218 ± 1.284
6.654LysSer: 6.654 ± 0.807
3.327LysThr: 3.327 ± 1.018
4.806LysVal: 4.806 ± 1.179
1.479LysTrp: 1.479 ± 0.476
1.479LysTyr: 1.479 ± 0.71
0.0LysXaa: 0.0 ± 0.0
Leu
4.806LeuAla: 4.806 ± 1.082
1.848LeuCys: 1.848 ± 0.887
7.024LeuAsp: 7.024 ± 1.538
5.915LeuGlu: 5.915 ± 1.13
2.588LeuPhe: 2.588 ± 0.871
5.176LeuGly: 5.176 ± 0.836
1.109LeuHis: 1.109 ± 0.532
3.697LeuIle: 3.697 ± 1.468
6.654LeuLys: 6.654 ± 0.85
6.654LeuLeu: 6.654 ± 2.248
4.067LeuMet: 4.067 ± 1.765
4.067LeuAsn: 4.067 ± 0.895
2.957LeuPro: 2.957 ± 0.767
2.588LeuGln: 2.588 ± 0.782
5.915LeuArg: 5.915 ± 1.937
7.024LeuSer: 7.024 ± 1.428
5.915LeuThr: 5.915 ± 0.741
5.176LeuVal: 5.176 ± 0.462
0.739LeuTrp: 0.739 ± 0.481
2.957LeuTyr: 2.957 ± 0.964
0.0LeuXaa: 0.0 ± 0.0
Met
2.588MetAla: 2.588 ± 0.856
0.37MetCys: 0.37 ± 0.177
1.848MetAsp: 1.848 ± 0.887
1.109MetGlu: 1.109 ± 0.426
2.588MetPhe: 2.588 ± 0.811
1.479MetGly: 1.479 ± 0.561
0.37MetHis: 0.37 ± 0.177
0.739MetIle: 0.739 ± 0.355
2.218MetLys: 2.218 ± 0.64
3.327MetLeu: 3.327 ± 1.13
1.109MetMet: 1.109 ± 0.532
1.479MetAsn: 1.479 ± 0.71
0.37MetPro: 0.37 ± 0.177
0.37MetGln: 0.37 ± 0.177
0.739MetArg: 0.739 ± 0.473
1.848MetSer: 1.848 ± 1.794
2.957MetThr: 2.957 ± 0.49
3.327MetVal: 3.327 ± 0.157
0.739MetTrp: 0.739 ± 0.355
0.739MetTyr: 0.739 ± 0.355
0.0MetXaa: 0.0 ± 0.0
Asn
1.848AsnAla: 1.848 ± 0.489
0.37AsnCys: 0.37 ± 0.177
1.479AsnAsp: 1.479 ± 0.604
1.109AsnGlu: 1.109 ± 0.532
2.588AsnPhe: 2.588 ± 0.782
3.697AsnGly: 3.697 ± 0.606
1.109AsnHis: 1.109 ± 0.532
1.848AsnIle: 1.848 ± 0.555
2.218AsnLys: 2.218 ± 0.672
2.957AsnLeu: 2.957 ± 0.49
1.848AsnMet: 1.848 ± 0.476
1.848AsnAsn: 1.848 ± 0.887
2.588AsnPro: 2.588 ± 0.42
0.739AsnGln: 0.739 ± 1.201
1.479AsnArg: 1.479 ± 0.441
2.588AsnSer: 2.588 ± 1.366
2.588AsnThr: 2.588 ± 0.896
3.697AsnVal: 3.697 ± 1.296
0.37AsnTrp: 0.37 ± 0.177
0.37AsnTyr: 0.37 ± 0.57
0.0AsnXaa: 0.0 ± 0.0
Pro
3.697ProAla: 3.697 ± 1.931
0.37ProCys: 0.37 ± 0.6
2.218ProAsp: 2.218 ± 1.065
2.218ProGlu: 2.218 ± 0.932
1.109ProPhe: 1.109 ± 0.426
3.327ProGly: 3.327 ± 0.712
1.109ProHis: 1.109 ± 0.426
4.067ProIle: 4.067 ± 0.727
0.739ProLys: 0.739 ± 0.481
4.806ProLeu: 4.806 ± 1.988
1.109ProMet: 1.109 ± 0.792
2.218ProAsn: 2.218 ± 0.533
1.109ProPro: 1.109 ± 1.033
1.848ProGln: 1.848 ± 0.887
2.957ProArg: 2.957 ± 0.728
2.588ProSer: 2.588 ± 1.242
3.327ProThr: 3.327 ± 1.582
2.588ProVal: 2.588 ± 0.915
0.37ProTrp: 0.37 ± 0.177
1.479ProTyr: 1.479 ± 0.671
0.0ProXaa: 0.0 ± 0.0
Gln
2.218GlnAla: 2.218 ± 0.672
0.37GlnCys: 0.37 ± 0.177
1.479GlnAsp: 1.479 ± 0.71
2.218GlnGlu: 2.218 ± 0.651
0.37GlnPhe: 0.37 ± 0.177
1.479GlnGly: 1.479 ± 0.441
0.0GlnHis: 0.0 ± 0.0
1.479GlnIle: 1.479 ± 0.71
0.37GlnLys: 0.37 ± 0.6
2.218GlnLeu: 2.218 ± 1.065
1.109GlnMet: 1.109 ± 0.431
1.848GlnAsn: 1.848 ± 0.555
1.109GlnPro: 1.109 ± 0.426
0.739GlnGln: 0.739 ± 0.473
3.327GlnArg: 3.327 ± 1.368
3.697GlnSer: 3.697 ± 1.159
1.479GlnThr: 1.479 ± 0.71
2.957GlnVal: 2.957 ± 1.123
0.37GlnTrp: 0.37 ± 0.177
1.109GlnTyr: 1.109 ± 0.532
0.0GlnXaa: 0.0 ± 0.0
Arg
3.697ArgAla: 3.697 ± 0.745
1.848ArgCys: 1.848 ± 0.887
3.697ArgAsp: 3.697 ± 0.858
3.327ArgGlu: 3.327 ± 1.582
2.218ArgPhe: 2.218 ± 1.065
2.218ArgGly: 2.218 ± 0.669
1.848ArgHis: 1.848 ± 0.887
2.218ArgIle: 2.218 ± 0.932
4.806ArgLys: 4.806 ± 0.821
7.024ArgLeu: 7.024 ± 1.229
1.109ArgMet: 1.109 ± 0.532
2.588ArgAsn: 2.588 ± 0.848
0.739ArgPro: 0.739 ± 0.473
2.218ArgGln: 2.218 ± 0.669
3.327ArgArg: 3.327 ± 0.589
3.327ArgSer: 3.327 ± 0.992
2.957ArgThr: 2.957 ± 1.42
3.327ArgVal: 3.327 ± 1.475
0.37ArgTrp: 0.37 ± 0.177
2.957ArgTyr: 2.957 ± 0.917
0.0ArgXaa: 0.0 ± 0.0
Ser
5.176SerAla: 5.176 ± 1.575
1.109SerCys: 1.109 ± 0.456
4.806SerAsp: 4.806 ± 0.613
4.067SerGlu: 4.067 ± 0.936
5.176SerPhe: 5.176 ± 1.268
2.218SerGly: 2.218 ± 1.047
1.479SerHis: 1.479 ± 1.005
5.915SerIle: 5.915 ± 2.075
7.024SerLys: 7.024 ± 1.235
9.242SerLeu: 9.242 ± 1.377
1.848SerMet: 1.848 ± 0.489
0.37SerAsn: 0.37 ± 0.6
3.697SerPro: 3.697 ± 2.088
2.588SerGln: 2.588 ± 0.955
4.806SerArg: 4.806 ± 0.32
8.872SerSer: 8.872 ± 1.792
2.588SerThr: 2.588 ± 0.848
10.721SerVal: 10.721 ± 0.672
0.739SerTrp: 0.739 ± 0.355
2.957SerTyr: 2.957 ± 0.937
0.0SerXaa: 0.0 ± 0.0
Thr
3.697ThrAla: 3.697 ± 0.979
1.479ThrCys: 1.479 ± 0.458
2.588ThrAsp: 2.588 ± 0.811
2.957ThrGlu: 2.957 ± 2.011
4.067ThrPhe: 4.067 ± 0.731
4.067ThrGly: 4.067 ± 0.731
2.218ThrHis: 2.218 ± 1.065
4.067ThrIle: 4.067 ± 1.153
2.218ThrLys: 2.218 ± 0.42
4.067ThrLeu: 4.067 ± 0.895
0.37ThrMet: 0.37 ± 0.177
2.588ThrAsn: 2.588 ± 0.856
4.067ThrPro: 4.067 ± 0.727
1.109ThrGln: 1.109 ± 0.431
3.697ThrArg: 3.697 ± 0.683
3.697ThrSer: 3.697 ± 0.858
3.327ThrThr: 3.327 ± 0.712
4.067ThrVal: 4.067 ± 0.895
0.37ThrTrp: 0.37 ± 0.177
2.957ThrTyr: 2.957 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
9.242ValAla: 9.242 ± 3.304
1.848ValCys: 1.848 ± 1.064
6.285ValAsp: 6.285 ± 1.044
3.697ValGlu: 3.697 ± 1.372
3.697ValPhe: 3.697 ± 0.745
4.067ValGly: 4.067 ± 0.936
2.588ValHis: 2.588 ± 0.42
3.327ValIle: 3.327 ± 0.589
5.915ValLys: 5.915 ± 0.979
5.545ValLeu: 5.545 ± 0.913
2.957ValMet: 2.957 ± 0.728
3.697ValAsn: 3.697 ± 1.296
4.436ValPro: 4.436 ± 1.563
2.588ValGln: 2.588 ± 0.42
4.806ValArg: 4.806 ± 1.227
5.915ValSer: 5.915 ± 1.179
4.436ValThr: 4.436 ± 1.059
6.654ValVal: 6.654 ± 2.265
0.739ValTrp: 0.739 ± 0.473
1.479ValTyr: 1.479 ± 0.961
0.0ValXaa: 0.0 ± 0.0
Trp
0.739TrpAla: 0.739 ± 0.355
0.0TrpCys: 0.0 ± 0.0
0.739TrpAsp: 0.739 ± 0.481
0.37TrpGlu: 0.37 ± 0.177
1.109TrpPhe: 1.109 ± 0.431
0.739TrpGly: 0.739 ± 0.473
0.0TrpHis: 0.0 ± 0.0
1.109TrpIle: 1.109 ± 0.745
0.739TrpLys: 0.739 ± 0.355
1.479TrpLeu: 1.479 ± 0.476
0.739TrpMet: 0.739 ± 0.503
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.37TrpGln: 0.37 ± 0.177
0.37TrpArg: 0.37 ± 0.177
1.109TrpSer: 1.109 ± 0.426
0.739TrpThr: 0.739 ± 0.503
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.37TrpTyr: 0.37 ± 0.57
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.218TyrAla: 2.218 ± 0.669
1.109TyrCys: 1.109 ± 0.431
1.479TyrAsp: 1.479 ± 0.71
3.327TyrGlu: 3.327 ± 0.992
1.848TyrPhe: 1.848 ± 0.52
1.109TyrGly: 1.109 ± 0.431
0.37TyrHis: 0.37 ± 0.177
2.218TyrIle: 2.218 ± 0.669
0.739TyrLys: 0.739 ± 0.503
2.957TyrLeu: 2.957 ± 0.917
0.37TyrMet: 0.37 ± 0.177
1.479TyrAsn: 1.479 ± 0.441
0.37TyrPro: 0.37 ± 0.177
1.848TyrGln: 1.848 ± 0.489
2.218TyrArg: 2.218 ± 0.64
2.218TyrSer: 2.218 ± 0.64
2.218TyrThr: 2.218 ± 1.592
4.067TyrVal: 4.067 ± 0.566
0.0TyrTrp: 0.0 ± 0.0
1.479TyrTyr: 1.479 ± 0.604
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2706 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski