Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
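
The calculation described above can be sketched in Python as follows. This is only an illustration, not the database's own code: the function names, the use of one-letter codes, and the choice to count overlapping pairs within each protein separately (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped
# to "X" (Xaa). This mapping is an assumption made for the sketch.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2, counted per protein.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille):
    """Compare a frequency with the uniform expectation (red/blue in the table)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"
```

With the table values, `representation(5.545)` (AlaAla) gives "over" and `representation(0.616)` (AlaHis) gives "under" relative to the uniform 2.5‰.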

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
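
As a worked conversion, here is the AlaAla value from the table next to the uniform expectation stated earlier (the notation is only illustrative and assumes `amsmath` for `\text`):

```latex
\[
5.545\,\text{\textperthousand} = 5.545 \times 10^{-3} = 0.5545\,\%,
\qquad
\frac{1}{400} = 0.25\,\% = 2.5\,\text{\textperthousand}.
\]
```

On this scale, a table value above 2.5 is more frequent than the uniform expectation and a value below 2.5 is less frequent.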

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.545 ± 4.535
AlaCys: 0.0 ± 0.0
AlaAsp: 3.697 ± 1.146
AlaGlu: 3.697 ± 1.776
AlaPhe: 4.313 ± 1.861
AlaGly: 4.313 ± 2.54
AlaHis: 0.616 ± 0.451
AlaIle: 4.929 ± 3.022
AlaLys: 4.313 ± 1.814
AlaLeu: 6.161 ± 4.013
AlaMet: 0.616 ± 0.451
AlaAsn: 6.778 ± 1.41
AlaPro: 0.616 ± 0.571
AlaGln: 3.081 ± 1.624
AlaArg: 1.232 ± 1.056
AlaSer: 4.313 ± 1.885
AlaThr: 3.081 ± 2.149
AlaVal: 4.313 ± 1.162
AlaTrp: 0.616 ± 0.571
AlaTyr: 1.848 ± 1.141
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.616 ± 0.451
CysCys: 0.0 ± 0.0
CysAsp: 0.616 ± 0.451
CysGlu: 1.232 ± 0.959
CysPhe: 0.616 ± 0.571
CysGly: 1.848 ± 1.598
CysHis: 1.232 ± 0.902
CysIle: 1.848 ± 1.31
CysLys: 0.0 ± 0.0
CysLeu: 0.616 ± 0.451
CysMet: 1.232 ± 1.143
CysAsn: 0.0 ± 0.0
CysPro: 1.232 ± 0.587
CysGln: 1.232 ± 1.065
CysArg: 0.0 ± 0.0
CysSer: 0.616 ± 0.571
CysThr: 0.616 ± 0.571
CysVal: 1.232 ± 0.959
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.081 ± 1.074
AspCys: 1.232 ± 0.902
AspAsp: 2.465 ± 1.265
AspGlu: 1.848 ± 0.377
AspPhe: 0.0 ± 0.0
AspGly: 2.465 ± 0.48
AspHis: 1.232 ± 0.902
AspIle: 3.697 ± 0.754
AspLys: 4.313 ± 1.91
AspLeu: 4.313 ± 1.138
AspMet: 1.848 ± 0.707
AspAsn: 4.313 ± 1.205
AspPro: 3.081 ± 2.083
AspGln: 1.848 ± 0.377
AspArg: 1.232 ± 0.902
AspSer: 3.697 ± 1.992
AspThr: 2.465 ± 0.843
AspVal: 4.929 ± 2.531
AspTrp: 0.616 ± 0.6
AspTyr: 7.394 ± 2.104
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.848 ± 1.147
GluCys: 0.616 ± 0.571
GluAsp: 1.848 ± 1.102
GluGlu: 2.465 ± 2.216
GluPhe: 3.081 ± 0.783
GluGly: 1.232 ± 1.108
GluHis: 1.232 ± 0.902
GluIle: 2.465 ± 1.547
GluLys: 2.465 ± 1.629
GluLeu: 6.161 ± 1.41
GluMet: 0.616 ± 0.837
GluAsn: 5.545 ± 2.182
GluPro: 1.232 ± 0.476
GluGln: 3.081 ± 1.591
GluArg: 3.081 ± 1.74
GluSer: 7.394 ± 2.876
GluThr: 2.465 ± 0.957
GluVal: 1.232 ± 0.476
GluTrp: 0.616 ± 0.571
GluTyr: 3.081 ± 1.506
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.848 ± 1.353
PheCys: 0.0 ± 0.0
PheAsp: 3.081 ± 1.625
PheGlu: 3.697 ± 2.984
PhePhe: 2.465 ± 1.265
PheGly: 3.081 ± 0.783
PheHis: 0.0 ± 0.0
PheIle: 3.697 ± 1.521
PheLys: 3.081 ± 0.783
PheLeu: 3.697 ± 1.416
PheMet: 0.616 ± 0.451
PheAsn: 1.232 ± 0.587
PhePro: 0.0 ± 0.0
PheGln: 0.616 ± 0.6
PheArg: 3.081 ± 1.624
PheSer: 3.697 ± 2.707
PheThr: 4.313 ± 3.158
PheVal: 2.465 ± 1.446
PheTrp: 0.0 ± 0.0
PheTyr: 1.232 ± 0.587
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.848 ± 0.985
GlyCys: 0.0 ± 0.0
GlyAsp: 4.313 ± 1.28
GlyGlu: 2.465 ± 1.56
GlyPhe: 3.081 ± 0.901
GlyGly: 2.465 ± 1.56
GlyHis: 2.465 ± 0.806
GlyIle: 1.848 ± 0.985
GlyLys: 3.697 ± 1.964
GlyLeu: 2.465 ± 1.701
GlyMet: 0.0 ± 0.0
GlyAsn: 2.465 ± 1.941
GlyPro: 1.232 ± 0.587
GlyGln: 0.616 ± 0.6
GlyArg: 3.697 ± 0.961
GlySer: 5.545 ± 2.946
GlyThr: 2.465 ± 0.884
GlyVal: 1.232 ± 0.902
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.848 ± 0.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.232 ± 1.143
HisAsp: 1.848 ± 1.067
HisGlu: 1.848 ± 0.877
HisPhe: 1.848 ± 1.353
HisGly: 1.232 ± 1.143
HisHis: 0.0 ± 0.0
HisIle: 4.929 ± 2.348
HisLys: 0.0 ± 0.0
HisLeu: 0.616 ± 0.973
HisMet: 0.0 ± 0.0
HisAsn: 1.848 ± 1.067
HisPro: 1.232 ± 0.476
HisGln: 0.0 ± 0.0
HisArg: 3.697 ± 1.416
HisSer: 1.848 ± 1.681
HisThr: 0.616 ± 0.451
HisVal: 0.0 ± 0.0
HisTrp: 0.616 ± 0.571
HisTyr: 2.465 ± 1.256
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.778 ± 3.671
IleCys: 0.0 ± 0.0
IleAsp: 8.01 ± 1.931
IleGlu: 2.465 ± 1.087
IlePhe: 3.081 ± 0.851
IleGly: 3.081 ± 0.641
IleHis: 1.232 ± 1.065
IleIle: 4.929 ± 0.618
IleLys: 1.848 ± 1.709
IleLeu: 4.929 ± 1.864
IleMet: 3.081 ± 0.641
IleAsn: 4.313 ± 2.427
IlePro: 2.465 ± 1.5
IleGln: 3.697 ± 0.754
IleArg: 4.929 ± 2.348
IleSer: 4.313 ± 0.817
IleThr: 4.313 ± 2.323
IleVal: 1.848 ± 1.61
IleTrp: 0.0 ± 0.0
IleTyr: 3.697 ± 1.637
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.081 ± 2.783
LysCys: 0.616 ± 0.988
LysAsp: 1.232 ± 0.587
LysGlu: 3.081 ± 2.489
LysPhe: 3.081 ± 1.506
LysGly: 3.697 ± 2.295
LysHis: 1.848 ± 1.067
LysIle: 3.697 ± 2.508
LysLys: 5.545 ± 2.263
LysLeu: 6.161 ± 1.671
LysMet: 1.232 ± 1.085
LysAsn: 2.465 ± 1.56
LysPro: 3.081 ± 1.74
LysGln: 4.313 ± 1.116
LysArg: 3.081 ± 1.635
LysSer: 3.081 ± 0.783
LysThr: 3.081 ± 1.423
LysVal: 4.313 ± 0.85
LysTrp: 0.0 ± 0.0
LysTyr: 3.697 ± 1.471
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.545 ± 1.238
LeuCys: 0.616 ± 0.973
LeuAsp: 3.697 ± 1.964
LeuGlu: 3.697 ± 2.142
LeuPhe: 2.465 ± 1.51
LeuGly: 1.848 ± 0.707
LeuHis: 3.081 ± 2.165
LeuIle: 7.394 ± 3.624
LeuLys: 3.697 ± 0.815
LeuLeu: 6.161 ± 1.203
LeuMet: 3.081 ± 0.824
LeuAsn: 6.161 ± 1.599
LeuPro: 2.465 ± 1.804
LeuGln: 3.697 ± 1.141
LeuArg: 2.465 ± 1.087
LeuSer: 6.778 ± 1.032
LeuThr: 4.929 ± 1.039
LeuVal: 1.848 ± 1.164
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.929 ± 2.562
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.848 ± 1.61
MetCys: 1.232 ± 0.587
MetAsp: 1.232 ± 1.056
MetGlu: 0.616 ± 0.571
MetPhe: 1.848 ± 0.877
MetGly: 0.616 ± 0.451
MetHis: 0.0 ± 0.0
MetIle: 0.616 ± 0.973
MetLys: 2.465 ± 1.871
MetLeu: 1.848 ± 1.102
MetMet: 0.616 ± 0.571
MetAsn: 1.848 ± 0.707
MetPro: 1.848 ± 1.714
MetGln: 0.616 ± 0.451
MetArg: 1.232 ± 0.587
MetSer: 3.081 ± 1.118
MetThr: 1.848 ± 0.877
MetVal: 0.616 ± 0.571
MetTrp: 0.0 ± 0.0
MetTyr: 0.616 ± 0.571
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.313 ± 1.566
AsnCys: 3.697 ± 2.282
AsnAsp: 2.465 ± 1.62
AsnGlu: 6.161 ± 2.12
AsnPhe: 0.616 ± 0.571
AsnGly: 2.465 ± 1.174
AsnHis: 1.848 ± 1.714
AsnIle: 3.697 ± 1.275
AsnLys: 6.161 ± 0.877
AsnLeu: 4.313 ± 1.749
AsnMet: 2.465 ± 0.813
AsnAsn: 6.778 ± 0.984
AsnPro: 6.778 ± 0.892
AsnGln: 3.697 ± 1.414
AsnArg: 4.313 ± 1.566
AsnSer: 6.161 ± 2.801
AsnThr: 6.778 ± 1.56
AsnVal: 1.232 ± 0.476
AsnTrp: 0.0 ± 0.0
AsnTyr: 6.778 ± 2.803
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.848 ± 0.707
ProCys: 0.616 ± 0.571
ProAsp: 3.081 ± 1.493
ProGlu: 1.848 ± 1.147
ProPhe: 1.848 ± 1.353
ProGly: 1.848 ± 1.095
ProHis: 1.848 ± 1.067
ProIle: 1.232 ± 0.587
ProLys: 0.0 ± 0.0
ProLeu: 5.545 ± 1.166
ProMet: 1.232 ± 0.902
ProAsn: 4.313 ± 1.767
ProPro: 0.0 ± 0.0
ProGln: 0.0 ± 0.0
ProArg: 1.232 ± 0.587
ProSer: 6.161 ± 1.046
ProThr: 0.616 ± 1.05
ProVal: 1.232 ± 0.902
ProTrp: 0.616 ± 0.6
ProTyr: 4.313 ± 1.91
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.465 ± 1.56
GlnCys: 0.0 ± 0.0
GlnAsp: 1.848 ± 0.985
GlnGlu: 1.232 ± 1.201
GlnPhe: 3.081 ± 1.039
GlnGly: 1.232 ± 0.476
GlnHis: 0.616 ± 0.973
GlnIle: 3.697 ± 1.146
GlnLys: 3.081 ± 1.378
GlnLeu: 3.081 ± 1.426
GlnMet: 0.616 ± 0.571
GlnAsn: 4.313 ± 0.799
GlnPro: 0.616 ± 0.451
GlnGln: 1.232 ± 0.959
GlnArg: 4.929 ± 1.025
GlnSer: 3.697 ± 0.754
GlnThr: 2.465 ± 1.464
GlnVal: 1.848 ± 0.707
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.081 ± 1.213
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.697 ± 2.624
ArgCys: 0.616 ± 0.571
ArgAsp: 1.232 ± 0.902
ArgGlu: 2.465 ± 1.256
ArgPhe: 2.465 ± 1.446
ArgGly: 1.848 ± 0.877
ArgHis: 1.848 ± 1.164
ArgIle: 2.465 ± 2.083
ArgLys: 4.929 ± 2.44
ArgLeu: 1.848 ± 1.439
ArgMet: 1.848 ± 1.067
ArgAsn: 6.778 ± 1.493
ArgPro: 2.465 ± 1.26
ArgGln: 3.697 ± 1.97
ArgArg: 2.465 ± 0.843
ArgSer: 1.232 ± 1.056
ArgThr: 3.697 ± 1.275
ArgVal: 1.232 ± 0.902
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.081 ± 1.118
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.01 ± 2.195
SerCys: 1.848 ± 0.856
SerAsp: 6.161 ± 2.428
SerGlu: 4.313 ± 0.85
SerPhe: 1.848 ± 1.238
SerGly: 4.929 ± 1.167
SerHis: 2.465 ± 1.608
SerIle: 7.394 ± 1.509
SerLys: 3.081 ± 1.118
SerLeu: 8.01 ± 4.318
SerMet: 1.848 ± 2.057
SerAsn: 3.697 ± 1.429
SerPro: 2.465 ± 1.5
SerGln: 3.697 ± 1.17
SerArg: 1.848 ± 0.377
SerSer: 7.394 ± 2.694
SerThr: 6.778 ± 1.778
SerVal: 5.545 ± 1.635
SerTrp: 0.616 ± 0.451
SerTyr: 4.929 ± 1.08
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.697 ± 2.096
ThrCys: 0.0 ± 0.0
ThrAsp: 2.465 ± 1.51
ThrGlu: 2.465 ± 1.464
ThrPhe: 2.465 ± 1.804
ThrGly: 1.848 ± 0.707
ThrHis: 0.616 ± 0.571
ThrIle: 3.697 ± 2.017
ThrLys: 5.545 ± 2.044
ThrLeu: 4.313 ± 1.992
ThrMet: 0.616 ± 0.571
ThrAsn: 8.626 ± 3.633
ThrPro: 4.313 ± 0.913
ThrGln: 2.465 ± 0.952
ThrArg: 2.465 ± 1.751
ThrSer: 4.929 ± 2.157
ThrThr: 4.313 ± 3.061
ThrVal: 2.465 ± 1.174
ThrTrp: 1.232 ± 0.902
ThrTyr: 3.697 ± 1.275
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.081 ± 1.573
ValCys: 0.616 ± 0.571
ValAsp: 2.465 ± 1.376
ValGlu: 1.232 ± 0.902
ValPhe: 1.848 ± 1.164
ValGly: 1.848 ± 1.238
ValHis: 1.232 ± 0.902
ValIle: 1.232 ± 0.587
ValLys: 2.465 ± 1.358
ValLeu: 1.848 ± 0.877
ValMet: 0.616 ± 0.892
ValAsn: 2.465 ± 1.087
ValPro: 3.697 ± 2.707
ValGln: 1.848 ± 0.377
ValArg: 2.465 ± 1.446
ValSer: 6.778 ± 2.121
ValThr: 4.313 ± 1.712
ValVal: 3.081 ± 1.504
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.616 ± 0.451
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.616 ± 0.451
TrpPhe: 0.616 ± 0.6
TrpGly: 0.0 ± 0.0
TrpHis: 0.616 ± 0.571
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.616 ± 0.571
TrpMet: 0.0 ± 0.0
TrpAsn: 0.616 ± 0.571
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.616 ± 0.6
TrpSer: 0.616 ± 0.571
TrpThr: 0.616 ± 0.571
TrpVal: 0.616 ± 0.571
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.616 ± 0.451
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.313 ± 2.174
TyrCys: 1.848 ± 1.405
TyrAsp: 4.313 ± 1.065
TyrGlu: 4.313 ± 2.382
TyrPhe: 1.232 ± 0.902
TyrGly: 1.848 ± 0.877
TyrHis: 1.848 ± 1.141
TyrIle: 5.545 ± 2.325
TyrLys: 4.313 ± 1.492
TyrLeu: 1.848 ± 1.141
TyrMet: 1.848 ± 1.357
TyrAsn: 6.161 ± 1.929
TyrPro: 0.616 ± 0.451
TyrGln: 3.697 ± 1.146
TyrArg: 1.848 ± 0.949
TyrSer: 5.545 ± 1.687
TyrThr: 2.465 ± 1.256
TyrVal: 1.848 ± 1.353
TyrTrp: 1.848 ± 1.714
TyrTyr: 3.697 ± 2.999
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1624 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
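
A protein-level bootstrap of this kind might look like the sketch below. It is not the database's implementation: the function names are hypothetical, and both the pair-counting rule (overlapping pairs within each protein) and the reading of the ± value as a standard deviation across resamples are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per-mille frequency of each overlapping residue pair (assumed counting rule)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement, recomputes the
    frequency of `pair`, and the standard deviation over all rounds is returned
    (taking the reported error to be a standard deviation is an assumption).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_frequencies(resampled).get(pair, 0.0))
    return statistics.pstdev(samples)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 6 proteins the estimated errors can be large relative to the values, as in the table.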

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski