Amino acid dipepetide frequency for Biratnagar virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.521AlaAla: 3.521 ± 1.142
1.056AlaCys: 1.056 ± 0.554
2.817AlaAsp: 2.817 ± 1.477
1.408AlaGlu: 1.408 ± 0.738
3.521AlaPhe: 3.521 ± 3.611
1.056AlaGly: 1.056 ± 0.554
2.113AlaHis: 2.113 ± 1.107
3.169AlaIle: 3.169 ± 1.048
3.521AlaLys: 3.521 ± 1.214
5.282AlaLeu: 5.282 ± 2.688
1.056AlaMet: 1.056 ± 1.201
3.521AlaAsn: 3.521 ± 1.214
2.465AlaPro: 2.465 ± 0.977
1.408AlaGln: 1.408 ± 0.738
0.0AlaArg: 0.0 ± 0.0
3.169AlaSer: 3.169 ± 1.661
3.169AlaThr: 3.169 ± 2.152
3.873AlaVal: 3.873 ± 3.745
0.352AlaTrp: 0.352 ± 0.742
2.113AlaTyr: 2.113 ± 0.987
0.0AlaXaa: 0.0 ± 0.0
Cys
0.352CysAla: 0.352 ± 0.185
0.352CysCys: 0.352 ± 0.185
1.761CysAsp: 1.761 ± 0.923
1.056CysGlu: 1.056 ± 0.528
1.408CysPhe: 1.408 ± 0.738
0.352CysGly: 0.352 ± 0.185
1.761CysHis: 1.761 ± 0.923
1.056CysIle: 1.056 ± 1.353
1.408CysLys: 1.408 ± 0.495
2.113CysLeu: 2.113 ± 1.057
0.0CysMet: 0.0 ± 0.0
0.704CysAsn: 0.704 ± 0.369
0.704CysPro: 0.704 ± 1.485
0.704CysGln: 0.704 ± 0.617
1.408CysArg: 1.408 ± 0.738
2.465CysSer: 2.465 ± 0.848
2.113CysThr: 2.113 ± 0.987
2.465CysVal: 2.465 ± 1.007
0.0CysTrp: 0.0 ± 0.0
1.408CysTyr: 1.408 ± 1.104
0.0CysXaa: 0.0 ± 0.0
Asp
1.056AspAla: 1.056 ± 1.353
2.113AspCys: 2.113 ± 1.107
2.465AspAsp: 2.465 ± 1.292
2.113AspGlu: 2.113 ± 1.107
3.169AspPhe: 3.169 ± 0.583
1.761AspGly: 1.761 ± 1.171
2.113AspHis: 2.113 ± 1.107
6.338AspIle: 6.338 ± 2.632
4.577AspLys: 4.577 ± 2.399
5.634AspLeu: 5.634 ± 1.78
2.465AspMet: 2.465 ± 1.425
3.169AspAsn: 3.169 ± 1.585
2.817AspPro: 2.817 ± 1.477
2.465AspGln: 2.465 ± 1.292
3.873AspArg: 3.873 ± 1.136
4.93AspSer: 4.93 ± 0.764
3.873AspThr: 3.873 ± 1.136
7.394AspVal: 7.394 ± 2.598
0.0AspTrp: 0.0 ± 0.0
3.521AspTyr: 3.521 ± 1.531
0.0AspXaa: 0.0 ± 0.0
Glu
2.465GluAla: 2.465 ± 0.744
1.761GluCys: 1.761 ± 1.171
4.577GluAsp: 4.577 ± 2.399
2.817GluGlu: 2.817 ± 1.477
4.225GluPhe: 4.225 ± 2.113
0.704GluGly: 0.704 ± 0.369
1.056GluHis: 1.056 ± 0.528
4.577GluIle: 4.577 ± 2.399
4.577GluLys: 4.577 ± 0.631
7.746GluLeu: 7.746 ± 3.638
1.408GluMet: 1.408 ± 0.738
2.465GluAsn: 2.465 ± 1.292
0.704GluPro: 0.704 ± 0.617
0.352GluGln: 0.352 ± 0.185
2.113GluArg: 2.113 ± 0.987
3.169GluSer: 3.169 ± 1.008
0.352GluThr: 0.352 ± 0.185
2.113GluVal: 2.113 ± 1.006
0.0GluTrp: 0.0 ± 0.0
2.465GluTyr: 2.465 ± 0.977
0.0GluXaa: 0.0 ± 0.0
Phe
3.873PheAla: 3.873 ± 3.466
1.761PheCys: 1.761 ± 0.529
3.521PheAsp: 3.521 ± 1.058
3.873PheGlu: 3.873 ± 1.385
2.465PhePhe: 2.465 ± 0.848
1.761PheGly: 1.761 ± 1.031
1.056PheHis: 1.056 ± 0.528
4.225PheIle: 4.225 ± 2.228
5.986PheLys: 5.986 ± 1.465
3.521PheLeu: 3.521 ± 4.956
0.704PheMet: 0.704 ± 0.617
0.352PheAsn: 0.352 ± 0.185
2.465PhePro: 2.465 ± 0.848
1.761PheGln: 1.761 ± 1.031
2.817PheArg: 2.817 ± 1.477
8.451PheSer: 8.451 ± 0.549
3.169PheThr: 3.169 ± 2.363
6.69PheVal: 6.69 ± 5.61
0.352PheTrp: 0.352 ± 0.185
2.817PheTyr: 2.817 ± 0.991
0.0PheXaa: 0.0 ± 0.0
Gly
2.113GlyAla: 2.113 ± 0.618
0.0GlyCys: 0.0 ± 0.0
3.521GlyAsp: 3.521 ± 0.502
0.352GlyGlu: 0.352 ± 1.448
2.113GlyPhe: 2.113 ± 2.402
0.704GlyGly: 0.704 ± 0.369
1.408GlyHis: 1.408 ± 1.104
2.113GlyIle: 2.113 ± 0.987
3.169GlyLys: 3.169 ± 1.008
1.761GlyLeu: 1.761 ± 0.923
0.352GlyMet: 0.352 ± 0.742
2.465GlyAsn: 2.465 ± 0.977
0.352GlyPro: 0.352 ± 0.185
1.056GlyGln: 1.056 ± 0.528
1.056GlyArg: 1.056 ± 0.554
2.817GlySer: 2.817 ± 1.001
1.408GlyThr: 1.408 ± 1.342
1.761GlyVal: 1.761 ± 1.031
0.0GlyTrp: 0.0 ± 0.0
1.056GlyTyr: 1.056 ± 0.528
0.0GlyXaa: 0.0 ± 0.0
His
1.408HisAla: 1.408 ± 0.495
1.056HisCys: 1.056 ± 0.528
2.113HisAsp: 2.113 ± 0.618
0.704HisGlu: 0.704 ± 0.369
1.056HisPhe: 1.056 ± 0.528
1.056HisGly: 1.056 ± 1.201
0.352HisHis: 0.352 ± 0.185
2.113HisIle: 2.113 ± 1.006
1.056HisLys: 1.056 ± 0.554
1.761HisLeu: 1.761 ± 0.529
0.704HisMet: 0.704 ± 0.369
1.056HisAsn: 1.056 ± 1.201
2.113HisPro: 2.113 ± 0.618
0.352HisGln: 0.352 ± 0.185
1.761HisArg: 1.761 ± 0.923
3.521HisSer: 3.521 ± 1.531
1.761HisThr: 1.761 ± 0.529
3.169HisVal: 3.169 ± 1.661
0.0HisTrp: 0.0 ± 0.0
1.408HisTyr: 1.408 ± 0.738
0.0HisXaa: 0.0 ± 0.0
Ile
4.577IleAla: 4.577 ± 2.399
1.408IleCys: 1.408 ± 0.738
6.338IleAsp: 6.338 ± 3.17
3.169IleGlu: 3.169 ± 1.008
3.169IlePhe: 3.169 ± 0.583
2.817IleGly: 2.817 ± 1.252
1.408IleHis: 1.408 ± 0.738
5.634IleIle: 5.634 ± 0.723
4.577IleLys: 4.577 ± 1.355
4.93IleLeu: 4.93 ± 1.697
2.113IleMet: 2.113 ± 2.586
1.761IleAsn: 1.761 ± 0.923
3.521IlePro: 3.521 ± 0.502
2.113IleGln: 2.113 ± 1.107
3.873IleArg: 3.873 ± 2.03
5.634IleSer: 5.634 ± 2.953
4.577IleThr: 4.577 ± 1.734
4.577IleVal: 4.577 ± 0.631
0.0IleTrp: 0.0 ± 0.0
3.873IleTyr: 3.873 ± 2.184
0.0IleXaa: 0.0 ± 0.0
Lys
1.761LysAla: 1.761 ± 0.923
1.408LysCys: 1.408 ± 0.495
3.521LysAsp: 3.521 ± 0.502
4.225LysGlu: 4.225 ± 1.558
6.338LysPhe: 6.338 ± 0.594
1.761LysGly: 1.761 ± 1.171
0.0LysHis: 0.0 ± 0.0
4.225LysIle: 4.225 ± 1.237
4.577LysLys: 4.577 ± 2.399
7.746LysLeu: 7.746 ± 2.769
1.761LysMet: 1.761 ± 0.622
4.577LysAsn: 4.577 ± 0.631
2.113LysPro: 2.113 ± 1.057
1.056LysGln: 1.056 ± 0.554
3.169LysArg: 3.169 ± 2.128
4.93LysSer: 4.93 ± 1.018
2.817LysThr: 2.817 ± 0.89
5.634LysVal: 5.634 ± 1.661
0.352LysTrp: 0.352 ± 0.185
4.93LysTyr: 4.93 ± 0.764
0.0LysXaa: 0.0 ± 0.0
Leu
2.465LeuAla: 2.465 ± 1.292
2.465LeuCys: 2.465 ± 0.977
5.282LeuAsp: 5.282 ± 0.043
5.282LeuGlu: 5.282 ± 1.63
6.69LeuPhe: 6.69 ± 1.223
5.282LeuGly: 5.282 ± 0.914
1.408LeuHis: 1.408 ± 1.234
9.859LeuIle: 9.859 ± 3.017
6.69LeuLys: 6.69 ± 0.779
7.042LeuLeu: 7.042 ± 1.128
2.817LeuMet: 2.817 ± 1.477
3.521LeuAsn: 3.521 ± 1.214
4.93LeuPro: 4.93 ± 2.185
1.761LeuGln: 1.761 ± 1.134
3.169LeuArg: 3.169 ± 0.583
7.746LeuSer: 7.746 ± 2.197
6.338LeuThr: 6.338 ± 2.632
5.986LeuVal: 5.986 ± 2.319
0.352LeuTrp: 0.352 ± 0.185
2.817LeuTyr: 2.817 ± 1.252
0.0LeuXaa: 0.0 ± 0.0
Met
1.408MetAla: 1.408 ± 0.738
0.352MetCys: 0.352 ± 0.185
2.465MetAsp: 2.465 ± 0.744
0.704MetGlu: 0.704 ± 0.369
1.408MetPhe: 1.408 ± 0.495
0.0MetGly: 0.0 ± 0.0
0.352MetHis: 0.352 ± 0.185
1.056MetIle: 1.056 ± 0.554
1.056MetLys: 1.056 ± 0.554
3.873MetLeu: 3.873 ± 0.482
1.761MetMet: 1.761 ± 0.923
0.704MetAsn: 0.704 ± 0.369
0.352MetPro: 0.352 ± 0.185
0.704MetGln: 0.704 ± 1.317
0.704MetArg: 0.704 ± 0.369
1.056MetSer: 1.056 ± 0.554
1.408MetThr: 1.408 ± 1.234
1.056MetVal: 1.056 ± 1.201
0.0MetTrp: 0.0 ± 0.0
1.761MetTyr: 1.761 ± 1.031
0.0MetXaa: 0.0 ± 0.0
Asn
2.817AsnAla: 2.817 ± 0.89
1.056AsnCys: 1.056 ± 1.201
4.225AsnAsp: 4.225 ± 1.558
1.761AsnGlu: 1.761 ± 0.923
3.169AsnPhe: 3.169 ± 2.128
2.817AsnGly: 2.817 ± 0.89
1.761AsnHis: 1.761 ± 0.923
2.465AsnIle: 2.465 ± 1.292
4.577AsnLys: 4.577 ± 1.355
4.225AsnLeu: 4.225 ± 1.37
0.352AsnMet: 0.352 ± 0.185
1.408AsnAsn: 1.408 ± 0.738
1.761AsnPro: 1.761 ± 0.923
1.056AsnGln: 1.056 ± 0.554
3.169AsnArg: 3.169 ± 2.512
1.761AsnSer: 1.761 ± 0.529
1.761AsnThr: 1.761 ± 1.031
3.521AsnVal: 3.521 ± 1.531
0.704AsnTrp: 0.704 ± 0.369
2.817AsnTyr: 2.817 ± 1.001
0.0AsnXaa: 0.0 ± 0.0
Pro
3.169ProAla: 3.169 ± 0.583
0.0ProCys: 0.0 ± 0.0
3.169ProAsp: 3.169 ± 1.661
4.225ProGlu: 4.225 ± 1.351
2.113ProPhe: 2.113 ± 1.621
1.056ProGly: 1.056 ± 0.554
0.352ProHis: 0.352 ± 0.185
2.817ProIle: 2.817 ± 1.477
2.817ProLys: 2.817 ± 1.658
2.465ProLeu: 2.465 ± 2.233
0.0ProMet: 0.0 ± 0.0
1.761ProAsn: 1.761 ± 0.923
2.817ProPro: 2.817 ± 1.658
2.113ProGln: 2.113 ± 1.107
0.704ProArg: 0.704 ± 0.369
3.873ProSer: 3.873 ± 3.745
4.225ProThr: 4.225 ± 0.53
3.169ProVal: 3.169 ± 0.583
0.352ProTrp: 0.352 ± 0.185
2.817ProTyr: 2.817 ± 0.991
0.0ProXaa: 0.0 ± 0.0
Gln
0.704GlnAla: 0.704 ± 0.369
0.704GlnCys: 0.704 ± 0.617
1.761GlnAsp: 1.761 ± 0.923
0.704GlnGlu: 0.704 ± 0.369
3.169GlnPhe: 3.169 ± 3.604
0.704GlnGly: 0.704 ± 0.369
0.352GlnHis: 0.352 ± 0.185
1.056GlnIle: 1.056 ± 0.528
1.761GlnLys: 1.761 ± 0.529
2.817GlnLeu: 2.817 ± 1.477
0.704GlnMet: 0.704 ± 0.369
2.817GlnAsn: 2.817 ± 0.89
1.056GlnPro: 1.056 ± 0.528
0.352GlnGln: 0.352 ± 0.185
1.761GlnArg: 1.761 ± 0.923
2.465GlnSer: 2.465 ± 1.292
1.761GlnThr: 1.761 ± 0.529
0.704GlnVal: 0.704 ± 0.369
0.0GlnTrp: 0.0 ± 0.0
1.056GlnTyr: 1.056 ± 1.515
0.0GlnXaa: 0.0 ± 0.0
Arg
2.465ArgAla: 2.465 ± 3.83
1.408ArgCys: 1.408 ± 0.738
2.465ArgAsp: 2.465 ± 1.292
2.465ArgGlu: 2.465 ± 1.292
2.817ArgPhe: 2.817 ± 0.89
0.352ArgGly: 0.352 ± 0.185
2.113ArgHis: 2.113 ± 1.107
3.169ArgIle: 3.169 ± 1.048
0.352ArgLys: 0.352 ± 0.185
5.634ArgLeu: 5.634 ± 1.076
1.408ArgMet: 1.408 ± 0.738
2.817ArgAsn: 2.817 ± 1.477
2.113ArgPro: 2.113 ± 2.586
0.352ArgGln: 0.352 ± 0.185
2.113ArgArg: 2.113 ± 0.987
3.169ArgSer: 3.169 ± 1.661
3.873ArgThr: 3.873 ± 3.402
1.761ArgVal: 1.761 ± 2.514
0.704ArgTrp: 0.704 ± 0.369
2.817ArgTyr: 2.817 ± 1.658
0.0ArgXaa: 0.0 ± 0.0
Ser
3.873SerAla: 3.873 ± 0.482
0.352SerCys: 0.352 ± 0.742
3.169SerAsp: 3.169 ± 1.008
4.577SerGlu: 4.577 ± 1.182
3.873SerPhe: 3.873 ± 0.698
1.761SerGly: 1.761 ± 1.171
2.113SerHis: 2.113 ± 1.621
5.634SerIle: 5.634 ± 1.96
4.577SerLys: 4.577 ± 0.631
9.507SerLeu: 9.507 ± 3.145
1.056SerMet: 1.056 ± 0.831
4.93SerAsn: 4.93 ± 3.096
3.873SerPro: 3.873 ± 1.385
1.761SerGln: 1.761 ± 0.529
3.521SerArg: 3.521 ± 1.142
2.817SerSer: 2.817 ± 2.053
6.69SerThr: 6.69 ± 2.483
8.451SerVal: 8.451 ± 1.84
0.704SerTrp: 0.704 ± 0.369
3.521SerTyr: 3.521 ± 0.502
0.0SerXaa: 0.0 ± 0.0
Thr
3.873ThrAla: 3.873 ± 1.385
2.113ThrCys: 2.113 ± 1.057
2.465ThrAsp: 2.465 ± 1.007
4.225ThrGlu: 4.225 ± 0.53
2.817ThrPhe: 2.817 ± 2.053
0.704ThrGly: 0.704 ± 1.692
3.521ThrHis: 3.521 ± 1.214
3.873ThrIle: 3.873 ± 1.136
3.169ThrLys: 3.169 ± 1.067
3.873ThrLeu: 3.873 ± 0.698
1.408ThrMet: 1.408 ± 0.738
3.169ThrAsn: 3.169 ± 1.661
4.225ThrPro: 4.225 ± 2.215
2.817ThrGln: 2.817 ± 1.001
1.761ThrArg: 1.761 ± 0.923
5.634ThrSer: 5.634 ± 3.874
5.282ThrThr: 5.282 ± 4.53
4.93ThrVal: 4.93 ± 2.873
0.352ThrTrp: 0.352 ± 0.742
4.93ThrTyr: 4.93 ± 1.487
0.0ThrXaa: 0.0 ± 0.0
Val
5.634ValAla: 5.634 ± 3.874
2.817ValCys: 2.817 ± 1.658
5.282ValAsp: 5.282 ± 2.769
3.169ValGlu: 3.169 ± 1.048
2.113ValPhe: 2.113 ± 1.621
1.408ValGly: 1.408 ± 2.635
3.169ValHis: 3.169 ± 0.583
3.169ValIle: 3.169 ± 1.661
5.986ValLys: 5.986 ± 2.994
8.099ValLeu: 8.099 ± 0.665
0.704ValMet: 0.704 ± 0.369
3.873ValAsn: 3.873 ± 0.482
3.873ValPro: 3.873 ± 1.897
2.465ValGln: 2.465 ± 0.848
3.521ValArg: 3.521 ± 3.499
5.986ValSer: 5.986 ± 0.606
6.69ValThr: 6.69 ± 2.075
5.986ValVal: 5.986 ± 2.856
0.0ValTrp: 0.0 ± 0.0
3.873ValTyr: 3.873 ± 3.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.352TrpAsp: 0.352 ± 0.185
0.0TrpGlu: 0.0 ± 0.0
1.056TrpPhe: 1.056 ± 0.528
0.352TrpGly: 0.352 ± 0.185
0.0TrpHis: 0.0 ± 0.0
0.704TrpIle: 0.704 ± 0.369
0.352TrpLys: 0.352 ± 0.185
0.704TrpLeu: 0.704 ± 0.369
0.0TrpMet: 0.0 ± 0.0
0.352TrpAsn: 0.352 ± 0.742
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.352TrpTyr: 0.352 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.056TyrAla: 1.056 ± 1.515
1.408TyrCys: 1.408 ± 0.495
4.93TyrAsp: 4.93 ± 1.912
2.817TyrGlu: 2.817 ± 2.906
4.93TyrPhe: 4.93 ± 0.764
3.169TyrGly: 3.169 ± 1.661
2.113TyrHis: 2.113 ± 1.852
2.817TyrIle: 2.817 ± 1.252
2.113TyrLys: 2.113 ± 1.107
3.169TyrLeu: 3.169 ± 2.152
1.056TyrMet: 1.056 ± 0.528
1.761TyrAsn: 1.761 ± 1.171
1.761TyrPro: 1.761 ± 1.967
1.761TyrGln: 1.761 ± 0.529
3.873TyrArg: 3.873 ± 0.698
3.169TyrSer: 3.169 ± 1.008
3.873TyrThr: 3.873 ± 0.698
4.577TyrVal: 4.577 ± 0.631
0.0TyrTrp: 0.0 ± 0.0
5.282TyrTyr: 5.282 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2841 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski