Amino acid dipeptide frequency for Beihai sphaeromadae virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
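As a rough illustration of how such a frequency can be computed, the Python sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter sequence input and the toy sequences are assumptions made for this example; they are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences): pairs are MT, TT, TS and AT, TT.
freqs = dipeptide_permille(["MTTS", "ATT"])
print(freqs["TT"])  # 2 of 5 pairs
```

In this sketch the frequencies of all observed dipeptides sum to 1000 ‰; dipeptides that never occur are simply absent from the result rather than listed as 0.0.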

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
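The arithmetic behind the expected value can be sketched as follows. The page only says that dipeptides occurring "more than expected" are highlighted, so the simple threshold comparison below, and the names `uniform_expectation_permille` and `classify`, are assumptions for illustration rather than the site's actual colouring rule.

```python
def uniform_expectation_permille(n_symbols=20):
    """Expected per mille frequency if all n_symbols**2 dipeptides were equally likely."""
    return 1000.0 / (n_symbols ** 2)


def classify(value_permille, n_symbols=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(n_symbols)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


print(uniform_expectation_permille())  # 400 dipeptides -> 2.5 per mille (0.25%)
print(classify(20.233))                # ThrThr value from the table -> over
print(classify(0.0))                   # CysCys value from the table -> under
```

With Xaa included (`n_symbols=21`) the uniform expectation drops to 1000/441, roughly 2.27 ‰.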

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
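A minimal sketch of the unit conversion is shown below. The helper names are hypothetical, and the total of 1976 dipeptides is an assumption: it presumes that pairs are counted within each of the 2 proteins (1978 amino acids) reported in the statistics line of this page, which gives 1978 - 2 pairs.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


def approx_count(value_permille, total_dipeptides):
    """Approximate raw count behind a per mille value (rounded to nearest integer)."""
    return round(permille_to_fraction(value_permille) * total_dipeptides)


TOTAL_DIPEPTIDES = 1978 - 2  # assumed: one pair fewer than residues per protein

print(approx_count(20.233, TOTAL_DIPEPTIDES))  # ThrThr -> 40
print(approx_count(0.506, TOTAL_DIPEPTIDES))   # smallest non-zero value -> 1
```

Under this assumption the smallest non-zero value in the table, 0.506 ‰, corresponds to a dipeptide seen once.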

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.541 ± 0.301
AlaCys: 0.506 ± 0.555
AlaAsp: 3.541 ± 0.536
AlaGlu: 3.035 ± 0.019
AlaPhe: 4.047 ± 0.253
AlaGly: 4.047 ± 0.253
AlaHis: 2.023 ± 0.292
AlaIle: 4.552 ± 1.645
AlaLys: 5.058 ± 1.984
AlaLeu: 4.552 ± 0.865
AlaMet: 0.0 ± 0.0
AlaAsn: 0.506 ± 0.282
AlaPro: 5.564 ± 0.244
AlaGln: 2.529 ± 0.574
AlaArg: 1.012 ± 0.564
AlaSer: 7.081 ± 0.602
AlaThr: 6.07 ± 0.799
AlaVal: 3.541 ± 1.138
AlaTrp: 1.012 ± 0.564
AlaTyr: 0.506 ± 0.555
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.517 ± 0.846
CysCys: 0.0 ± 0.0
CysAsp: 2.529 ± 0.574
CysGlu: 0.0 ± 0.0
CysPhe: 0.506 ± 0.282
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.506 ± 0.282
CysLeu: 0.506 ± 0.555
CysMet: 0.0 ± 0.0
CysAsn: 0.506 ± 0.282
CysPro: 0.0 ± 0.0
CysGln: 1.012 ± 0.564
CysArg: 1.517 ± 0.827
CysSer: 1.012 ± 0.273
CysThr: 0.506 ± 0.282
CysVal: 1.012 ± 0.564
CysTrp: 0.0 ± 0.0
CysTyr: 0.506 ± 0.282
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.529 ± 0.574
AspCys: 1.012 ± 0.564
AspAsp: 2.529 ± 0.574
AspGlu: 3.035 ± 0.019
AspPhe: 3.035 ± 0.019
AspGly: 2.529 ± 0.263
AspHis: 0.0 ± 0.0
AspIle: 2.529 ± 0.263
AspLys: 1.517 ± 0.846
AspLeu: 2.529 ± 1.41
AspMet: 0.506 ± 0.282
AspAsn: 2.023 ± 1.128
AspPro: 5.058 ± 0.311
AspGln: 2.023 ± 1.128
AspArg: 1.517 ± 0.846
AspSer: 8.599 ± 6.918
AspThr: 6.07 ± 0.038
AspVal: 5.058 ± 1.363
AspTrp: 1.517 ± 0.827
AspTyr: 2.023 ± 0.545
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.529 ± 0.263
GluCys: 1.012 ± 0.564
GluAsp: 2.529 ± 1.1
GluGlu: 3.541 ± 1.372
GluPhe: 1.012 ± 0.273
GluGly: 5.058 ± 0.311
GluHis: 1.012 ± 0.564
GluIle: 3.541 ± 1.138
GluLys: 2.529 ± 0.574
GluLeu: 4.047 ± 2.256
GluMet: 0.506 ± 0.282
GluAsn: 1.517 ± 0.827
GluPro: 0.506 ± 0.555
GluGln: 1.012 ± 0.564
GluArg: 1.517 ± 0.846
GluSer: 6.07 ± 1.635
GluThr: 3.541 ± 0.301
GluVal: 1.517 ± 0.01
GluTrp: 2.023 ± 0.292
GluTyr: 1.012 ± 0.564
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.035 ± 0.019
PheCys: 0.506 ± 0.282
PheAsp: 4.047 ± 0.583
PheGlu: 2.529 ± 0.263
PhePhe: 1.517 ± 0.01
PheGly: 2.529 ± 0.574
PheHis: 1.012 ± 0.273
PheIle: 1.012 ± 1.109
PheLys: 1.012 ± 0.273
PheLeu: 5.564 ± 2.266
PheMet: 0.506 ± 0.555
PheAsn: 2.023 ± 1.128
PhePro: 1.517 ± 0.846
PheGln: 2.529 ± 0.574
PheArg: 3.035 ± 0.019
PheSer: 3.541 ± 0.301
PheThr: 2.529 ± 1.1
PheVal: 3.035 ± 0.818
PheTrp: 0.0 ± 0.0
PheTyr: 2.023 ± 1.128
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.541 ± 1.138
GlyCys: 1.012 ± 0.273
GlyAsp: 3.541 ± 0.301
GlyGlu: 0.506 ± 0.282
GlyPhe: 1.517 ± 0.827
GlyGly: 4.047 ± 1.09
GlyHis: 1.012 ± 0.273
GlyIle: 1.517 ± 0.01
GlyLys: 4.047 ± 1.42
GlyLeu: 6.07 ± 0.038
GlyMet: 1.517 ± 0.846
GlyAsn: 2.529 ± 0.263
GlyPro: 4.552 ± 0.808
GlyGln: 1.012 ± 0.273
GlyArg: 4.047 ± 0.253
GlySer: 5.564 ± 1.917
GlyThr: 5.058 ± 1.147
GlyVal: 5.058 ± 0.526
GlyTrp: 1.517 ± 0.01
GlyTyr: 2.023 ± 1.382
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.517 ± 0.846
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.012 ± 0.273
HisPhe: 0.0 ± 0.0
HisGly: 0.506 ± 0.282
HisHis: 0.506 ± 0.282
HisIle: 0.0 ± 0.0
HisLys: 0.506 ± 0.282
HisLeu: 1.517 ± 0.846
HisMet: 0.506 ± 0.218
HisAsn: 1.517 ± 0.01
HisPro: 1.517 ± 0.846
HisGln: 1.517 ± 0.01
HisArg: 0.506 ± 0.282
HisSer: 2.023 ± 0.292
HisThr: 0.506 ± 0.282
HisVal: 0.506 ± 0.282
HisTrp: 0.506 ± 0.282
HisTyr: 0.506 ± 0.555
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.047 ± 0.253
IleCys: 0.506 ± 0.555
IleAsp: 1.517 ± 0.827
IleGlu: 4.047 ± 0.253
IlePhe: 0.506 ± 0.282
IleGly: 2.529 ± 2.773
IleHis: 1.012 ± 0.564
IleIle: 0.506 ± 0.555
IleLys: 2.023 ± 0.545
IleLeu: 4.552 ± 0.029
IleMet: 1.012 ± 0.564
IleAsn: 1.012 ± 0.564
IlePro: 4.047 ± 0.253
IleGln: 2.023 ± 0.292
IleArg: 7.081 ± 0.234
IleSer: 2.023 ± 0.292
IleThr: 2.529 ± 1.1
IleVal: 2.529 ± 0.263
IleTrp: 0.506 ± 0.282
IleTyr: 3.541 ± 1.372
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.035 ± 0.856
LysCys: 1.517 ± 0.846
LysAsp: 2.023 ± 0.292
LysGlu: 2.529 ± 0.574
LysPhe: 1.012 ± 0.564
LysGly: 3.541 ± 0.301
LysHis: 1.517 ± 0.846
LysIle: 1.517 ± 0.827
LysLys: 1.517 ± 0.846
LysLeu: 5.058 ± 0.311
LysMet: 1.012 ± 0.564
LysAsn: 1.012 ± 0.273
LysPro: 3.035 ± 0.818
LysGln: 1.517 ± 0.01
LysArg: 2.023 ± 0.545
LysSer: 5.058 ± 1.147
LysThr: 8.093 ± 0.507
LysVal: 3.541 ± 1.138
LysTrp: 0.0 ± 0.0
LysTyr: 1.012 ± 1.109
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.576 ± 0.32
LeuCys: 1.012 ± 0.564
LeuAsp: 5.058 ± 0.526
LeuGlu: 4.047 ± 0.253
LeuPhe: 4.552 ± 1.702
LeuGly: 7.587 ± 1.721
LeuHis: 1.012 ± 0.564
LeuIle: 4.552 ± 0.808
LeuLys: 2.023 ± 0.292
LeuLeu: 4.047 ± 0.253
LeuMet: 1.517 ± 0.601
LeuAsn: 0.506 ± 0.282
LeuPro: 7.081 ± 1.071
LeuGln: 2.023 ± 2.218
LeuArg: 1.012 ± 0.273
LeuSer: 8.599 ± 3.122
LeuThr: 4.047 ± 0.583
LeuVal: 3.035 ± 0.019
LeuTrp: 0.506 ± 0.282
LeuTyr: 2.529 ± 0.263
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.023 ± 1.128
MetCys: 0.0 ± 0.0
MetAsp: 2.023 ± 1.128
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.506 ± 0.282
MetLys: 0.506 ± 0.282
MetLeu: 1.012 ± 0.564
MetMet: 0.506 ± 0.282
MetAsn: 1.012 ± 0.273
MetPro: 1.517 ± 1.664
MetGln: 2.023 ± 1.128
MetArg: 0.506 ± 0.282
MetSer: 1.012 ± 1.109
MetThr: 0.506 ± 0.555
MetVal: 0.506 ± 0.282
MetTrp: 0.0 ± 0.0
MetTyr: 1.517 ± 0.846
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.529 ± 1.1
AsnCys: 0.0 ± 0.0
AsnAsp: 1.012 ± 1.109
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.023 ± 0.292
AsnGly: 3.035 ± 1.692
AsnHis: 0.0 ± 0.0
AsnIle: 1.012 ± 0.273
AsnLys: 2.529 ± 1.1
AsnLeu: 4.047 ± 0.253
AsnMet: 0.0 ± 0.0
AsnAsn: 0.506 ± 0.555
AsnPro: 4.047 ± 0.583
AsnGln: 2.529 ± 0.574
AsnArg: 1.517 ± 0.846
AsnSer: 5.058 ± 1.147
AsnThr: 1.012 ± 0.564
AsnVal: 2.023 ± 0.292
AsnTrp: 1.012 ± 0.564
AsnTyr: 1.012 ± 0.564
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.576 ± 0.32
ProCys: 0.506 ± 0.555
ProAsp: 6.576 ± 1.157
ProGlu: 5.058 ± 0.311
ProPhe: 3.035 ± 0.856
ProGly: 2.529 ± 2.773
ProHis: 1.012 ± 0.564
ProIle: 5.564 ± 0.244
ProLys: 4.047 ± 0.253
ProLeu: 2.529 ± 0.263
ProMet: 1.012 ± 0.564
ProAsn: 3.035 ± 0.856
ProPro: 9.105 ± 4.24
ProGln: 4.552 ± 0.029
ProArg: 4.047 ± 2.256
ProSer: 6.07 ± 1.635
ProThr: 6.576 ± 1.157
ProVal: 5.564 ± 0.244
ProTrp: 0.0 ± 0.0
ProTyr: 1.012 ± 0.273
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.529 ± 0.574
GlnCys: 1.012 ± 0.564
GlnAsp: 2.023 ± 0.292
GlnGlu: 1.517 ± 0.01
GlnPhe: 3.035 ± 0.019
GlnGly: 2.023 ± 1.382
GlnHis: 0.506 ± 0.282
GlnIle: 0.506 ± 0.555
GlnLys: 1.517 ± 0.01
GlnLeu: 1.012 ± 0.564
GlnMet: 1.517 ± 0.01
GlnAsn: 1.517 ± 0.827
GlnPro: 2.529 ± 0.263
GlnGln: 0.0 ± 0.0
GlnArg: 5.058 ± 1.984
GlnSer: 5.564 ± 1.081
GlnThr: 6.07 ± 1.635
GlnVal: 3.541 ± 1.974
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.529 ± 1.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.541 ± 0.536
ArgCys: 0.506 ± 0.282
ArgAsp: 1.517 ± 0.846
ArgGlu: 1.517 ± 0.01
ArgPhe: 4.047 ± 1.42
ArgGly: 1.517 ± 0.846
ArgHis: 1.517 ± 0.01
ArgIle: 6.07 ± 0.038
ArgLys: 1.012 ± 1.109
ArgLeu: 4.552 ± 0.865
ArgMet: 0.506 ± 0.555
ArgAsn: 2.023 ± 0.292
ArgPro: 3.541 ± 1.974
ArgGln: 2.529 ± 0.263
ArgArg: 5.058 ± 1.363
ArgSer: 3.035 ± 0.019
ArgThr: 2.529 ± 1.41
ArgVal: 4.047 ± 1.09
ArgTrp: 1.012 ± 0.564
ArgTyr: 3.035 ± 0.856
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.552 ± 1.645
SerCys: 1.517 ± 0.01
SerAsp: 3.541 ± 0.536
SerGlu: 3.035 ± 1.692
SerPhe: 6.576 ± 0.32
SerGly: 7.587 ± 0.048
SerHis: 1.012 ± 0.564
SerIle: 3.541 ± 1.372
SerLys: 8.093 ± 1.166
SerLeu: 5.058 ± 1.147
SerMet: 0.506 ± 0.555
SerAsn: 3.035 ± 0.856
SerPro: 7.081 ± 0.602
SerGln: 5.564 ± 1.917
SerArg: 6.07 ± 0.038
SerSer: 9.105 ± 2.453
SerThr: 11.634 ± 1.042
SerVal: 9.611 ± 4.681
SerTrp: 1.517 ± 0.846
SerTyr: 2.529 ± 1.1
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.035 ± 1.654
ThrCys: 0.506 ± 0.282
ThrAsp: 4.047 ± 0.253
ThrGlu: 3.541 ± 0.301
ThrPhe: 3.541 ± 0.301
ThrGly: 5.058 ± 0.311
ThrHis: 1.012 ± 0.564
ThrIle: 3.035 ± 1.692
ThrLys: 4.047 ± 0.253
ThrLeu: 9.611 ± 6.354
ThrMet: 2.023 ± 0.292
ThrAsn: 3.541 ± 0.536
ThrPro: 7.081 ± 0.602
ThrGln: 4.047 ± 0.253
ThrArg: 5.058 ± 1.147
ThrSer: 9.105 ± 0.779
ThrThr: 20.233 ± 13.817
ThrVal: 5.058 ± 0.311
ThrTrp: 0.506 ± 0.555
ThrTyr: 4.047 ± 0.253
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.047 ± 0.583
ValCys: 0.506 ± 0.282
ValAsp: 4.552 ± 2.481
ValGlu: 3.035 ± 0.856
ValPhe: 2.023 ± 0.545
ValGly: 2.529 ± 0.574
ValHis: 0.506 ± 0.282
ValIle: 6.07 ± 1.635
ValLys: 3.541 ± 1.138
ValLeu: 1.517 ± 0.827
ValMet: 1.517 ± 0.01
ValAsn: 3.035 ± 0.856
ValPro: 8.093 ± 2.84
ValGln: 2.529 ± 1.41
ValArg: 1.517 ± 0.827
ValSer: 7.587 ± 0.884
ValThr: 6.576 ± 4.7
ValVal: 2.023 ± 0.292
ValTrp: 1.012 ± 0.273
ValTyr: 1.012 ± 1.109
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.012 ± 0.564
TrpCys: 0.0 ± 0.0
TrpAsp: 1.012 ± 0.564
TrpGlu: 1.517 ± 0.01
TrpPhe: 1.012 ± 1.109
TrpGly: 0.0 ± 0.0
TrpHis: 0.506 ± 0.555
TrpIle: 0.0 ± 0.0
TrpLys: 0.506 ± 0.555
TrpLeu: 2.023 ± 1.128
TrpMet: 0.0 ± 0.0
TrpAsn: 1.012 ± 0.564
TrpPro: 0.506 ± 0.282
TrpGln: 1.012 ± 0.273
TrpArg: 0.0 ± 0.0
TrpSer: 0.506 ± 0.282
TrpThr: 1.012 ± 0.564
TrpVal: 0.506 ± 0.282
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.012 ± 0.564
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.012 ± 0.273
TyrCys: 0.0 ± 0.0
TyrAsp: 2.023 ± 1.382
TyrGlu: 2.529 ± 0.574
TyrPhe: 0.506 ± 0.282
TyrGly: 3.035 ± 0.818
TyrHis: 0.0 ± 0.0
TyrIle: 1.517 ± 0.01
TyrLys: 3.035 ± 0.818
TyrLeu: 2.023 ± 0.292
TyrMet: 0.0 ± 0.0
TyrAsn: 3.035 ± 0.818
TyrPro: 2.023 ± 1.382
TyrGln: 2.023 ± 0.292
TyrArg: 1.012 ± 0.273
TyrSer: 4.552 ± 1.702
TyrThr: 3.035 ± 0.019
TyrVal: 1.517 ± 0.01
TyrTrp: 0.506 ± 0.282
TyrTyr: 2.529 ± 0.263
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1978 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
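One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the documented Proteome-pI implementation; the function names, the use of the population standard deviation, the fixed seed and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs counted within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome of two hypothetical proteins.
print(bootstrap_error(["MTTS", "AAAA"], "TT"))
```

With only 2 proteins, as in this proteome, each resample can take just a few distinct compositions, which may explain why some errors in the table are as large as the values themselves.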

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski